Gene Mvan_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1139 
Symbol 
ID4646703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1211328 
End bp1212980 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content66% 
IMG OID639804638 
Productzinc finger SWIM domain-containing protein 
Protein accessionYP_951981 
Protein GI120402152 
COG category[S] Function unknown 
COG ID[COG4715] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.256553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTAT CGGAGACGAC GCTGCTTGGC GTCGCCGGCG ACCGGGTCTT TGCCCGCGGT 
GAGGACTACG TCCGCTACGT CCGAGGTCTG CGGGTCACCA CCGACAAGGC CCATGCGTCG
GTCCAGGCCA AGAGGGTCTA CACCGTGGAA CTGGACTGGT CCGGCCCGCT ACCCGCCGGC
TCCTGTACCT GCCTTCATCA CGCAGACGGT CACTTCTGCA AGCATCTGGT CGCGGTCGGG
CTCGCGGTGA TCGACCGCAG CGAAGGCACT GCCGACGTCA CGCCGGAATC TGCGGTGCAG
GCAGCGGTGG AGGCGATGGA TGTCCACGAG CTTCGCGACC TCGTCATGAT GCTCGCACAG
CGTGACGGCG AAGTTCGCCG CATGCTGGAG GTACGTGCGA CGACTGAGTC CGGGGACGAC
ACCGCCGCCA AAGTCGAGTT TGAGAGCTAC GTGCGAAATG CGTTGCAGTT CTATGGCTTC
ATCGACTACC GGGAATCGTA CGCGGTGGCT GAAACGGCCG GACAAGTGCT CGACGAGCTG
GAGAATCACT TGAACGATGG GGCCGCCGAG ATCGTGCGGC CGGCCCTGCT GTCCGCGTTG
ACGCTGCTGC GCTCCATCAC CGAGCATGCC GACGACTCGT CGGGAGCTAT CGGTGGTGAA
TGCCAGCGGG CGGCCGAGTT GTACGCGCGG GCCTGTCGGC AGGGTGCGCC CGACCCGGTC
GAACTCGCCA TCTGGTTGGT GAGGTTCCGT GCTGACTCGC CGGGATGGCC GGATCTGGCG
TTGGCGGACT TTGTCGACGC GTTCGACGAC CACGCGCTCG CTGTCTACCG CCGGGCGGTA
TCCGAACTCG ATCACAAACT CGCAGAAGGC GATCGTTGGA ACCGCTTCGA AGTCGACACC
ATGCTGCTGG AGCTCGCTGA CCACGACGGC GATGTCGATC GGGCGGTCCA GCTGCTGAAC
GAGCGCGAAC ATCCTCAGTA CGGGGCGATC ATCGCCCGGT TGCGGGCAGC CGGGCGTGAC
GACGAGATCG ACACGTGGAT CGACCGCGCC GTTGCGGAGG GACGCGTCAG CGGGCACAGC
GGCGGCAACG CGTACTGGCT GAGCCCCACC GACGTCGCCC TGACGTATCA AGAACGAGGC
CGCGTCGAGG ACGCGATCGC TGTCCTGCGC GCCGACTTCG TCAGGCAGCC CTCGGTCCGT
TCCTACCAGG CGCTGTGCGG CTTCGCCGCA GGCATCGACC GCGGCGAAAC TGAACGCGAG
TGGGCGTTCG ATCATGCGCG GCAGCTGGCT TCGGACCGTC CGGGGGGCGG CGCGGTTCTG
GTGCAGCTGT TGTTGAGCGA AGGCGATGTC GATGCCGCCT GGGAGGCCGC CGACCGATAC
GGCCCCGGCG GGGCGTGGAA GGAGCTCGCC GACCGCGGCG CCGATGCGCG CCCCGTAGCC
GCTGCCGACC TGTACCGGCC GCAACTCAAG GAGGACCTTC GGCACGCCAA CACCAGGCTC
TATCCCGGTA TCGCCGCGAC TTTGGCGACC ATGGCCGAGC TCTACGAACG TGGCGGGCGC
AGTGACGATT TCGCGGTTCT TATCGCGCAG ATCCGCCAGG ATTACGGTCG GCGACCGTCG
CTGATGAAGG CGCTCAAGGC CAAAGGCCTC TGA
 
Protein sequence
MPLSETTLLG VAGDRVFARG EDYVRYVRGL RVTTDKAHAS VQAKRVYTVE LDWSGPLPAG 
SCTCLHHADG HFCKHLVAVG LAVIDRSEGT ADVTPESAVQ AAVEAMDVHE LRDLVMMLAQ
RDGEVRRMLE VRATTESGDD TAAKVEFESY VRNALQFYGF IDYRESYAVA ETAGQVLDEL
ENHLNDGAAE IVRPALLSAL TLLRSITEHA DDSSGAIGGE CQRAAELYAR ACRQGAPDPV
ELAIWLVRFR ADSPGWPDLA LADFVDAFDD HALAVYRRAV SELDHKLAEG DRWNRFEVDT
MLLELADHDG DVDRAVQLLN EREHPQYGAI IARLRAAGRD DEIDTWIDRA VAEGRVSGHS
GGNAYWLSPT DVALTYQERG RVEDAIAVLR ADFVRQPSVR SYQALCGFAA GIDRGETERE
WAFDHARQLA SDRPGGGAVL VQLLLSEGDV DAAWEAADRY GPGGAWKELA DRGADARPVA
AADLYRPQLK EDLRHANTRL YPGIAATLAT MAELYERGGR SDDFAVLIAQ IRQDYGRRPS
LMKALKAKGL