Gene Mvan_5735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5735 
Symbol 
ID4644190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6125602 
End bp6126927 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content71% 
IMG OID639809211 
Producthypothetical protein 
Protein accessionYP_956506 
Protein GI120406677 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.139677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGG CATTCGGCGA TTGGATCGTG CACCGCCGCT GGTACGCCGG CCGCAGCCGC 
GAACTCGTCT CGGCCGAGCC TGCGGTGGTG ACCCCGCTGC GCGATGACCT CGACCACATC
CTGCTCGACG TGACCTACAC CGACGGCACC GTCGAGCGCT ATCAACTCGT GGTCAGGTGG
GCCGACAGTC CGGTGGCCGG CTTCGGTGAA GCCGCCACCA TCGGCACCGC CCTGGGCCCG
CAGGGGGAAC GGATCGCCTA CGACGCGCTG TTCGACCCCG ACGCCGCCCG CCATCTGCTG
CGCCTGGTCG ATGCGTCGGC CACCGTCGCC GATCTGAGGT TCACCAGGGA ACCGGGTGCC
ACGCTGCCGC TGTACGCGCC GCCGAAGGTG TCGAGCGCCG AGCAGAGCAA CACCAGCGTG
ATCTTCGGAA AAGACGCCAT GCTCAAGGTG TTCCGCCGGG TGACGCCGGG CATCAACCCC
GATATCGAGC TCAACCGGGT GCTCGCCCAG GCGGGCAATC GGCACGTCGC AAGGCTCCTC
GGTTCGTTCG AGACGTCGTG GGCGGGTCCG GGCACGGACC GCTGCGCGCT CGGCATGGTG
ACGGCCTTCG CCGCGAACAG CGCCGAAGGC TGGGACATGG CCACGGCCAG TGCCCGCGAG
ATGTTCGCCG ACGTGGTGGG CAGCGACTTC GCCGACGAGT CCTACCGGCT CGGGAACGCG
GTGGCCTCGG TGCACGCCAC CCTCGCCGAA GCCCTCGGTA CCTCGACCGA GCCGTTCCCG
GTCGACACCG TGCTGGCCCG GCTGCAGTCG GCCGCACGGT CCGCGCCGGA GCTCGCGGGC
CGCGCCGCGG CGGTCGAGGA ACGATACCGA CGGCTCGACG GGCGGGCGAT CACCGTGCAG
CGGGTACACG GCGACCTGCA TCTCGGTCAG GTGCTGCGCA CCCCGGACGA CTGGTTGCTC
ATCGACTTCG AAGGTGAACC CGGCCAACCG CTGGACGAAC GCAGGCGGCC GGACTCGCCG
CTGCGCGACG TGGCCGGCGT GCTGCGGTCC TTCGAGTACG CGGCCTACCA GAAGCTGGTG
GAGCTGGCCC CCGAACAGGA CGCCGACGGT CGACTCGCGG ACAGGGCGCG CAACTGGGTG
GACCGCAACA GCGCCGCGTT CTGCGCCGGG TACGCGGCGG TCGCAGGGGA CGACCCGCGC
CGGGACGGCG ACGTGCTGGC TGCCTACGAG CTCGACAAGG CGGTGTACGA AGCCGCTTAC
GAGGCCCGTT TCCGGCCGTC CTGGTTGCCC ATCCCGATGA GATCGATCGA CCGCATCCTG
GGCTGA
 
Protein sequence
MTLAFGDWIV HRRWYAGRSR ELVSAEPAVV TPLRDDLDHI LLDVTYTDGT VERYQLVVRW 
ADSPVAGFGE AATIGTALGP QGERIAYDAL FDPDAARHLL RLVDASATVA DLRFTREPGA
TLPLYAPPKV SSAEQSNTSV IFGKDAMLKV FRRVTPGINP DIELNRVLAQ AGNRHVARLL
GSFETSWAGP GTDRCALGMV TAFAANSAEG WDMATASARE MFADVVGSDF ADESYRLGNA
VASVHATLAE ALGTSTEPFP VDTVLARLQS AARSAPELAG RAAAVEERYR RLDGRAITVQ
RVHGDLHLGQ VLRTPDDWLL IDFEGEPGQP LDERRRPDSP LRDVAGVLRS FEYAAYQKLV
ELAPEQDADG RLADRARNWV DRNSAAFCAG YAAVAGDDPR RDGDVLAAYE LDKAVYEAAY
EARFRPSWLP IPMRSIDRIL G