Gene Mvan_5601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5601 
Symbol 
ID4646145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5979147 
End bp5980097 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content65% 
IMG OID639809074 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_956372 
Protein GI120406543 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGCT CCCGACGAAC CAGGCGCGCA GCTGTGGCGG TTGCCATCGC GCTTCTCGCA 
GCGGTGCTCA GCGCCTGCGG CAGCTCCAAT CCACTCGGTG GCGGCGAGAT CTCCGGTGAC
CTCAAGTCGA TCAAGGTGGG TTCGGCGGAC TTCACCGAAT CGAAGATCAT CGCCGAGATC
TACGCCCAGG CGCTAGAGGC CAACGGGTTC ACGATCTCCC GCCAGTTCGG TATCGGCAGC
CGCGAGACGT ACATCCCCGC GGTGCGGGAC CACTCGATCG ACCTGATCCC GGAGTACACC
GGCAACCTGC TGCAGTACTT CGACCCCGAG AGCGCTGCCA CGACACCGGA TTCGGTGCTG
CTCGGCCTGT TGAAGGCGCT TCCCGGCGAC CTGTCGATCC TGTATCCGTC GCCCGCGGAG
GACAAGGACA CCCTCGCGGT GTCGGCGGAG ACCGCGCAGC GCTGGAACCT GAAGTCGATC
GCAGACCTGG CTGCACATTC CGCTGAGGTG AAAGTCGGTG CGCCGTCGGA GTTTCAGACC
CGGCAGACCG GTCTGGTAGG GCTCAAGGAG AAGTACGGCC TGGACATCGC GCCGGCGAAC
TTCGTCGCGA TCAGCGACGG CGGCGGTCCC GCGACGGTCA AGGCGCTGAC CGACGGAACG
GTCACCGCGG CCAACATCTT CAGCACGTCA CCGGCGATCG AACGCAGCGC GCTGGTGGTG
TTGGAGGATC CGAAGAACGT GTTCCTGGCC GCGAACGTGG TGCCGCTGGT GGCCTCGCAG
AAGATGTCGA ACGAACTCAA GACCGTGCTG GACGCCGTCA GTGCCAAGCT GACCACCGAG
GCCCTGATCG AGTTGAACAC CTCGGTCGAG GGCAATCAGG GAGTCGACCC CGACGAGGCG
GCGCGGAAGT GGATATCCGA CAACGGCTTC GACACGCCCA TCGGGAAGTA G
 
Protein sequence
MICSRRTRRA AVAVAIALLA AVLSACGSSN PLGGGEISGD LKSIKVGSAD FTESKIIAEI 
YAQALEANGF TISRQFGIGS RETYIPAVRD HSIDLIPEYT GNLLQYFDPE SAATTPDSVL
LGLLKALPGD LSILYPSPAE DKDTLAVSAE TAQRWNLKSI ADLAAHSAEV KVGAPSEFQT
RQTGLVGLKE KYGLDIAPAN FVAISDGGGP ATVKALTDGT VTAANIFSTS PAIERSALVV
LEDPKNVFLA ANVVPLVASQ KMSNELKTVL DAVSAKLTTE ALIELNTSVE GNQGVDPDEA
ARKWISDNGF DTPIGK