Gene Mvan_2624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2624 
Symbol 
ID4643394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2772249 
End bp2773583 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content66% 
IMG OID639806106 
Productextracellular solute-binding protein 
Protein accessionYP_953438 
Protein GI120403609 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATCA CGCAATGGCC CACTTCGGGG CGGAGAAGCG GGCGCCCAGG CACCGCGTTC 
TCCGCCGTGA TGGCACTGGT CGCCGTCCTG GCCCTGGTGT TGACCGGGTG CGCGGGCAGC
GGCGGGCCCG AACAGGCCGA AGCCACCGGC ACCGGCGAGG TCTCCACCGA CGTCTCGGGC
ACCGTGCGGA TCCTGATGGA GAACGTGCCG GACACCGACA TCGTCAAGTC CATGGTGGCC
GACTTCAACA AGGAATACCC GGGCGTCGAG ATCAACATCG AGTCGCTGAC GTTCGATCAG
ATGCGCGACA AACTCGTGTC CTCGTTCCAG TCCTCGTCGC CCACCTACGA CCTGATCGTC
GTCGACAACC CGTGGATGGT CGACTTCGCC AACGCGAAGT TCCTGCAGCC CCTCGATGCC
CGCATCGACA GCACCCCGGA CTACGACGCC GCCGACTTCT TCAAGCCGCT CACCGACATC
ACCACTGTCG ACGGAGCCCG CTACGGTGTG CCGTTCTACA ACTACGCGCT CGGATACCTT
TACAACGCCG ACGATCTCAC GGCCGCCAAC CAGCAGGTGC CGACGACCCT CGACGAGCTG
GTCAGCACCA GCAAGGCGCT CAAGAGCGGC GACCGCGCCG GCATCGCGAT GCAGCCGCAG
CGTGGCTACA AGATCTTCGA AGAGTGGGGC AACTGGCTGT TCGCCGCGGG CGGATCGATC
TACGACGCCG ACGGCAAGAT CACGCTGAAC ACGCCGGAAG CCAAGCGGGC ACTCGAGGCT
TACATCGACA CCTACAACAC CGCCGCGCCG GCCAACAGCC TGAGCTGGGG CATGGACGAG
GCGCAGCGTT CGGTGTCGGC GAACCAGGCC GCGTCGATGA TCAATTACAA CTGGCAGCTG
CCCGCCCTCA ACGAACCGGG CTCCGGGCCG GCCGCAGGCA AGATCAAGCT CGCCACCATC
CCCGGCGGCA AGCAGGTACT GGGCTCATGG AGCTGGGCGA TCCCGGCCAA TTCGGCCACA
CCCGACGCGG CATGGGCGTT CGTCTCGTGG ATCACCGCCA AGCCCAACGA TGTCGTGCGC
ACCGAGAAGG GCGGCGCCGC GATCCGGCAG AGCACACTGC AGGACCCGGC CGTGCTGGGC
GGACAGTTCG GCGAGGAGTA CTACCGGACC GTCGAGCAGC TGCTTGCCAA CGCGGCTCCG
CTGACCCAGG GGCCCAGCGG TGAGGAGATG ATCCAGGCAG TCGGCACCGA GCTCAACGAA
GCGGTCGCCG GCAAGAAGAG CGTCGACGAC GCACTGGCCG CCGCACAGGC CGAGGCAGAG
AAGATCCAAG GCTAG
 
Protein sequence
MRITQWPTSG RRSGRPGTAF SAVMALVAVL ALVLTGCAGS GGPEQAEATG TGEVSTDVSG 
TVRILMENVP DTDIVKSMVA DFNKEYPGVE INIESLTFDQ MRDKLVSSFQ SSSPTYDLIV
VDNPWMVDFA NAKFLQPLDA RIDSTPDYDA ADFFKPLTDI TTVDGARYGV PFYNYALGYL
YNADDLTAAN QQVPTTLDEL VSTSKALKSG DRAGIAMQPQ RGYKIFEEWG NWLFAAGGSI
YDADGKITLN TPEAKRALEA YIDTYNTAAP ANSLSWGMDE AQRSVSANQA ASMINYNWQL
PALNEPGSGP AAGKIKLATI PGGKQVLGSW SWAIPANSAT PDAAWAFVSW ITAKPNDVVR
TEKGGAAIRQ STLQDPAVLG GQFGEEYYRT VEQLLANAAP LTQGPSGEEM IQAVGTELNE
AVAGKKSVDD ALAAAQAEAE KIQG