Gene Mvan_5839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5839 
Symbol 
ID4643421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6225506 
End bp6226867 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content67% 
IMG OID639809315 
Productgeneral substrate transporter 
Protein accessionYP_956610 
Protein GI120406781 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.450465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTATGC CCACCACCCA CGAAGACCCA CCGCCGGGAG TCCTGAAGAA AGCCATCTCC 
GCATCGGCGA TCGGCAACGC GACCGAATGG TTCGACTACG GGATCTACGC CTACGGGGTC
TCCTACATCT CCGCGGCGAT CTTCCCCGGG GACGCCGCGA ACGCGACGCT GCTGGCCTTG
ATGACGTTCG CCGTCTCGTT CCTGGTCCGT CCGCTCGGCG GATTCGTCTG GGGGCCGCTC
GGTGACCGTC TGGGCCGCAA GCGGGTGCTG GCCATCACCA TCGTGCTGAT GGCCGGCGCC
ACGTTCTGCG TGGCCCTGGT GCCCACCTAC GACATGATCG GCATGTGGGC GCCGTTCCTG
CTGGTGCTGC TCCGGATGAT CCAGGGCTTC TCGACCGGCG GTGAGTACGG CGGCGCCGCG
ACGTTCATGG CCGAGTACGC GCCGAACAGG CGCCGCGGGC TGCTCGGCAG TTTCCTCGAG
TTCGGCACCC TCGGCGGGTT CTCTCTGGGC GCGTTGCTGA TGCTGGGTTG CTCGCTGGTT
CTGGGCGACG AGCAGATGCA CGCGTGGGGC TGGCGGCTGC CGTTCCTGGT GGCCGCGCCG
CTCGGCCTGA TCGGTCTCTA TCTGCGATCG CGGCTGGAGG ACACCCCGGT GTTCCGGGAG
CTCGAAGCGA AGGGTGGGAC GGAACCGGAG ACCACCACCC AGTTCCGCGA CCTGCTGGGC
CGGTACTGGC GACCGATCCT GCAGCTCGGC GGACTGGTGG TCGCGCTGAA CGTCGTGAAC
TACACCCTGC TGTCCTACAT GCCGACGTAC CTGGAGAACA GGATCGGGCT GTCACCGGAC
CAGTCGCTGA TCGTGCCCGT GATCGGCATG CTGTCGATGA TGGTCTTCGT CCCATTCGCC
GGGCTGCTCA GCGATCGGGT GGGACGGAAA CCGCTGTGGT GGTTCTCGTT GATCGGCCTG
TTCGTCGCCG GCGTGCCGAT GTTCCTGTTG ATGGGCACCA ACCTGTGGGG TGCGGTGATC
GGCTTCGCCG TCCTCGGCCT GCTGTACGTG CCACAGCTGG CGACGATCTC GGCGACGTTC
CCGGCGATGT TCCCCACCCA GGTGCGCTAC GCCGGATTCG CCATCGCCTA CAACGTGTCG
ACGTCGCTGT TCGGCGGCAC CGCGCCGGCG ATCAATCAGT GGCTCACCGG CGAAACCGGC
GACCTGCTGT TCCCGGCGTA CTACATGATG GGCGCGTGCG TCATCGGCGC CATCGCGCTG
ATCAAGGTGC CCGAGACCGC ACGTTGTCCG ATCGGCGGCA CCGTCACGCC CGGCACGGAG
GAGGCGGCGG ATCCGGTGCC GTTCGAGAAG CAGAACGCCT GA
 
Protein sequence
MGMPTTHEDP PPGVLKKAIS ASAIGNATEW FDYGIYAYGV SYISAAIFPG DAANATLLAL 
MTFAVSFLVR PLGGFVWGPL GDRLGRKRVL AITIVLMAGA TFCVALVPTY DMIGMWAPFL
LVLLRMIQGF STGGEYGGAA TFMAEYAPNR RRGLLGSFLE FGTLGGFSLG ALLMLGCSLV
LGDEQMHAWG WRLPFLVAAP LGLIGLYLRS RLEDTPVFRE LEAKGGTEPE TTTQFRDLLG
RYWRPILQLG GLVVALNVVN YTLLSYMPTY LENRIGLSPD QSLIVPVIGM LSMMVFVPFA
GLLSDRVGRK PLWWFSLIGL FVAGVPMFLL MGTNLWGAVI GFAVLGLLYV PQLATISATF
PAMFPTQVRY AGFAIAYNVS TSLFGGTAPA INQWLTGETG DLLFPAYYMM GACVIGAIAL
IKVPETARCP IGGTVTPGTE EAADPVPFEK QNA