Gene Mvan_4890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4890 
Symbol 
ID4648823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5236428 
End bp5237639 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID639808361 
Productmajor facilitator superfamily transporter 
Protein accessionYP_955669 
Protein GI120405840 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACA CCCTCCATCG CCCGTCCGGC CCCGTGTCGG TGGCCGCACC CGATCCGGTG 
GTGCGCCGGC TCGCCGTGGT CGCGCTCGCA CTCGGCGGAT TCGGGATCGG CACCACCGAA
TTCGTGGCAA TGGGTCTGCT TCCCGACATC GCGACCGGTA TGGGTGTCTC CGAACCCACA
GCCGGCCATG TCATTTCGGC CTACGCCCTC GGGGTCGTGG TCGGTGCCCC GGTGATCGCG
GCCGTCACGG CGCGGATGGC GCGGCGCAAG CTCCTGCTGG CGCTGATGGC GTTGTTCACC
ATCGGCAACC TGGCCAGCAT GCTGGCGCCG ACCTACGAGA CATTGATCGC GGCCCGGTTC
CTCGCGGGCC TGCCGCACGG CGCGTACTTC GGTGTCGCGG CCCTGGTCGC CGCGCACCTC
ATGGGTCCGC AGAACCGGGC CAAGGCGGTC GCCCACGTAC TGACCGGCCT GACGGTCGCC
ACGGTGCTCG GTGTACCGAT CGCGTCGTGG CTCGGCCAAT CCCTGGGCTG GCGAGCAGCT
TTCGGGTTGG TGGTGGGTGT CGGCCTGGTC ACCTTGACGG CGCTGTGGTG CTGGCTGCCG
TTCCAGCTGA AGTTCATGCG GGCCACCAGC CCGCTCACCG AACTCGGCGC GCTGCGCCGC
CCCCAGGTGT GGCTGGCGCT TCTCGTCGGG ATGATCGGCT TCGGCGGCAT GTTCGCCGTC
TACACCTACA TCACCACCAC CATGACCGAT GTGGCGGGCA TGCCGCGTGG CCTCGTCCCG
TTGGCGCTGA TGATGTTCGG CCTCGGCATG GTGCTCGGCA ACCTCGTCGG CGGCCGGCTG
GCCGACGGTT CGGTGGTCCG TGCGCTCTAC CTGTCACTGG GTGCGTTGTG CGGTGCGCTC
GCCCTCTTCG TCGTCGCGTC GCACAACCCG TGGACCGCGC TGCTGGTGCT GTTCCTCATC
GGGCTCACCG GTTCGGCGGT CGGCCCGGCG CTGCAGACCC GGCTGATGGA CGTCGCGCAC
GACGCGCAGA CTCTGGCTGC GGCGCTGAAT CATTCGGCGC TCAACATCGG CAACGCGACG
GGCGCGTGGG TCGGTGGCCT GGTGATCGCC GCGGGTCTCG GCTACACCGC CCCTGCCGCA
GCGGGCGCGG TGCTGGCGCT CGCCGGTCTC GCGGTGCTCA CGGTCTCGGT CCTGCTGCAG
AAACGCGGCT GA
 
Protein sequence
MTDTLHRPSG PVSVAAPDPV VRRLAVVALA LGGFGIGTTE FVAMGLLPDI ATGMGVSEPT 
AGHVISAYAL GVVVGAPVIA AVTARMARRK LLLALMALFT IGNLASMLAP TYETLIAARF
LAGLPHGAYF GVAALVAAHL MGPQNRAKAV AHVLTGLTVA TVLGVPIASW LGQSLGWRAA
FGLVVGVGLV TLTALWCWLP FQLKFMRATS PLTELGALRR PQVWLALLVG MIGFGGMFAV
YTYITTTMTD VAGMPRGLVP LALMMFGLGM VLGNLVGGRL ADGSVVRALY LSLGALCGAL
ALFVVASHNP WTALLVLFLI GLTGSAVGPA LQTRLMDVAH DAQTLAAALN HSALNIGNAT
GAWVGGLVIA AGLGYTAPAA AGAVLALAGL AVLTVSVLLQ KRG