Gene Mvan_5224 details

Gene Information

Locus tag: Mvan_5224
Symbol:
ID: 4644325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5593845
End bp: 5595071
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 73%
IMG OID: 639808699
Product: major facilitator superfamily transporter
Protein accession: YP_956001
Protein GI: 120406172
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
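
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both are assumptions that happen to agree with the listed values, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5593845
end_bp = 5595071

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)       # 1227
print(gene_length_bp % 3)   # 0, i.e. a whole number of codons
print(protein_length_aa)    # 408
```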


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.469666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.280666
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGTCG TCGCGGTCGC GGCGGCGACC ACCCGCCCGA GTGGTCGCAG CGCCCGCCGT 
TGGCTCGCGG TTGCCGCGGC GACCTTCGCC ATCGCGTGGG GCGGAAACGA GTTCACCCCG
CTGCTGGTGA TGTATCGGAC CCAGGACGGC TTCTCCGCGC TGACCGTCGA TCTGCTGTTG
TTCGCCTACG TGCTCGGCAT CGTGCCTGCG CTGCTCATCG GTGGGCCGCT GTCCGACCGC
TTCGGTCGCC GGCCGCTGAT GTTGCCCGCG CCGGTGCTCG CCGCCGTCGG GTCGGCGATC
CTGGCGCTCG GCGCACAGTC GGCGCCGGTG CTGGGAGTCG GACGGGTGTT CAGCGGCGTC
GCCCTCGGCC TCGCGATGGC CGTCGGCGGC AGTTGGATCA AGGAGCTGTC CAGCCCGCCC
TGGGAGGACG GTGACGCGGG CGCCCGTCGC GCCGCGATGA GTCTGACCGC CGGGTTCGGG
CTGGGCGCCG GCACCGCAGG TGTGCTCGCC GAATGGGGTC CGGCGCCGAC GGTTCTTCCC
TATGCGGTCA ACATCGCGAT GGCGCTCGCC GCGGCGGTGT TTGTGAGCAC CGCGCCCGAG
ACACGGACCC GCCACGACTC GGGCCGCCCG TGGTGGACGG ACCTCGCGGT TCCGGGTGCG
TCGCACCGCC GCTTCCTCTT GGTGGTCGTC CCCGTCGCTC CGTGGGTGTT CGGCGCGGGA
GCGACGGCCT ATGCGGTGCT GCCCGCATTG ATGGCGGGGC GGGTGTCGTC GGCGCCCATC
GCGTTCTCGG CGTTGATGTG CCTCGTCGCG CTCGGCGTCG GGTTCACCGT CCAGCAGTTG
GGCCGCCACC TGGGCGCCGG TGGGCGCCGC GGAGTGGTCA CCGCGCTGGC GCTGCTGGTC
GTCGGGATGC TGCTGGCCGG CTGGGCGGCG GCGGTGTTGA CGGTGTGGTC GGCACTGGTG
GCGGCGGCGG TGCTCGGTGC CGGCTACGGG ATGGCGCTGT TGGCGGGCCT GCAGGAGATC
CAGCGCATCG CCGGCCCCGA CGACCTCGCC GGTCTGACCG CGGTGTTCTA CAGCCTCAGC
TACCTGGGCT TCGCGGTGCC TGCGGTGCTG GCGTTCGCGG TGCGATCGTT CAGCTATCCG
GCGATGTTCG GCTTCGGGGC GTTCGCCGCG GCGGTGTGTC TGCTCGTCGC GGTGCTGGGA
TCCCGGCGGA CGGCCGCAAT AAGCTGA
 
Protein sequence
MTVVAVAAAT TRPSGRSARR WLAVAAATFA IAWGGNEFTP LLVMYRTQDG FSALTVDLLL 
FAYVLGIVPA LLIGGPLSDR FGRRPLMLPA PVLAAVGSAI LALGAQSAPV LGVGRVFSGV
ALGLAMAVGG SWIKELSSPP WEDGDAGARR AAMSLTAGFG LGAGTAGVLA EWGPAPTVLP
YAVNIAMALA AAVFVSTAPE TRTRHDSGRP WWTDLAVPGA SHRRFLLVVV PVAPWVFGAG
ATAYAVLPAL MAGRVSSAPI AFSALMCLVA LGVGFTVQQL GRHLGAGGRR GVVTALALLV
VGMLLAGWAA AVLTVWSALV AAAVLGAGYG MALLAGLQEI QRIAGPDDLA GLTAVFYSLS
YLGFAVPAVL AFAVRSFSYP AMFGFGAFAA AVCLLVAVLG SRRTAAIS
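
The relationship between the two sequences can be illustrated with a small translation sketch. It uses only the Python standard library and is not the tool that produced this record. The helper name `translate_cds` is hypothetical. The code assumes the amino acid assignments of translation table 11 (the same as the standard code) and treats an initiator codon from the table 11 start set, such as the GTG that opens this gene, as methionine.

```python
from itertools import product

# Codon table in TCAG order; table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGGTCG TCGCGGTCGC GGCGGCGACC"))  # MTVVAVAAAT
```

Passing the full gene sequence should reproduce the protein sequence listed above, ending at the terminal TGA stop codon.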