Gene Mvan_5748 details


Gene Information

Locus tag: Mvan_5748
Symbol:
ID: 4644203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6136199
End bp: 6137524
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 67%
IMG OID: 639809224
Product: general substrate transporter
Protein accession: YP_956519
Protein GI: 120406690
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
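
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6136199
END_BP = 6137524
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True when the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1326 bp span gives 442 codons, one of which is the stop codon, which agrees with the listed protein length of 441 aa.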


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.438038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.252794
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACCG ACATCCGCCG GGTGGTGACC GGGGCGTCGA TCGGCAATGC CGTCGAGTGG 
TTCGACTTCG CCATCTACGG ATTCCTCGCG ACGTTCATCG CCGCCCACTT CTTCCCGGCG
GGCAACGACA CCGCCGCGCT GCTCAACACC TTCGCGATCT TCGCCGCGGC ATTCTTCATG
CGACCGCTCG GCGGCTTCTT CTTCGGCCCG CTCGGCGACC GCATCGGCAG GCAGAAGGTG
CTCGCGGTGG TGATCCTGCT GATGTCGGCG GCCACACTCG GTATCGGGCT GCTGCCCACC
TATGAGGCCA TCGGTGTGGC CGCGCCGATG CTGTTGCTGG TCCTGCGGTG CCTGCAGGGC
TTCTCCGCGG GCGGTGAATA CGGTGGCGGC GCGGTCTACC TGGCGGAGTT CGCCAGCGAC
GCGCGCCGCG GCCTGACCAT CACGTTCATG GCCTGGTCCG GGGTGCTGGG CTTCCTGATC
GGCTCGGTCA CGGTGACCCT GCTGCAGGCG CTGCTGCCCG CCGCGGCGAT GGAGAGCTAC
GGCTGGCGCA TCCCGTTCCT GATCGCAGGC CCGCTCGGGC TGGTCGGTCT CTACATCAGG
CTGCGCCTCG GCGACACCCC GCAGTTCGCC GAACTCGACA AGGCCGAGAA GACCGCCGAC
TCGCCGCTGC GGGAGGCCGT CACCACCTCG TGGCGACAGA TCATCCAGGT GATCGGTCTC
TTCATCGTCT TCAACATCGG CTACTACGTT GTATTCACGT TCCTGCCAAC CTATTTCATC
AAAACACTGA GGTTCTCGAA GTCGGAGGCT TTCGTCTCGA TAACGCTGGC CTGCCTGGTG
GCGCTGATCC TGATCCTGCC GCTGGCGGCG CTGTCGGACC GGATCGGCCG GCGTCCGCTG
CTCATCGGCG GCGCGGTGTC GTTCGCCGTG CTCGGCTACC CGTTGTTCCT GCTGCTGACC
TCCGGGTCGC TGGTCGCGGC GATCACTGCG CACTGCCTGC TCGCGGCGAT CGCGTCGGTG
TACATCTCCA GTGCGGTGTC GGCGGGCGTC GAGTTGTTCG CGACCCGGAT CCGGTTCAGC
GGGTTCTCCG TCGGCTACAA CGTCTGCGTC GCGGTGTTCG GCGGCACCAC GCCCTACGTC
GTCACCTGGC TGACGGCTGC CAGCGGCAAC GCGATCGCGC CCGCGTTCTA TCTGATCGCG
GCCGCCGTCG TCTCACTGGC CGCCGTGCTC ACCCTGCGGG AGTCGGCCGG TCGCGCACTG
GCACAGGTCC AGGAGCGCCC GGCTAATGTG GGCTCAGGCA TCCACGAAGG GGAGATGCGT
AGATGA
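
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be joined before any computation. A minimal sketch of how the GC content field could be recomputed from such a listing is shown below. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listed above, used here as sample input.
first_row = "ATGGAAACCG ACATCCGCCG GGTGGTGACC GGGGCGTCGA TCGGCAATGC CGTCGAGTGG"
print(len(clean_sequence(first_row)))
print(round(100 * gc_fraction(first_row)))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information section.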
 
Protein sequence
METDIRRVVT GASIGNAVEW FDFAIYGFLA TFIAAHFFPA GNDTAALLNT FAIFAAAFFM 
RPLGGFFFGP LGDRIGRQKV LAVVILLMSA ATLGIGLLPT YEAIGVAAPM LLLVLRCLQG
FSAGGEYGGG AVYLAEFASD ARRGLTITFM AWSGVLGFLI GSVTVTLLQA LLPAAAMESY
GWRIPFLIAG PLGLVGLYIR LRLGDTPQFA ELDKAEKTAD SPLREAVTTS WRQIIQVIGL
FIVFNIGYYV VFTFLPTYFI KTLRFSKSEA FVSITLACLV ALILILPLAA LSDRIGRRPL
LIGGAVSFAV LGYPLFLLLT SGSLVAAITA HCLLAAIASV YISSAVSAGV ELFATRIRFS
GFSVGYNVCV AVFGGTTPYV VTWLTAASGN AIAPAFYLIA AAVVSLAAVL TLRESAGRAL
AQVQERPANV GSGIHEGEMR R
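
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases. It assumes the standard codon assignments, which translation table 11 shares for sense codons (table 11 differs mainly in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text):
    """Translate a DNA sequence from its first base, stopping at the first stop codon."""
    dna = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGAAACCG ACATCCGCCG GGTGGTGACC"))
```

The result for these 30 bases can be compared with the first block of the protein sequence, `METDIRRVVT`. A dedicated library such as Biopython would be a more robust choice for real analyses, since this sketch does not handle ambiguous bases or alternative start codons.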