Gene Dtpsy_2000 details

Gene Information

Locus tag: Dtpsy_2000
Symbol:
ID: 7382119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 2137886
End bp: 2139292
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 72%
IMG OID: 643655318
Product: major facilitator superfamily MFS_1
Protein accession: YP_002553456
Protein GI: 222111192
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
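
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2137886, 2139292)
    print(length, protein_length_aa(length))
```

Under those assumptions the values agree with the Gene Length and Protein Length fields listed for Dtpsy_2000.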


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0919237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCT TACCCGTGGG TGCGCCGCCC CTGGCCGACG GCCTGCCGCA GCCCGCACGC 
CGCCAGGCGA TGCTGGTCAT CGTTCTCGGG CTCACGCTGG CGGTGCTCGA CAGCAGCATC
GTCAACCTGG CGCTGCCCGA CATCGCACGC CAGTTGCAGT CGGGCGCGGC GCAGAGCGTG
TGGGTGGTCA ACGCCTACCA GCTCGCCACG CTGGTCGTGC TGCTGCCGCT GGCCGCGCTG
GGTGAGCGCG TGGGCTACCG GCGCGTGTAC TTGGTGGGCA TGGCGCTGTT CGCGCTGGCG
TCGGTGGGCG CCATGCTGGC GGCCAGCATG CCCGCCTTGA TTGCCGCGCG GGCGTTGCAG
GGCCTGGGCG CCGCGGGCGT GATGGCCGTG AATGCGGCGC TGGTGCGCCT GATCTATCCG
CGCGCACAGC TGGGCCACGG CATGGCCATC AATTCGCTGG TGGTGGCCAC CGCGTCCATG
GCGGGGCCGT CGGTGGCGGC GGCCATCCTG TCGGTGGCGT CGTGGCCCTG GCTGTTTGCC
ATGAACCTGC CCTTGGGCGT GGGAGTCTGG TGGCTGGGGT GGCGCGCGCT GCCGGTCAAT
CCTCCATCCG CCAACCATGC GCCGCGCTTT TCCGCCATCG ACGTGCTACT CAACGGCGCC
ATGTTCACGC TGCTGTTCCT GGGCGGGGAG CAACTGGGCG TGCGCAGCGC GGCGCAGGGC
GGCAGCGCGG CTACGGGTGC GATCCTGCTG GCCGCGGGTG TGGCCGTGGG GGCGGTGTTC
CTGTGGCGCC AGCGCGGCTT GGCGGCGCCG CTGTTTCCGG TGGACCTGCT GCGCATTCCG
GTGTTCGCCC TGTCGATGGG CTCGTCCGTG GGGGCGTTCT GCGCGCAGAT GCTGGGCTTT
CTGTCGCTGC CCTTCCTGCT GTTGGAGGCG CAGGGCCGCA CCCACTTGGA GGCCGGACTG
CTCATTACGG CCTGGCCCCT GGCCACCGCC GTGGTGGCGC CGCTGGCGGG CCGATTGATT
GGCCGCTACC CGGACGGCCT GCTCGGCGGC ATTGGCATGG CGGTGTTTGC TGCCGGCCTG
GTCTCGCTGG GCCTGATGCC CGCGCAGCCC GCGGACTGGA ACGTGGCCTG GCGCATGGCG
CTGTGCGGTG CGGGCTTTGC GCTGTTTCAG TCGCCCAACA ATCACACCAT CGTCACCTCG
GCCCCGCTGC ACCGCAGCGG CGCGGCCAGC GGCATGCTGG GCACCGCGCG CCTGACGGGC
CAGACACTGG GCGCCGTGTC GCTCGCGGCC ATCTTCGCCC TGCGGCCGGG GCACGATGGA
AGCGCGGAGT CGCTGGCACT GCTGGTGGCA GGGGCGTGCG CGGTGGTGGC AGGGGTGTGC
AGCTCGCTGC GGGTGAGGCA GCGGTAA
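
The gene sequence is shown in space-separated groups of 10 bases, so it needs to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and how the source database rounds its GC content value is not stated here.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence, used only as a short example.
    first_line = "ATGAGCGGCT TACCCGTGGG TGCGCCGCCC CTGGCCGACG GCCTGCCGCA GCCCGCACGC"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two helpers over the full block would allow the result to be compared with the Gene Length and GC content fields.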
 
Protein sequence
MSGLPVGAPP LADGLPQPAR RQAMLVIVLG LTLAVLDSSI VNLALPDIAR QLQSGAAQSV 
WVVNAYQLAT LVVLLPLAAL GERVGYRRVY LVGMALFALA SVGAMLAASM PALIAARALQ
GLGAAGVMAV NAALVRLIYP RAQLGHGMAI NSLVVATASM AGPSVAAAIL SVASWPWLFA
MNLPLGVGVW WLGWRALPVN PPSANHAPRF SAIDVLLNGA MFTLLFLGGE QLGVRSAAQG
GSAATGAILL AAGVAVGAVF LWRQRGLAAP LFPVDLLRIP VFALSMGSSV GAFCAQMLGF
LSLPFLLLEA QGRTHLEAGL LITAWPLATA VVAPLAGRLI GRYPDGLLGG IGMAVFAAGL
VSLGLMPAQP ADWNVAWRMA LCGAGFALFQ SPNNHTIVTS APLHRSGAAS GMLGTARLTG
QTLGAVSLAA IFALRPGHDG SAESLALLVA GACAVVAGVC SSLRVRQR
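
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The function name and the use of "X" for unrecognized codons are illustrative choices, not part of the source record.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the grouped display format
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and the final displayed line of the gene sequence.
    print(translate("ATGAGCGGCT TACCCGTGGG TGCGCCGCCC"))
    print(translate("AGCTCGCTGC GGGTGAGGCA GCGGTAA"))
```

The two fragments translate to the first ten and last eight residues of the protein sequence shown above.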