Gene Dtpsy_2011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_2011 
Symbol 
ID7385003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2148740 
End bp2150098 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content71% 
IMG OID643655329 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002553467 
Protein GI222111203 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.805115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGCC GGCCGCAGCC GGCGCACACG GCCCACGGCA GCCGTTACAA CCAAGGGTCG 
CATTCACCCG GCCCCACGCC CGTCGCTCCC ATGTCCTCTC AAGATTCCTC ACCGGCCCGC
TGGTACTTCG GCTGGAACAT CGTGGCCGCG GCCACCGTGC TCACCGCCCT CACCGTGGGC
CTGCGCCTGG GGCTGGGCCC GCTGTTCCTG CCCATGACCG AGGATCTGGG CTTTTCGCGC
AGCCTGCTGT CGGCCATCGT CGCCGTGGGC ATGCTGTGCT ACGGCGCGGC CATGCCGCTG
GCGGGCTGGC TGGTGGCGCG TCGGGGCACG CGCTTCGTGC TGCTGGCGGG CACGGTCCTG
CTGGTGGCGT CGACACTCTG GGCAGTGAAC ACGCGCACGC CGCTGGGGCT GCTGCTGAGC
TTCGGCATCC TCATGTCCGT GGGCGCGGGA TTCACCAGCC CGGTGGCACT CACGCCGGTC
ATCAGCCGCT GGTTCAACCG CCGCCGGGGC ATGGCGCTGT TCTTCTTGTC CACGGGCTCC
ATGGCCGGCA TCGCCATCAT GACGCCCGCA CTGGGCATCG CGCTGCAGCA TCTGAGCTGG
CAGGCCACGC TGCTGGGCTT TGCCGCGCTG TTCACCGTCA TCACCGTGCC CACGGCGTTG
TGGGTGATCC GCGACGAGGC GCCTGCCGAT GGCGATGCCC CGCCGCCCGG CAGCCCCGTT
CGTGCGGCGA ACCCGGTGGC TCCCGCAGGG CAGCACTACA CGGTGCTGCA GGCCATGCGC
ACCGGCACCT TCCTCAAGAT CACGCTGGGC CTGTTCGCCT GCGGCTTCAG CATGAACCTG
CTGGGCACGC ACGGCATGCC CATGCTGATG GACCATGGCT TTGACGCCAC CACCAGCGCC
TTCGGCATCG GCCTGATCGG GCTGGTGGCC ATTCCCAGCA CCATGGTGCT GGGGCGCCTG
GCCGACCGGC TGCCGCGGCG CAAGCTGCTG GCCGCCATCT ATTGCGTGCG CGGCGTGGGG
TTCTTCGCGC TGCTGCTGGC GGGCAATACC CTGGAGCTGT ACGGCACCTC GGTGATCGGC
GGCATTGCCT GGGCGGGCAG CATCGCGCTG TCGTCCGCCA TCCTGGCCGA CATGTTCGGC
GTGCGGCTGG TGGGCGTGCT CTACGGCTGG GCCTACCTGG GGCATCAGGT GGGGGCGATG
ATCAGCGCCT GGCTGGGCGG CTGGGGCTAT GAGCATTTCG GCACGCACTG GATCGCCTTC
GGCCTGTCGG GCGCACTGCT GATGGTGGCC GCGGGGGTGG CGCTGCTGCT GCCGGGGCGG
CGCCCCCCGG CGCTGCCCAC GCCCCAGGCG GCGGGCTGA
 
Protein sequence
MRRRPQPAHT AHGSRYNQGS HSPGPTPVAP MSSQDSSPAR WYFGWNIVAA ATVLTALTVG 
LRLGLGPLFL PMTEDLGFSR SLLSAIVAVG MLCYGAAMPL AGWLVARRGT RFVLLAGTVL
LVASTLWAVN TRTPLGLLLS FGILMSVGAG FTSPVALTPV ISRWFNRRRG MALFFLSTGS
MAGIAIMTPA LGIALQHLSW QATLLGFAAL FTVITVPTAL WVIRDEAPAD GDAPPPGSPV
RAANPVAPAG QHYTVLQAMR TGTFLKITLG LFACGFSMNL LGTHGMPMLM DHGFDATTSA
FGIGLIGLVA IPSTMVLGRL ADRLPRRKLL AAIYCVRGVG FFALLLAGNT LELYGTSVIG
GIAWAGSIAL SSAILADMFG VRLVGVLYGW AYLGHQVGAM ISAWLGGWGY EHFGTHWIAF
GLSGALLMVA AGVALLLPGR RPPALPTPQA AG