Gene Dtpsy_1156 details


Gene Information

Locus tag: Dtpsy_1156
Symbol:
ID: 7384588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 1218944
End bp: 1220125
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 643654477
Product: major facilitator superfamily MFS_1
Protein accession: YP_002552629
Protein GI: 222110365
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
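
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1218944, 1220125)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```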


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0686413
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACC AGAGCATCAC GCGCCGAAAC ATCGGCGTTC TCACCGCCGC GCAGGCGCTC 
GGCGGCGCCA GCGCGCCCAT CGTGATGTCG CTGGGTGGGC TGGTCGGCCA GCAGCTTGCC
AAGAATTCGG CCTGGATCAC CTTGCCCGTG AGTCTGTTCG GCCTGGGTCT TGCCATCGGC
ACCTTGCCTG CCGCCTTCAT CATGCGGCAC CATGGCCGCC GCAACGGATA CGTGGTGGGG
GTCGGCTTCG GCGTGGCCTC GGGCCTGATC GCCGCGTTGG GCATCATGCT GGCCTCGTTC
TGGATCTTCT GCGCCGGCAC CTTCCTGGCG GGCTTCTACG GCGCGTATGT GCAGAGCTAC
CGCTTCGCAG CCGCCGACAC CGCCGAGGAC GCGCTTAAGG CCAAGGCCAT TTCCTGGGTC
ATGGTGGGCG GTCTCGCGGG CGCCATCATC GGGCCGCAGT TGGTGATCTT CACGCGCGAT
GCGGTAGCGG GCACGCCCTA CGTTGGCAGC TTCCTCAGCC AGGCGCTGCT GCCGCTGATC
GCCTTGCCGA TCCTGCTGAT GCTGCGCACG CCGAGCCAGA CCCAGGCCGA AGCAGTCGCC
GATAGCGGTC GGACGGTGCT GCAGCTCTTG GCGATGCCGC GCTATCTGCT GGCCGTGGCT
GCGGGCGTGG TGTCCTATGG GGTGATGGCG TTCGTGATGA CGGCCGCGCC GGTGGCGATG
GTCAACCACG GGCATTCGGT GGACAACGCC GCCCTAGGAA TACAGTGGCA CCTGCTGGCG
ATGTTCGGGC CGAGCTTCTT CACCGGGCGA CTGATGGTGC GCTACGGCAA GGAGCGCGTG
ACCGCCGTCG GCATGGTGCT GCTCGCCGCC TCCGGGGTGG TGGCCCTGGG CGGGCTCGGC
CTGTCCCACT TCTGGGGCTC GCTGGCGCTG TTGGGCATCG GCTGGAATTT GAGTTTCATC
GGCGCCACGG CGATGGTCAC CGACTGCCAC ACCCCGGCCG AGCGGGGCAA GGCGCAGGGC
ATGAACGACT TCTTCGTCTT CGCCGCCACG GCGGCCGTGT CGTTCCTCGC GGGGTCGATC
CTGCACAGCT CGGGCTGGCA AGCGGTCAAC TGGATGATCT TCCCGGCCTT GGCGCTGATC
TTGGTGCCGC TGCTGTGGCA GGGGCGGTAC GGTTGCAACT GA
 
Protein sequence
MTDQSITRRN IGVLTAAQAL GGASAPIVMS LGGLVGQQLA KNSAWITLPV SLFGLGLAIG 
TLPAAFIMRH HGRRNGYVVG VGFGVASGLI AALGIMLASF WIFCAGTFLA GFYGAYVQSY
RFAAADTAED ALKAKAISWV MVGGLAGAII GPQLVIFTRD AVAGTPYVGS FLSQALLPLI
ALPILLMLRT PSQTQAEAVA DSGRTVLQLL AMPRYLLAVA AGVVSYGVMA FVMTAAPVAM
VNHGHSVDNA ALGIQWHLLA MFGPSFFTGR LMVRYGKERV TAVGMVLLAA SGVVALGGLG
LSHFWGSLAL LGIGWNLSFI GATAMVTDCH TPAERGKAQG MNDFFVFAAT AAVSFLAGSI
LHSSGWQAVN WMIFPALALI LVPLLWQGRY GCN
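
The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons are permitted.

```python
from itertools import product

# Standard codon table, built in the conventional T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final blocks of the gene sequence above.
print(translate("ATGACCGACC AGAGCATCAC GCGCCGAAAC"))  # MTDQSITRRN
print(translate("GGTTGCAACT GA"))                     # GCN
```

Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared against the GC content and protein sequence listed in this record.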