Gene Dtpsy_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_2006 
Symbol 
ID7384998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2143883 
End bp2144899 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID643655324 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002553462 
Protein GI222111198 
COG category[R] General function prediction only 
COG ID[COG2130] Putative NADP-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.17413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGCA ACCAACAGAT CCTCCTCGAC AACCGCCCCC AGGGTGAAGC CACGGTGGGC 
AACTTCCGCC TGGTCACCAC CGATACGCCT GCACTGCAGG ACGGCCAGGT GCTGGTGCGC
CACCATTACC TGAGCCTGGA CCCCTACATG CGCGGGCGCA TGAACGACAG CAAGAGCTAC
GCCGCCAGCC AGCCGCTGGG CGAGGTGATG ATCGGCGGGA CGGTGGGCGA GGTCGTCGAG
AGCCGCCACC CGGTCTATGC CGTGGGCGAC AAGGTGGTCG GCATGGGCGG CTGGCAGGAG
TACAGCGTGG CCGACGGCAA CACGCCCGGC ATGCTGCGCA AGGTGGATAC CACCCACGTG
CCGCTGTCGG CCTACCTGGG GGCCGTGGGC ATGCCCGGCG TGACCGCCTG GTACGGCCTG
GTGAAGATCA TCGCCCCGAA GGCGGGCGAG ACCGTGGTCG TGAGCGCCGC CAGCGGCGCC
GTGGGCAGTG CCTTCGGCGC GCTGGCCAAG GCGCGCGGCT GCCGCGTGGT GGGCATTGCC
GGCGGCCCGG ACAAGTGCCG CTATGTGACT GAAGAGCTGG GCTTTGACGC CTGCATCGAC
CACCGTGCGA ACGGCGATTT GAAGAGCATG GCCCGCGCGC TCAAGGAAGC CTGCCCGGAC
GGCATCGACG GCTACTTCGA GAACGTGGGC GGCTACATCC TGGACGCCGT GCTGCTGCGC
GCCAACGCGT TCGCCCGCGT GGCCGTGTGC GGCATGATCG CCGGCTACGA CGGCCAGCCG
CTGCCACTGC AGAACCCGGC GCTCATCCTC ATCAACCGCA TGAAGGTCGA GGGCTTCATC
GTCAGCGAAC ACATGGAGGT GTGGCCCGAG GCGCTCAAGG AACTGGGAGG CCTGGCGGCC
AGCGGCAAGC TGCGCCCGCG CGAGACCATC GCCCAGGGCC TGGCCGCGGC GCCCGAGGCG
TTCCTGGGCC TGCTCAAGGG CAAGAACTTC GGCAAGCAAC TGGTGAAGCT GGTTTGA
 
Protein sequence
MPRNQQILLD NRPQGEATVG NFRLVTTDTP ALQDGQVLVR HHYLSLDPYM RGRMNDSKSY 
AASQPLGEVM IGGTVGEVVE SRHPVYAVGD KVVGMGGWQE YSVADGNTPG MLRKVDTTHV
PLSAYLGAVG MPGVTAWYGL VKIIAPKAGE TVVVSAASGA VGSAFGALAK ARGCRVVGIA
GGPDKCRYVT EELGFDACID HRANGDLKSM ARALKEACPD GIDGYFENVG GYILDAVLLR
ANAFARVAVC GMIAGYDGQP LPLQNPALIL INRMKVEGFI VSEHMEVWPE ALKELGGLAA
SGKLRPRETI AQGLAAAPEA FLGLLKGKNF GKQLVKLV