Gene EcE24377A_2293 details


Gene Information

Locus tag: EcE24377A_2293
Symbol: pduQ
ID: 5586231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2257806
End bp: 2258918
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 51%
IMG OID: 640925958
Product: propanediol utilization protein PduQ
Protein accession: YP_001463353
Protein GI: 157158598
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
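
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
START_BP, END_BP = 2257806, 2258918

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1113
print(protein_length_aa(length))  # 370
```

Both results agree with the listed Gene Length (1113 bp) and Protein Length (370 aa).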


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACTT TTTCACTGCA GACGCGTTTA TACAGCGGAC CGGGAAGTCT CAAGGTAATT 
AGCCGTTTCT CTCATAAACA CATCTGGATT ATTTGCGACG TCTTTCTGGC ACGCTCACCA
CTCATTAACA CGTTGCATCA GGCGCTGCCC GACAGTAATC GTCTTAGTAT TTTTAGCGAA
ATAACGCCCG ACCCAACTAT CCAGACCGTG GTGAAAGGCA TTGCGCAAAT GCAGACTTTA
CGCCCGGATG TGGTTATCGG TTTTGGCGGC GGTTCTGCCC TGGACGCCGC AAAAGCGATT
GTCTGGTTCG GTCGCCAGTG TGGCATTGAA ATTGAAACCT GCGTAGCCAT TCCTACCACC
AGCGGTACCG GTTCCGAAGT GACCAGTGCT TGTGTCATCA GCGATCCTGA GAAAGGGATT
AAATATCCCC TTTTCGATAA TGCACTTTAT CCGGATATCG CGATCCTTGA TCCATCTCTG
ATCGTCAGCG TGCCACCGGC GATAACGGCC AATACCGGGA TGGATGTTTT AACCCATGCG
CTTGAAGCTT ATGTTTCTCC TCGCGCCAGC GACTTCACGG ATGCACTGGT CGAAAAAGCC
GTGCAAATTG TTTTTCAATA CCTGCCCACT GCAGTGAAAA AAGGCGATTG TCTTGCCACC
CGAGGCAAAA TGCATAATGC ATCGACGCTT GCAGGAATGG CCTTTAGCCA GGCCGGACTC
GGTTTAAACC ATGCAATTGC TCATCAACTC GGCGGACAGT TTCATCTGCA GCATGGACTG
GCGAATGCGC TCCTGCTCAC GGCAGTTATC CGCTTTAATG CTGGCGATCC GCGCACAGCC
AAACGCTATG CCCGGCTGGC GAAAACGTGC CATTTATGCC CGGACAACGC CAATGACACC
GCCAGCCTGA ATGCACTTAT TCAACATATT GAGCAGCTTA AAACAACCTG TACGCTGCCG
ACTCTTGCTA ATGCGCTGAA AGAAAAAAAA GCAGAATGGT CAATACGCAT ACCAGATATG
GTTCAGGCGG CACTGGCGGA TGCAACGTTG CGTACCAACC CGCGTGCGGC TGATGCCTCC
GCAATTGCAG AACTGCTCGA GGAGTTGTTA TGA
 
Protein sequence
MNTFSLQTRL YSGPGSLKVI SRFSHKHIWI ICDVFLARSP LINTLHQALP DSNRLSIFSE 
ITPDPTIQTV VKGIAQMQTL RPDVVIGFGG GSALDAAKAI VWFGRQCGIE IETCVAIPTT
SGTGSEVTSA CVISDPEKGI KYPLFDNALY PDIAILDPSL IVSVPPAITA NTGMDVLTHA
LEAYVSPRAS DFTDALVEKA VQIVFQYLPT AVKKGDCLAT RGKMHNASTL AGMAFSQAGL
GLNHAIAHQL GGQFHLQHGL ANALLLTAVI RFNAGDPRTA KRYARLAKTC HLCPDNANDT
ASLNALIQHI EQLKTTCTLP TLANALKEKK AEWSIRIPDM VQAALADATL RTNPRAADAS
AIAELLEELL
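
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATACTT TTTCACTGCA GACGCGTTTA TACAGCGGAC CGGGAAGTCT CAAGGTAATT"
last_line = "GCAATTGCAG AACTGCTCGA GGAGTTGTTA TGA"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(first_line)[:3])   # ATG
print(clean_sequence(last_line)[-3:])   # TGA
```

Passing the full gene sequence to `gc_percent` should give a value that can be compared with the GC content field of the record, and `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields.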