Gene EcHS_A3639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3639 
Symbol 
ID5594327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3626804 
End bp3627841 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content51% 
IMG OID640922755 
Productputative dehydrogenase 
Protein accessionYP_001460236 
Protein GI157162918 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.117542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCATCA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG 
TATGTACTTA ACCGCAAGGA TAGTTGGCAT GTCGCGCATA TTTTTCGTCG CCATGCGAAG
CCGGAAGAAC AGGCTCCCAT TTATTCCCAT ATCCATTTCA CCAGCGATCT CGACGAAGTA
CTAAACGATC CCGATGTTAA GCTGGTTGTT GTCTGCACCC ACGCGGACAG CCATTTCGAG
TACGCGAAAC GCGCGCTGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACCCCG
ACGCTTGCCC AGGCGAAGGA GCTGTTTGCG TTGGCGAAAA GCAAAGGGCT GACCGTCACG
CCGTATCAGA ATCGTCGCTT TGACTCCTGC TTCCTGACAG CGAAAAAAGC GATTGAAAGT
GGCAAGTTGG GAGAGATTGT TGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTCGCA
GAAACCAAAC CTGGGTTGCC GCAGGATGGC GCGTTCTATG GCCTTGGTGT GCATACGATG
GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG
CGCAATAAAG CCAATCCTGA CGATACTTTC GAAGCGCAAC TGTTTTATGG CGATCTAAAA
GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT
AAGAAAGGTT CGTTTATTAA ATATGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT
ATTATGCCGG GCGAACCGGG ATTCTCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC
AATGACGAGG GCGTGACGGT CAGAGAAGAG ATTAAGCCGG AGATGGGCGA TTACGGGCGC
GTTTATGATG CGTTGTATCA AACCATCACC CACGGTGCGC CAAATTACGT CAAGGAATCT
GAAGTTCTTA CGAATCTGGA AATCCTTGAA CGCGGATTCG AGCAAGCCTC TCCCTCCACA
GTGACTCTCG CGAAGTAA
 
Protein sequence
MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLDEV 
LNDPDVKLVV VCTHADSHFE YAKRALEAGK NVLVEKPFTP TLAQAKELFA LAKSKGLTVT
PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM
DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG
KKGSFIKYGI DQQETSLKAN IMPGEPGFSA DDSVGVLEYV NDEGVTVREE IKPEMGDYGR
VYDALYQTIT HGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK