Gene EcSMS35_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3724 
Symbol 
ID6144657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3793580 
End bp3794617 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content51% 
IMG OID641618550 
Productputative dehydrogenase 
Protein accessionYP_001745690 
Protein GI170681404 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATTA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG 
TATGTACTTA ACCGCAAGGA TAGCTGGCAT GTCGCGCATA TTTTTCGCCG CCATGCGAAG
CCGGAAGAAC AGGCCCCCAT TTATTCCCAT ATCCATTTCA CCAGCGATCT CGACGAAGTG
CTAAACGATC CCGATGTTAA GCTGGTTGTC GTCTGCACCC ACGCGGACAG CCACTTCGAG
TACGCGAAGC GCGCGCTGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACTCCG
ACAATTGCGC AGGCGAAAGA GCTGTTTGCA CTGGCGAAAA GCAAAGGGCT GATCGTTACG
CCGTATCAGA ATCGTCGCTT TGATTCCTGT TTCCTGACAG CGAAAAAAGC GATTGAAAGC
GGCAAGCTGG GAGAGATTGT CGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTGGCA
GAAACCAAAC CTGGGCTGCC GCAGGATGGC GCGTTCTATG GCCTTGGTGT GCATACGATG
GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG
CGTAATAAAG CCAATCCGGA CGACACCTTT GAAGCGCAGC TGTTTTATGG CGATCTAAAA
GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT
AAGAAAGGTT CGTTTATTAA ATACGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT
ATTATGCCGG GCGAACCGGG ATTCGCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC
AATGACGAGG GTGTGACGGT CAGAGAAGAG ATGAAGCCGG AGGTGGGCGA TTACGGGCGC
GTTTATGATG CGTTGTATCA AACCATCACC AACGGTGCGC CAAATTACGT CAAGGAATCT
GAAGTTCTTA CCAATTTGGA AATCCTTGAA CGCGGTTTTG AGCAAGCCTC TCCCTCCACA
GTGACTCTCG CGAAGTAA
 
Protein sequence
MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLDEV 
LNDPDVKLVV VCTHADSHFE YAKRALEAGK NVLVEKPFTP TIAQAKELFA LAKSKGLIVT
PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM
DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG
KKGSFIKYGI DQQETSLKAN IMPGEPGFAA DDSVGVLEYV NDEGVTVREE MKPEVGDYGR
VYDALYQTIT NGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK