Gene Information |
Locus tag | EcSMS35_3724 |
Symbol | |
ID | 6144657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3793580 |
End bp | 3794617 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618550 |
Product | putative dehydrogenase |
Protein accession | YP_001745690 |
Protein GI | 170681404 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
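The coordinate and length fields above can be checked against each other. The following is a minimal sketch, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3793580
end_bp = 3794617
reported_gene_length = 1038      # bp
reported_protein_length = 345    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions the record is self-consistent: 3794617 - 3793580 + 1 = 1038 bp, and 1038 / 3 - 1 = 345 aa.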
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATTA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG TATGTACTTA ACCGCAAGGA TAGCTGGCAT GTCGCGCATA TTTTTCGCCG CCATGCGAAG CCGGAAGAAC AGGCCCCCAT TTATTCCCAT ATCCATTTCA CCAGCGATCT CGACGAAGTG CTAAACGATC CCGATGTTAA GCTGGTTGTC GTCTGCACCC ACGCGGACAG CCACTTCGAG TACGCGAAGC GCGCGCTGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACTCCG ACAATTGCGC AGGCGAAAGA GCTGTTTGCA CTGGCGAAAA GCAAAGGGCT GATCGTTACG CCGTATCAGA ATCGTCGCTT TGATTCCTGT TTCCTGACAG CGAAAAAAGC GATTGAAAGC GGCAAGCTGG GAGAGATTGT CGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTGGCA GAAACCAAAC CTGGGCTGCC GCAGGATGGC GCGTTCTATG GCCTTGGTGT GCATACGATG GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG CGTAATAAAG CCAATCCGGA CGACACCTTT GAAGCGCAGC TGTTTTATGG CGATCTAAAA GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT AAGAAAGGTT CGTTTATTAA ATACGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT ATTATGCCGG GCGAACCGGG ATTCGCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC AATGACGAGG GTGTGACGGT CAGAGAAGAG ATGAAGCCGG AGGTGGGCGA TTACGGGCGC GTTTATGATG CGTTGTATCA AACCATCACC AACGGTGCGC CAAATTACGT CAAGGAATCT GAAGTTCTTA CCAATTTGGA AATCCTTGAA CGCGGTTTTG AGCAAGCCTC TCCCTCCACA GTGACTCTCG CGAAGTAA
|
Protein sequence | MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLDEV LNDPDVKLVV VCTHADSHFE YAKRALEAGK NVLVEKPFTP TIAQAKELFA LAKSKGLIVT PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG KKGSFIKYGI DQQETSLKAN IMPGEPGFAA DDSVGVLEYV NDEGVTVREE MKPEVGDYGR VYDALYQTIT NGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK
|
| |
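The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the record. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the amino acid assignments of translation table 11, which match the standard code. Alternative start codons are not handled, which does not matter here because the gene starts with ATG. The names `translate` and `gc_fraction` are illustrative. Only the first 30 and last 24 nucleotides of the gene sequence are used, copied from the record.

```python
from itertools import product

# Codon order TCAG x TCAG x TCAG, paired with the table 11 amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; the record reports this as a percentage."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nt of the gene sequence (start of the reading frame).
head = "ATGGTCATTA ACTGCGCCTT TATTGGCTTC"
# Last 24 nt of the gene sequence (in frame, since 1038 is divisible by 3).
tail = "TCCACA GTGACTCTCG CGAAGTAA"

print(translate(head))  # MVINCAFIGF
print(translate(tail))  # STVTLAK
```

The two outputs match the first ten and last seven residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare with the reported GC content.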