Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3466 |
Symbol | hyb0 |
ID | 5588909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3478405 |
End bp | 3479523 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927094 |
Product | hydrogenase 2 small subunit |
Protein accession | YP_001464464 |
Protein GI | 157157587 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGAG ATAACACCCT CATCCATTCT CACGGCATTA ACCGTCGTGA TTTCATGAAG CTTTGTGCAG CATTAGCCGC CACCATGGGG TTAAGTAGCA AAGCCGCTGC AGAGATGGCC GAATCGGTTA CTAACCCGCA GCGTCCGCCA GTTATCTGGA TTGGCGCGCA GGAGTGCACC GGTTGTACGG AATCTCTGCT TCGTGCAACG CATCCAACGG TAGAAAACCT CGTGCTGGAG ACTATCTCTC TGGAGTATCA CGAAGTGCTT TCCGCCGCCT TCGGTCATCA GGTCGAAGAG AACAAACATA ACGCACTTGA GAAGTACAAA GGGCAGTATG TGTTGGTTGT GGATGGTTCC ATCCCATTAA AAGATAACGG TATTTATTGC ATGGTTGCTG GTGAGCCGAT TGTGGATCAC ATCCGCAAAG CGGCAGAAGG CGCAGCAGCG ATTATCGCTA TCGGTTCCTG CTCTGCGTGG GGCGGTGTTG CCGCAGCTGG AGTTAACCCA ACTGGCGCAG TCAGCCTGCA AGAAGTTCTG CCAGGCAAAA CCGTTATCAA TATTCCGGGC TGCCCGCCGA ACCCGCACAA CTTCCTCGCG ACCGTTGCGC ACATCATCAC TTACGGCAAA CCGCCGAAAC TGGATGACAA AAATCGTCCG ACCTTCGCCT ATGGCCGTCT GATTCACGAA CACTGCGAAC GTCGCCCGCA CTTCGATGCT GGTCGTTTTG CCAAAGAGTT CGGTGATGAA GGCCACCGTG AAGGCTGGTG CCTGTACCAC CTCGGCTGTA AAGGGCCAGA AACTTACGGC AACTGCTCAA CGCTGCAATT CTGCGATGTT GGCGGCGTGT GGCCGGTAGC GATTGGTCAC CCATGCTATG GCTGTAACGA AGAAGGTATC GGCTTCCATA AAGGCATCCA TCAGCTTGCC AACGTCGAAA ATCAGACTCC GCGTTCACAG AAACCGGATG TTAACGCTAA AGAGGGCGGC AACGTCTCTG CAGGCGCTAT TGGTTTGCTC GGCGGTGTGG TTGGGCTGGT TGCCGGTGTC AGCGTGATGG CGGTGCGTGA ACTGGGTCGT CAGCAAAAGA AAGATAACGC TGACTCACGG GGAGAATAA
|
Protein sequence | MTGDNTLIHS HGINRRDFMK LCAALAATMG LSSKAAAEMA ESVTNPQRPP VIWIGAQECT GCTESLLRAT HPTVENLVLE TISLEYHEVL SAAFGHQVEE NKHNALEKYK GQYVLVVDGS IPLKDNGIYC MVAGEPIVDH IRKAAEGAAA IIAIGSCSAW GGVAAAGVNP TGAVSLQEVL PGKTVINIPG CPPNPHNFLA TVAHIITYGK PPKLDDKNRP TFAYGRLIHE HCERRPHFDA GRFAKEFGDE GHREGWCLYH LGCKGPETYG NCSTLQFCDV GGVWPVAIGH PCYGCNEEGI GFHKGIHQLA NVENQTPRSQ KPDVNAKEGG NVSAGAIGLL GGVVGLVAGV SVMAVRELGR QQKKDNADSR GE
|
| |