Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1087 |
Symbol | hyaA |
ID | 5590626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1110800 |
End bp | 1111918 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640924790 |
Product | nickel-dependent hydrogenase, small subunit |
Protein accession | YP_001462203 |
Protein GI | 157155331 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000220987 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACG AGGAAACATT TTACCAGGTC ATGCGGCGTC AGGGCGTTAC CCGGCGCAGC TTTCTCAAAT ATTGTAGTCT GGCTGCCACG TCGCTGGGAT TAGGCGCGGG AATGGCACCA AAGATTGCCT GGGCGCTGGA GAACAAACCC CGCATTCCGG TGGTATGGAT CCACGGTCTG GAATGCACCT GCTGTACCGA ATCTTTTATC CGCTCCGCTC ACCCACTGGC GAAGGACGTC ATCCTTTCCC TGATTTCCCT CGATTATGAC GATACTTTGA TGGCTGCCGC CGGAACCCAG GCGGAAGAAG TCTTCGAAGA CATCATCACG CAATACAATG GCAAATATAT CCTCGCAGTA GAAGGTAATC CGCCGCTGGG CGAGCAGGGG ATGTTCTGTA TCAGCAGCGG TCGACCGTTT ATTGAGAAAC TCAAACGTGC CGCTGCCGGA GCCAGCGCGA TTATCGCCTG GGGAACCTGC GCGTCCTGGG GCTGCGTGCA GGCCGCGCGG CCCAATCCGA CGCAGGCAAC GCCTATCGAC AAAGTCATCA CCGACAAACC CATTATCAAA GTACCTGGCT GCCCGCCGAT CCCGGATGTG ATGAGCGCCA TCATTACTTA CATGGTGACC TTTGATCGCT TGCCAGATGT CGACAGAATG GGCCGTCCGC TGATGTTCTA TGGTCAGCGA ATCCACGATA AATGCTATCG CCGCGCCCAC TTCGACGCCG GAGAGTTCGT CCAGAGTTGG GATGATGACG CTGCCCGCAA AGGTTACTGC CTGTACAAAA TGGGCTGCAA AGGGCCTACC ACCTATAACG CCTGTTCCTC CACACGCTGG AATGATGGCG TTTCTTTTCC AATCCAGTCT GGTCACGGCT GCCTGGGCTG TGCGGAAAAT GGTTTCTGGG ATCGCGGTTC GTTCTACAGC CGCGTGGTCG ATATTCCGCA AATGGGTACT CATTCCACTG CCGATACCGT CGGTTTAACC GCGCTTGGCG TGGTGGCAGC GGCTGTTGGT GTGCACGCAG TCGCCAGCGC CGTTGACCAG CGCAGACGTC ATAACCAGCA ACCTACAGAA ACCGAACATC AGCCAGGCAA TGAGGATAAA CAGGCATGA
|
Protein sequence | MNNEETFYQV MRRQGVTRRS FLKYCSLAAT SLGLGAGMAP KIAWALENKP RIPVVWIHGL ECTCCTESFI RSAHPLAKDV ILSLISLDYD DTLMAAAGTQ AEEVFEDIIT QYNGKYILAV EGNPPLGEQG MFCISSGRPF IEKLKRAAAG ASAIIAWGTC ASWGCVQAAR PNPTQATPID KVITDKPIIK VPGCPPIPDV MSAIITYMVT FDRLPDVDRM GRPLMFYGQR IHDKCYRRAH FDAGEFVQSW DDDAARKGYC LYKMGCKGPT TYNACSSTRW NDGVSFPIQS GHGCLGCAEN GFWDRGSFYS RVVDIPQMGT HSTADTVGLT ALGVVAAAVG VHAVASAVDQ RRRHNQQPTE TEHQPGNEDK QA
|
| |