Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2145 |
Symbol | hyaA |
ID | 6143203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2154324 |
End bp | 2155442 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617021 |
Product | nickel-dependent hydrogenase 1, small subunit |
Protein accession | YP_001744196 |
Protein GI | 170683371 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.191691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.521589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACG AGGAAACATT TTACCAGGCC ATGCGGCGTC AGGGCGTTAC CCGGCGCAGC TTTCTCAAAT ATTGCAGTCT GGCTGCCACG TCGCTGGGAT TAGGCGCGGG AATGGCACCA AAGATTGCCT GGGCGCTGGA GAACAAACCG CGCATTCCGG TGGTATGGAT CCACGGTCTG GAATGCACCT GCTGTACCGA ATCTTTTATC CGCTCCGCTC ACCCGCTGGC AAAGGACGTC ATCCTTTCCC TGATTTCCCT CGATTACGAC GATACTTTGA TGGCTGCCGC CGGAACCCAG GCGGAAGAAG TCTTTGAAGA CATCATCACG CAATACAATG GCAAATATAT CCTCGCAGTA GAAGGTAATC CGCCGCTGGG CGAGCAGGGG ATGTTCTGTA TCAGCAGCGG ACGACCGTTT ATTGAGAAAC TCAAACGTGC CGCTGCCGGA GCCAGTGCGA TTATCGCCTG GGGAACCTGC GCGTCCTGGG GCTGCGTGCA GGCCGCGCGA CCCAATCCGA CGCAGGCAAC GCCTATCGAC AAAGTCATCA CCGACAAACC TATTATCAAA GTACCTGGCT GCCCGCCGAT CCCGGATGTG ATGAGCGCCA TCATTACTTA CATGGTGACC TTTGATCGCT TGCCAGATGT CGACAGAATG GGCCGTCCGT TGATGTTCTA TGGTCAGCGA ATCCACGATA AATGCTATCG CCGCGCCCAC TTCGACGCCG GAGAGTTCGT CCAGAGTTGG GACGATGACG CTGCCCGCAA AGGTTACTGC CTGTACAAAA TGGGCTGCAA AGGGCCTACC ACCTATAACG CCTGTTCCTC CACACGCTGG AATGATGGCG TTTCTTTCCC AATCCAGTCT GGTCACGGCT GCCTGGGCTG TGCGGAAAAT GGTTTCTGGG ATCGCGGTTC GTTCTACAGC CGCGTGGTCG ATATTCCGCA AATGGGTACT CATTCCACCG CCGATACCGT CGGCTTAACC GCGCTTGGCG TGGTGGCAGC GGCTGTTGGT GTGCACGCAG TCGCCAGCGC CGTTGACCAG CGCAGACGTC ATAACCAGCA ACCTACAGAA ACCGAACATC AGCCAGGCAA TGAGGATAAA CAGGCATGA
|
Protein sequence | MNNEETFYQA MRRQGVTRRS FLKYCSLAAT SLGLGAGMAP KIAWALENKP RIPVVWIHGL ECTCCTESFI RSAHPLAKDV ILSLISLDYD DTLMAAAGTQ AEEVFEDIIT QYNGKYILAV EGNPPLGEQG MFCISSGRPF IEKLKRAAAG ASAIIAWGTC ASWGCVQAAR PNPTQATPID KVITDKPIIK VPGCPPIPDV MSAIITYMVT FDRLPDVDRM GRPLMFYGQR IHDKCYRRAH FDAGEFVQSW DDDAARKGYC LYKMGCKGPT TYNACSSTRW NDGVSFPIQS GHGCLGCAEN GFWDRGSFYS RVVDIPQMGT HSTADTVGLT ALGVVAAAVG VHAVASAVDQ RRRHNQQPTE TEHQPGNEDK QA
|
| |