Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2625 |
Symbol | |
ID | 6066223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2877103 |
End bp | 2878221 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602031 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_001725582 |
Protein GI | 170020628 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.245904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000111305 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAACG AGGAAACATT TTACCAGGCC ATGCGGCGTC AGGGCGTTAC CCGGCGCAGC TTTCTCAAAT ATTGTAGTCT GGCTGCCACG TCGCTGGGAT TAGGCGCGGG AATGGCACCA AAGATTGCCT GGGCGCTGGA GAACAAACCC CGCATTCCGG TGGTATGGAT CCACGGTCTG GAATGCACCT GCTGTACCGA ATCTTTTATC CGCTCCGCTC ACCCACTGGC GAAGGACGTC ATCCTTTCCC TGATTTCCCT CGATTATGAC GATACCCTGA TGGCTGCCGC CGGAACCCAG GCGGAAGAAG TCTTCGAAGA CATCATCACG CAATACAATG GCAAATATAT CCTCGCAGTA GAAGGTAATC CGCCGCTGGG CGAGCAGGGG ATGTTCTGTA TCAGCAGCGG TCGACCGTTT ATTGAGAAAC TCAAACGTGC CGCTGCCGGA GCCAGCGCGA TTATCGCCTG GGGAACCTGC GCGTCCTGGG GCTGCGTGCA GGCCGCGCGG CCCAATCCGA CGCAGGCAAC GCCTATCGAC AAAGTCATCA CCGACAAACC CATTATCAAA GTACCTGGCT GCCCGCCGAT CCCGGATGTG ATGAGCGCCA TCATTACTTA CATGGTGACC TTTGATCGCT TGCCAGATGT CGACAGAATG GGCCGTCCGC TGATGTTCTA TGGTCAGCGA ATCCACGATA AATGCTATCG CCGCGCCCAC TTCGACGCCG GAGAGTTCGT CCAGAGTTGG GATGATGACG CTGCCCGCAA AGGTTACTGC CTGTACAAAA TGGGCTGCAA AGGGCCTACC ACCTATAACG CCTGTTCCTC CACACGCTGG AATGATGGCG TTTCTTTCCC AATCCAGTCT GGTCACGGCT GCCTGGGCTG TGCGGAAAAT GGTTTCTGGG ATCGCGGTTC GTTCTACAGC CGCGTGGTCG ATATTCCGCA AATGGGTACT CATTCCACCG CCGATACCGT CGGTTTAACC GCGCTTGGCG TGGTGGCAGC GGCTGTTGGT GTGCACGCAG TCGCCAGCGC CGTTGACCAG CGCAGACGTC ATAACCAGCA ACCTACAGAA ACCGAACATC AGCCAGGCAA TGAGGATAAA CAGGCATGA
|
Protein sequence | MNNEETFYQA MRRQGVTRRS FLKYCSLAAT SLGLGAGMAP KIAWALENKP RIPVVWIHGL ECTCCTESFI RSAHPLAKDV ILSLISLDYD DTLMAAAGTQ AEEVFEDIIT QYNGKYILAV EGNPPLGEQG MFCISSGRPF IEKLKRAAAG ASAIIAWGTC ASWGCVQAAR PNPTQATPID KVITDKPIIK VPGCPPIPDV MSAIITYMVT FDRLPDVDRM GRPLMFYGQR IHDKCYRRAH FDAGEFVQSW DDDAARKGYC LYKMGCKGPT TYNACSSTRW NDGVSFPIQS GHGCLGCAEN GFWDRGSFYS RVVDIPQMGT HSTADTVGLT ALGVVAAAVG VHAVASAVDQ RRRHNQQPTE TEHQPGNEDK QA
|
| |