Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4237 |
Symbol | |
ID | 4073164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5023726 |
End bp | 5024931 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986269 |
Product | hydrogenase (NiFe) small subunit (hydA) |
Protein accession | YP_593311 |
Protein GI | 94971263 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAGA AGGCACTCAT GTCGAAGAAG CCGACGATTG AAGAACATCT GAAAGCAACT GGTGTGACGC GGAGAAGTTT CGTTCAGCTC TGCGGAATGC TGATGGCGGC AGCCCCAATA GGCTTGTCGC TCACGAGCAA GGCTTCCGCA CAGGAAGTCG CCAAGGTCGT CGGCAAAGCG AAGCGACCCT CGGTGATCTG GTTGCACTTC CAGGATTGCA CCGGTTGTAC CGAGACGTTG TTGCGTACAT CGGCTCCCGA CGTCGCGCAC CTCATCCTCG ACGTCATCTC GCTGGACTAT CACGAGACAT TGATGGCTGC CTCCGGAGCG CAGGCCGAAG CCGCGCTGCA ATCGGCGATT GCCGACAACG CCGGCAAGTT CGTGCTCGTC GTTGAAGGCG CCATCCCCGC GCGCGACGAC GGCAACTACA TGCTGCTCAA CGGCAAGCCC GCCATCCAGG TGATCAAGGA GACGGCGGCC AAGGCGGCGG CAGTGATCGC CATGGGCTCC TGCGCTTCTT GGGGAGGCGT TCCCTCGGCC GATCCCAATC CGACTGGCGC CATCGGCGTG GATTCTGTCA TCTCCGGCAA GCCGATCGTG AACCTGCCAG GATGTCCGCC GAACCCTTAC AACTTGCTTG CCACGGTGCT TGAGTACGTC GTTATGGGCA AGCTGCCCGC GCTCGACGAA TACGGCCGTC CGAAGTTCGC CTATGAGCGC GTGATTCACG AGAATTGCCC GCGCCGCGCG CACTTCGATG CCGGCCGCTT CGCCGCTACG TTCGGCGACG AAGGTCACCG CAAAGGCTGG TGTCTCTACA AACTGGGCTG CAAAGGACCG GTCACGCACG CGGCCTGCTC AACGCGCCAC TTCTGCGAAG TGCCCGGCGT GTGGCCGATT GGCTTGGGCG TTCCTTGTGT CGGATGCACG GAAAAATCCG TCATCTGGCA AATGGGAACG TTCCAGACCG TGCCGATCCA TCTTGCGACG CCGCCCGACA CGTATCCGCC GGTCTTCAGC GGATCAGGCA AGGTTGGAGT CGGCGCGGCA GCCTTAGTCG GTGCCATCGC CGGCGCCGTC GGCGGCGCGA CGTGGGTCGC CTCCCAGAAA TTCAAGAGTT CCAATGAAGC TGGAGTTGAG CACATTGGAC TCGATATCGC GCACGTAGAT AAGCAGAAGT CAGCCAAAGT TGGAAAAGAG GAATAA
|
Protein sequence | MGEKALMSKK PTIEEHLKAT GVTRRSFVQL CGMLMAAAPI GLSLTSKASA QEVAKVVGKA KRPSVIWLHF QDCTGCTETL LRTSAPDVAH LILDVISLDY HETLMAASGA QAEAALQSAI ADNAGKFVLV VEGAIPARDD GNYMLLNGKP AIQVIKETAA KAAAVIAMGS CASWGGVPSA DPNPTGAIGV DSVISGKPIV NLPGCPPNPY NLLATVLEYV VMGKLPALDE YGRPKFAYER VIHENCPRRA HFDAGRFAAT FGDEGHRKGW CLYKLGCKGP VTHAACSTRH FCEVPGVWPI GLGVPCVGCT EKSVIWQMGT FQTVPIHLAT PPDTYPPVFS GSGKVGVGAA ALVGAIAGAV GGATWVASQK FKSSNEAGVE HIGLDIAHVD KQKSAKVGKE E
|
| |