Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1275 |
Symbol | |
ID | 8415570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1532728 |
End bp | 1533960 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645024242 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_003181634 |
Protein GI | 257791028 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.353073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACAG AGGCCACAGT TTCAGAGTTT CAGAACATGC TTTCCGCGCG CGGCGTCAGC CGCCGCAGCT TCATGAAGCT CTGCGGCGCC GTTGCGGTCG CGGCCGGATT GTCCGAGCTC GCTGCGCCGC GCGTGGCGCA GGCCCTCGAG AAATCTGTGA TCGGCGCAAC GAAGGGCAAG CTGTATCCGG TTATCTGGAT CGAGGGCGCG TCGTGCACGG GTTGTACCGA GTCGTTTGCG CAGGTCGAGA CGCCGGATGC GGCTTCGATC GTGCTGGACA TGATCTCGCT CAACTACTCC GAGACCCTGT CGGCGGCAGC CGGCTGGTCG ATGGAGGAGG CCAAGGAGCA GACGATCGAG GCCGGCAATT ACATCCTCGT GTACGAGGGA GCTGTGCTGG AAGGCTGGGG AGGCCAGGCT CTGCGCGTCG CCGACAAGCC CGGTACGGAG CATCTGATCG AGGCTGCCGA GAAAGCCAAC GCCGTAGTCG CGCTGGGTTC CTGCGCGGTC AACGGCGGCT GGATGGGCGC TCATCCCAAC CAGGCGGGTG CGCTCGGCGT GCAGGCGTTC CTGAAGAAGG CCGGCATCAA CACGCCGGTT GTGAACGTTC CCGGTTGCCC GGCCAACCCC GAGTGGCTCG TGGCCGTGCT GGCGGACGTG ATCTTCCTTG AGAAGCTTCC CGCGCTCAAC AGCGAGGACA AGCCTGCCGG CATCTTCGAC CAGACGATCC ACGACAACTG CGAGCGCCGC GGCCACTTCG AGAACGGCGA GTTCGTGTAC AAGTTCGGCT CCGAGGAAGA GGCCAAGGGA TACTGCCTGT ACCCGCTTGG CTGCCGCGGA CCTCAGACGA AGGCGAACTG CGGCGTGACC ATGTGGAACA ACCGTCGCAG CTGGTGCGTG CAGTCCGGCG CTCCCTGCAT CGGATGCTGC GAGGCCAACC CGAACGATCC CGGCCATAAC TGGGTCGAGG TCAACACCCC GTTCTACAAG CGTCATCGCG ACCTGCGCAT CGGCGACTGG ATGGTTCAGC CCGGCACTAT CGCGCTCGGC ATCACCGGCA TTCTTGCCGC GGCCCTCGTG GTGCACGGTT TCGGCATGAA GATCACCGGC CGCATGGACG GCGGCGCCGA CTTCGAGAAG GTGCGCGGCT GGGACGCGAA GCATCCCGAC AAGTCCATCG GCAAGTACGA CGAGGCCGAC CTTAACAACG ACGACAAAAA AGAAGGGAGG TAA
|
Protein sequence | METEATVSEF QNMLSARGVS RRSFMKLCGA VAVAAGLSEL AAPRVAQALE KSVIGATKGK LYPVIWIEGA SCTGCTESFA QVETPDAASI VLDMISLNYS ETLSAAAGWS MEEAKEQTIE AGNYILVYEG AVLEGWGGQA LRVADKPGTE HLIEAAEKAN AVVALGSCAV NGGWMGAHPN QAGALGVQAF LKKAGINTPV VNVPGCPANP EWLVAVLADV IFLEKLPALN SEDKPAGIFD QTIHDNCERR GHFENGEFVY KFGSEEEAKG YCLYPLGCRG PQTKANCGVT MWNNRRSWCV QSGAPCIGCC EANPNDPGHN WVEVNTPFYK RHRDLRIGDW MVQPGTIALG ITGILAAALV VHGFGMKITG RMDGGADFEK VRGWDAKHPD KSIGKYDEAD LNNDDKKEGR
|
| |