Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_2056 |
Symbol | |
ID | 6368026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | - |
Start bp | 2192013 |
End bp | 2193134 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642677469 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_001952292 |
Protein GI | 189425115 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAGAG ATGAATGTGC TGGAAAGAAA CAGGAAGGTT TCTCGGTTGC CAGGATGCTT GAAGAACGGG GAGTCTCGCG GCGTGATTTC CTGAAATTCT GTTCTACTGT CACCGCTGCC ATGGCGTTAC CTGCCACCAT GGCACCCAAG GTGGCTCAAG CCCTGGACAA GGTGCAGCGT CCTCCGCTGG TCTGGCTGGA GTTTCAGGAT TGCTGCGGCG ACACGGAGGC TCTGTTACGT TCAGCCAACC CCACCGTGGG AGAGCTGGTA CTGGATATCC TCTCGGTTGA TTACCATGAA ACCATCATGG CTGCTGCCGG TCATCAGGCT GAGGCCAACC TCGAAAAGAC CATCAAAGAG TTCCAGGGCA AATACCTCTG CGTGGTTGAG GGTTCCATCC CGATGAAGGA AGGAGGAGCC TATGGCTGTG TTGGCGGCAA GTCCCATCTG GCCCGGGCCA AACAGGTCTG TGGATCTGCA GCAGCCACCA TTGCTGTGGG CACCTGTGCC AGTTTCGGCG GTATTCCCGC TGCTGCTCCC AATCCCACCG GCGCAGTTGG GGTCAAAGAG GCGGTGCCCG GTGCTACGGT GATCAACCTG CCCGGCTGCC CCTGCAATGC CGATAACCTG ACCGCTGTAG TGGTTCACTT CCTTACCTTT GGTAAACTTC CCAGTCTTGA CAGCCATGGC CGTCCCCTGT TTGCCTACGG CAAGCGGATT CATGACAACT GTGAACGTCG TCCCCACTTT GATGCCGGTC AGTATGTTGA GCATTGGGGG GATGATGCCC ACCGCAAGGG GCACTGCCTC TACAAGATGG GCTGTAAGGG TCCGGCAACC TTCCATAACT GTCCCACCCA GCGTTTTAAC GAGAGAATCA GCTGGCCGGT TGCTGCCGGT CATGGCTGTG TCGGCTGTTC CGAACCCCAG TTCTGGGATA CTTCGCCACT CTATCGCCGT CTGCCCAACG TGCCTGGCTT TGGTATTGAG CAGAGTGCCG ACAAGATCGG GCTTGCCTTT ACTGCCGGTG TGGGTGGTGC CTTTGCTATC CATGGTGCCA TGAATGCCCT GCGCAAGGAT AAAGATACGG CTGACGAGAA CACAAAAGAC GGGGAGGAAT AG
|
Protein sequence | MDRDECAGKK QEGFSVARML EERGVSRRDF LKFCSTVTAA MALPATMAPK VAQALDKVQR PPLVWLEFQD CCGDTEALLR SANPTVGELV LDILSVDYHE TIMAAAGHQA EANLEKTIKE FQGKYLCVVE GSIPMKEGGA YGCVGGKSHL ARAKQVCGSA AATIAVGTCA SFGGIPAAAP NPTGAVGVKE AVPGATVINL PGCPCNADNL TAVVVHFLTF GKLPSLDSHG RPLFAYGKRI HDNCERRPHF DAGQYVEHWG DDAHRKGHCL YKMGCKGPAT FHNCPTQRFN ERISWPVAAG HGCVGCSEPQ FWDTSPLYRR LPNVPGFGIE QSADKIGLAF TAGVGGAFAI HGAMNALRKD KDTADENTKD GEE
|
| |