Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1943 |
Symbol | |
ID | 5164615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 2251128 |
End bp | 2252264 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640549437 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_001230706 |
Protein GI | 148264000 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.162149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGGAG GTGTTATGAA GAAAGAAGAA GATTTTTTGT GTGCAGGGGT ATCCCGAAGA AGTTTTATGA AAACCTGCAT AACTGCCACG GCCATGATGG GGCTGCCGTT CAGCATGCAT ACCAAGGTTG CCGAAGCGAT GGAGAAAAAC GGCAACCCTT CGGTAATCTG GCTGCATTTC CAGGAATGTA CCGGTTGTTC AGAGTCGCTC CTCAGGTCTA CCCATCCGAC AATTTCGACT CTGATCCTGG ATATGATATC CCTCGACTAT CACGAGACGT TGATGGCCGG ATCAGGCGCC CAGGCTGAAA AGTCGCTGCA CGATTCGATG CTCGCCAACA AAGGCAAGTA CTTGCTGGTT GTCGAAGGAG CGATTCCGAC CAAGGAGAGC GGCATTTATT GCAAGGTCGG CGGCAAGACT GCTCTCGAAT CCTTGCAGCA TGCGGCTTCA AATGCGGCTG CCATCATCTC CATCGGCACC TGCGCATCTT ACGGCGGAAT CCAGTCTGTC GGCCCGAATC CCACCGGCGC CGTAGGGGTG CGGGATATCG TCAAGGACAA GCCGATCATC AACATTCCCG GCTGCCCTCC CAGTCCCTAT AATCTGCTTT CCACCGTGAT GTATTACCTG ACGTTCAAAA AAATACCCGA GCTGGATGCA CTCGGACGGC CGAAATTCGC TTACGGCAGA AAGATCCACG AGCATTGCGA GCGGCGGCCC CATTTCGATG CCGGCCGGTT TGCCAAGGCG TACGGTGATG ATACCCATGC CCAGGGATAC TGCTTGTTCA AGCTCGGCTG CAAGGGACCT GCAACCTATG CCAACTGTTC CGTACAGCGC TTTAATGAAG TTGGCGTCTG GCCGGTATCT GTCGGCCATC CCTGTATCGG CTGTACCGAG CCGGATGTGC TCTTTAAAAT GGCGATTGCC GACAAGGTGC AGATACACGA ACCTACTCCG TTTGACAGTT ATGCACCGGT AGATTTGAAG GAAAAAGGTA AGGGTCCGGA ACCGTTGACC ACGGGCTTTG TCGGACTTGC TGCGGGCGCT GCCCTTGGGG CCGGAGCAAT GCTGGCCAAA AAGCTGCCGA AAGATGATGG CCACAAGGAG GACGACCACC ATGAAGAACA AGAGTAG
|
Protein sequence | MSGGVMKKEE DFLCAGVSRR SFMKTCITAT AMMGLPFSMH TKVAEAMEKN GNPSVIWLHF QECTGCSESL LRSTHPTIST LILDMISLDY HETLMAGSGA QAEKSLHDSM LANKGKYLLV VEGAIPTKES GIYCKVGGKT ALESLQHAAS NAAAIISIGT CASYGGIQSV GPNPTGAVGV RDIVKDKPII NIPGCPPSPY NLLSTVMYYL TFKKIPELDA LGRPKFAYGR KIHEHCERRP HFDAGRFAKA YGDDTHAQGY CLFKLGCKGP ATYANCSVQR FNEVGVWPVS VGHPCIGCTE PDVLFKMAIA DKVQIHEPTP FDSYAPVDLK EKGKGPEPLT TGFVGLAAGA ALGAGAMLAK KLPKDDGHKE DDHHEEQE
|
| |