Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1944 |
Symbol | |
ID | 5164616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 2252248 |
End bp | 2253183 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640549438 |
Product | hydrogenase 2 protein HybA |
Protein accession | YP_001230707 |
Protein GI | 148264001 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.422273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACA AGAGTAGGAG AGAGTTCCTC AAGTTGGTCG GTGTGACCGG AGCAGGCCTT CTGACCGGTG CAGCTACTGC CAGCGGATCG GAAGGGCTGC ATGTGAATAA CGAGGAGATC GGCATGCTCT ATGACGCCAC CAAGTGCGTC GGCTGCAAGG CTTGCATGGC AGCCTGCAAG CGGGTCAACG GCGACTACGG CAGTCTTTCT TATGAAAAAG CCAAGTTCGA TCCCGATGGG CTGTGGGATG CGCCGACCGA CCTTTCCGGC AGCACCCGGA CCCTGATCAA GCTGTTCAAG GAGTCCGAAA GCCGCTGGTC GTACGTGAAA TATTCCTGCA TGCATTGCCA GAAGCCGTCC TGTGTCTCGG TCTGCCCGGT CAGCGCCATG ACCAAGGACA AGGTGAGCGG CGTCGTCGAT TACAACAAAA ACACCTGCAT CGGTTGTCGC TACTGCCAGG TGGCATGTGC CTTCAATATC CCCAAATTCC AGTGGGAAAA GTCGATCCCG CAGATCGTCA AATGTGATCT CTGCAAGAAT ACCAATCTGC GCGAGAAGGG GATTTCCGCC TGCGCCGAGG TCTGTCCGGT AGGGGCGATC AAGTTCGGCA AGCGCAAGGA TCTCTTGCAG GAGGCGAAAA CCAGGCTGCG GGAAAACCCC GACAAATACA TCGCCCATGT CTATGGCGAG CACGAAGCAG GCGGCACCAA TCACCTTTAC CTGGCCTCCA TGCCGTTCAA CAAGCTGGGC TTGCCGGATA TAAAGCCGCA GGCTCCGGCT GAATTTTCCG AGAAGATCCA GCACACCATC TACAAAGGGT TCATTGCCCC GGTCGCCCTC TACAGCACCC TCTGTTTCAT AGCCGTCAAG AACATGAAAA AGCACGACAA AACCGATGAC CATGACAAGC GGAAGCATGA CGGGGAGGAG CGATAA
|
Protein sequence | MKNKSRREFL KLVGVTGAGL LTGAATASGS EGLHVNNEEI GMLYDATKCV GCKACMAACK RVNGDYGSLS YEKAKFDPDG LWDAPTDLSG STRTLIKLFK ESESRWSYVK YSCMHCQKPS CVSVCPVSAM TKDKVSGVVD YNKNTCIGCR YCQVACAFNI PKFQWEKSIP QIVKCDLCKN TNLREKGISA CAEVCPVGAI KFGKRKDLLQ EAKTRLRENP DKYIAHVYGE HEAGGTNHLY LASMPFNKLG LPDIKPQAPA EFSEKIQHTI YKGFIAPVAL YSTLCFIAVK NMKKHDKTDD HDKRKHDGEE R
|
| |