Gene Information |
Locus tag | Tgr7_1269 |
Symbol | |
ID | 7317758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1363654 |
End bp | 1364721 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643616157 |
Product | Hydrogenase (acceptor) |
Protein accession | YP_002513342 |
Protein GI | 220934443 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA); [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
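The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1363654
END_BP = 1364721
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumes the CDS length counts exactly one terminal stop codon,
    which is not translated into an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length_from_cds(computed_gene_length)

print("gene length matches table:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches table:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the coordinate span is 1068 bp, which corresponds to 356 codons, one of which is the stop codon.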
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCATCAGG GCAAGACACC GGAAACACTG AGCGAGGAAC TGCGCCGACG CGGCATGAGC CGACGCACCT TCCTCAAGTT CTGCGCCGCC GTGGCATCCA GCATGGCCCT GCCCGCGACC ATGGCGCCGG TGATGGCGGA GGCCCTGGCC AAGGCCCGGC GCCAGTCGGT GATCTGGCTG TCCTTCCAGG AGTGCACCGG TTGCGTGGAG TCCCTGACCC GCTCCTACGC ACCCTCGCTC GAATCCCTGA TCTTCGATTT CATCTCCCTG GACTACCAGC ACGCCCTGCA GGCGGCCGCC GGCCACCAGG CCGAGGCGGC CCGCGAAGCG GCCATGAAGG AACACTGGGG CAAGTACCTG GTGATCGTGG ACGGCTCCAT CCCGCTCAAG GACGGCGGCG TCTATTCCAC CGTCGCGGGC GAGACCAACC TGGACACCCT GAGGCACGTG GCCGAGGGTG CCGCGGCCCT CATCAGCGTG GGCACCTGCG CGGCCTACGG CGGGCTCCCC AAGGCCCACC CCAATCCCAC CGGCGCTGCT GCGGTCACGG ACATCATCAA GGACAAGCCG GTGATCAACG TGCCGGGCTG CCCGCCCATC CCGGAAGTGA TGACCGGCGT GATCGCCAAC TTCCTCACCT TCGGCCGTCT GCCGGACATG GATCACCTGA ACCGGCCGAT CGCCTTCTTC GGCGACACCA TCCACGACCG CTGCTACCGG CGCCCCTTCT ACCGGCGCGG CCAGTTCGCC AAGCGCTTCG ACGACGAGGG CGCCCGCAAC GGCTGGTGCC TGTACGAACT GGGCTGCAAG GGTCCGGTGG TGCACAACGC CTGTGCCACC ACCAAGTGGA ACCAGAACCT CACCTTCCCC ATCCAGTCCG GCCACGGCTG CATCGGCTGC TCGGAACCCG ACTTCTGGGA CAAGGGCAGC TTCTACGCGC CGCTCTCCAC CGGCCGCTGG GGCAGCGCCG AAGGCATCGC CGCGGCGGCG GTGGTCGGCA CCACGGTGGG CATCGGCAGC GCCATGCTCT CCCGGGCGCG CCAGTCCGAC ATCATCAAGA AGGGGTGA
Protein sequence | MHQGKTPETL SEELRRRGMS RRTFLKFCAA VASSMALPAT MAPVMAEALA KARRQSVIWL SFQECTGCVE SLTRSYAPSL ESLIFDFISL DYQHALQAAA GHQAEAAREA AMKEHWGKYL VIVDGSIPLK DGGVYSTVAG ETNLDTLRHV AEGAAALISV GTCAAYGGLP KAHPNPTGAA AVTDIIKDKP VINVPGCPPI PEVMTGVIAN FLTFGRLPDM DHLNRPIAFF GDTIHDRCYR RPFYRRGQFA KRFDDEGARN GWCLYELGCK GPVVHNACAT TKWNQNLTFP IQSGHGCIGC SEPDFWDKGS FYAPLSTGRW GSAEGIAAAA VVGTTVGIGS AMLSRARQSD IIKKG