Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5152 |
Symbol | |
ID | 8745700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 47973 |
End bp | 49928 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646515509 |
Product | NADH/Ubiquinone/plastoquinone (complex I) |
Protein accession | YP_003406456 |
Protein GI | 284176179 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCAG ACCTACGACC GCTCGCCGCC GTGTTGGTGT CGGCGGTCGC GATCGTCCTG ATTGTCGCGT CGCATCGCCG TCCGAATCTC CGCGAGGGCT GGTCCGTACT GGCCGCGCTC GGCAAGTTCG GAATCGTCGC CAGCATGCTC CCCGCGGTCA TGTCCGGTAC CGTCTACTCG TGGAGCCTCT ACGAGAGCAC GGGGCTTCGG TTCCTCCCGG GCGTCGACTT CGCGCTGCGG GCGGATCCGC TGGGGATCCT CTTTGCCTTG CTGGCGAGTT TCCTCTGGAT CTTCACGTCC TTCTACGCGG CGGGCTACAT GCGCGGGCTC GACGAGCACG CCCAGACCCG CTTTTTCGCC TCGTTCGCGG CGAGCCTCTC GGCCGCGGTC GGGATCGCCT TCGCCGCGAA CCTGGTGACG ATCTTCGTCT TCTACGAGCT GCTGTCGCTC GTAACCTACC CGCTGGTCGC CCACAACGAG GACAACGAGG CCCGGATCGC CGGCCGGAAG TACCTCACCT ACACGTTCTT CGGCGGCGGG GTCTTCCTGC TGGCCGGGAC CGTGATGGTC TACTGGCTGA CCGCGTCGAT CAACGGCGAT CCGACGCTGG CCTTCGAGTC CGGCGGGATG GAGGCGCTCG CGACGGCCGC GCAAGCCGAA CCCGGCTTCG CGCAGGCCGC GTTCTTCCTG CTGATCGCCG GCTTCGGCGT CAAGGCGGCG CTGATGCCGC TGCATTCGTG GCTCGCGGAC GCGATGGTCG CGCCGACGCC GGTCTCCGGG CTGCTCCACG CGGTCGCGGT CGTCAAGTCC GGCGCCTTCG GCGTCGCGCG GGTCATCTTA GACGTCTACG GACCGGGACT GATCCACGAC CTGCCACTCG ACGTTCCGGG GATCGGCGAG GTCGGGCTCA ACATCCCCGT CGCGATCGTC GCCGCGTTCA CGCTGACCGC GGCCTCTATC ATCGCGATGC GCAAGGACCA CCTCAAGCGC CGGCTGGCCT ACTCGACGAC CGCACAGCTG TCCTACATCG TGCTCGGGCT CTCGATGTTA CACCCCTACG CGATGGTTGG CGCCTTGTTC CACATCCCCG CCCACGCGTT CGCGAAGCTC ACGCTGTTCT TCTGTGCGGG GGCGATCCAC GTCGAGACAC ACACCGACTA CATCAGCGAC ATGGCCGGCA TCGGCAAACG GATGCCGCTG ACGATGGGTG CGTTCACGAT CGGCGCGGCC GGGATGGCCG GCCTCCCGCC GATCGCCGGC TTCGTCAGCA AGTTCTACAT GCTGATCGGC TCCGGCTACA TGGGCGGCGA GTACTGGATC TTCGCCGGCA CGCTGCTCCT CTCGGGCGTG CTCAACATCG CCTACTTCTG GCCGGTCGTC TACACGGCCT TCTTCGAGAG CGAGGACCGC CACGACGCCA AACCGGTCCT TGAGTTCCCG CGCGGCGGGA TCTTCGAGTC CTACGGTGAC ACCGCTCGGG GCCAGAAGGA AGGCGTCGCC ACGGACGGCG GCGCCGATCG GTCCGAGCGG GAAGCCCCGG CGGACGTGGA CGACGAGGGC GCCGACGCTG ACGCGGCTGC CGTCGACGAC GAGAACGGGA ACGAGGACTA CGAGTACGCC GTCGATCAGT ATCCGAGCGA CCACACCGTT CCCGACAACG AGCAGGGCGA ACACGTCAGC TCCGTCGACC ACCACGGCGA CCACGACGAC CACCTCACCG GCGGACCGCC GGCCGGCGGC TGGCAGCGAC AGTCGCCGTT CGGCGAGAGC ACGTGGCTCA TGCTGGCGCC GATCGCGATC ATCGCGACCG GCGCGGTCGT CCTCGGGGTC GTCCCCGACC ACGCCGTCTT CCTCGACCTG ATCACGCACA TCGTGGAGGG CGTCTTCGGC GTCGACTCCT TCGACCAACT GCAGGGCCTG TCCCTCGAGG AGGCCCTGGA GGTGATGTCC GAATGA
|
Protein sequence | MVADLRPLAA VLVSAVAIVL IVASHRRPNL REGWSVLAAL GKFGIVASML PAVMSGTVYS WSLYESTGLR FLPGVDFALR ADPLGILFAL LASFLWIFTS FYAAGYMRGL DEHAQTRFFA SFAASLSAAV GIAFAANLVT IFVFYELLSL VTYPLVAHNE DNEARIAGRK YLTYTFFGGG VFLLAGTVMV YWLTASINGD PTLAFESGGM EALATAAQAE PGFAQAAFFL LIAGFGVKAA LMPLHSWLAD AMVAPTPVSG LLHAVAVVKS GAFGVARVIL DVYGPGLIHD LPLDVPGIGE VGLNIPVAIV AAFTLTAASI IAMRKDHLKR RLAYSTTAQL SYIVLGLSML HPYAMVGALF HIPAHAFAKL TLFFCAGAIH VETHTDYISD MAGIGKRMPL TMGAFTIGAA GMAGLPPIAG FVSKFYMLIG SGYMGGEYWI FAGTLLLSGV LNIAYFWPVV YTAFFESEDR HDAKPVLEFP RGGIFESYGD TARGQKEGVA TDGGADRSER EAPADVDDEG ADADAAAVDD ENGNEDYEYA VDQYPSDHTV PDNEQGEHVS SVDHHGDHDD HLTGGPPAGG WQRQSPFGES TWLMLAPIAI IATGAVVLGV VPDHAVFLDL ITHIVEGVFG VDSFDQLQGL SLEEALEVMS E
|
| |