Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5151 |
Symbol | |
ID | 8745699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 46120 |
End bp | 47976 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646515508 |
Product | NADH/Ubiquinone/plastoquinone (complex I) |
Protein accession | YP_003406455 |
Protein GI | 284176178 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0571898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAAA GCGATCTCCT GACGATGGCT TACCCGCCGC TGCTGGTGTT CGCGGCGGCG TTGCTCGTGC TCGTACTGCC CCGAATCGCC GGCTTCGCCG CCGGCGCGCT GAGCCTCGCG GGCGTGTTGG CGATCTCGGT GTACGCCCCC GAAGGGAGTT ACCTCACCGG CACCTTCCTC GGGTTCGACG TCGTCGCCTT CCACGTCGAC GGCTTCTCCC AGATGATCGG CATCGGACTG GGCTTCCTCG GGATCTGTTC GGTCATCTAC GCCTACTCGA GCGGTGCCAG CCGCGAACTG GTCGCGATCG CGCTCACCTA CGTCGCCTCC TCGCTCGGGG CGGCGTTCGC CGGCGACTGG CTCGTGCTCG TGTTCATGTG GGAGCTGATG GCCGTGACGA GCACGCTGGT CGTCTGGCAC TACGGCGGCG ACGCCGTCCG GGCCGGCTTC CGCTACGCGT TGTTCCACGG CACCGGCGGC GTGATCGTGC TGCTTGCCGT CGCCGCCCAC TACGTCGAGG CCGGGACGTT CGTCTACGAC GGAACCGGGA TCGCGTCCGG GCTGCCGGCG ATGCTCGCGG TGCTCGGAAT GGGCGTCAAC GTCGGCTTTA TCGGGCTCCA CACGTGGCTG CCCGACACCT ACCCGCGGCC GCACTTCGCG GCGTCGGTGT TCCTCTCGGT CTACACCACG AAGACGAGCG CGTTCGTCCT CTACCGGGCG TTCCCCATCG GCGCCGAGAG CGAACTGGGA ATCTACATCG CGTACATGGG CGGACTGATG TCCGTCTACG GGGCGACGTT CGCCCTGTTG CAACACGACA TGCGGGCGCT GCTGTCCTAC CACATCCAGG CCCAGCTGGG CTACATCGTC GCCGGAATCG GGATGGGCGC CTGGATGGTC GAAACCGAGA TCGCCACCGC TGGGGCGATG AGCCATCTGT TCAACAACAT CCTGTTCAAG AGCCTGCTGT TCATGGCCGT CGGCGTCGTC ATCTTCCGCA CCGGCGAGGA GGACCTCTAC AAACTCGGCG GGCTCTGGCG CGAGATGCCG CTGACGGCGA TCGGATTCGG CCTCGGCGCG CTCTCGATCA CCGCGATCCC CGGCTTCAAC GGGTACATCA GCAAGGGAAT GCTCTTCGAC GCGGCCGATC CCCACTACTA CGGGGTCCAC GAGTTCGAGG CGCTGTACTG GCTGCTCTGG ATCGGCGCGA TCGGGACCCT CCTGTCGTTC ATCAAGCTCG GCTACTACGT CTTCTTCCAC GGAGAGAGCG ACATCTCGGT GCCCGACGCC AGACCCGGCC AGACGATCGC CATGCTCGGA CTGGGTGGGG CCTGCCTGCT CTTCGGCGTC TGGTGGCAGG GACTGGCCGA CCTCGCGCCG ACGATCCACG CCCACGGCGG TGAGTTCTCG TTCGTCTACC CGAACGACGG CGAGGGCCAC CTCCACCCCT ATAGCTCGAG CCACCTCGAG ACGGCGGGAA TTCTGACGGC AGTCGCGCTC GTCACCTTCG CCGTCGTCCG CAAACCGCTC TCGAAGCTCG ATCTGGGCGA TCCGGCGCGG GTCGTCTTCC CCGCGGGCTA CTACGTCGGC CGGTGGTCGA TGCTCGCGAC GACCGAACTC TACCGGGTCG TCGACGCCGC GGCCGTCGGC CTCGTCAAGC GCTGCTACTG GATCGGCAAC AACCCGGTGC TGGCCGTCGA CGCCGCCGCT CGCCGCGTTC CCGGGGTCGA CGTCGAGGAT CGGCAGCCGA CCGATGGCGG TCGACCGTCG ACGATTCACC TCCGGGCGAG CATCGGGACC ACCGTCCTCC TGCTGACGGT CGTGCTGACG GTGATCCTCC TGTTGCTCCT CGTCTAA
|
Protein sequence | MIESDLLTMA YPPLLVFAAA LLVLVLPRIA GFAAGALSLA GVLAISVYAP EGSYLTGTFL GFDVVAFHVD GFSQMIGIGL GFLGICSVIY AYSSGASREL VAIALTYVAS SLGAAFAGDW LVLVFMWELM AVTSTLVVWH YGGDAVRAGF RYALFHGTGG VIVLLAVAAH YVEAGTFVYD GTGIASGLPA MLAVLGMGVN VGFIGLHTWL PDTYPRPHFA ASVFLSVYTT KTSAFVLYRA FPIGAESELG IYIAYMGGLM SVYGATFALL QHDMRALLSY HIQAQLGYIV AGIGMGAWMV ETEIATAGAM SHLFNNILFK SLLFMAVGVV IFRTGEEDLY KLGGLWREMP LTAIGFGLGA LSITAIPGFN GYISKGMLFD AADPHYYGVH EFEALYWLLW IGAIGTLLSF IKLGYYVFFH GESDISVPDA RPGQTIAMLG LGGACLLFGV WWQGLADLAP TIHAHGGEFS FVYPNDGEGH LHPYSSSHLE TAGILTAVAL VTFAVVRKPL SKLDLGDPAR VVFPAGYYVG RWSMLATTEL YRVVDAAAVG LVKRCYWIGN NPVLAVDAAA RRVPGVDVED RQPTDGGRPS TIHLRASIGT TVLLLTVVLT VILLLLLV
|
| |