Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3879 |
Symbol | |
ID | 8744507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 110761 |
End bp | 112986 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514463 |
Product | protein of unknown function DUF162 |
Protein accession | YP_003405410 |
Protein GI | 284167132 |
COG category | [C] Energy production and conversion |
COG ID | [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain |
TIGRFAM ID | [TIGR00273] iron-sulfur cluster-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0271364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAGCG AGCGATCCCG AAAGGCCGAG CGGATTCGTC AGATCATGGC AACGGAGGGC GACAGCGTCG AACGCAACGC CCGCGGGTTC AACGAGGGCC GATACGAGTC CGTCGCCCGA CTCGAGGACT ACGACGCGTA CAAGGACCGG GCGCGGGCGA TCAAGGCGGA CGCGATCGAG CGCCTCCCCG AACTGATCGA GCGGGTGCGC GAGACGGTCG AGGAAAACGG CGGGACCGTC TACGTCGCCG AGGACGCCGC CGACGCGAAC CGGTACGTCC GCGAACTCGC CCGTGAACGG GCGGCCGAGA CCGTCGTCAA GTCCAAGTCG ATGACGACCG AGGAGATCGA TTTGAACGAG GCCCTCGCGG CCGAGGGTTG TGACGTCTGG GAAACCGACC TCGGCGAGTT CGTTCTGCAG GTGGCCGACG AGGCACCCAG CCATCTCGTC GCGCCGGCGA TCCACCAGTC CCGGGCGGAG ATCGCCGCCC TGTTCAACGA GTACTTCGAC CCGGATACGG AACTCGAGAC GGCCGAGGAG CTGACCGCGT TCGCGCGGGA GTACCTCGGC GAGCGGATCG AGGACGCGGA CATCGGTGTC ACCGGTGCGA ACTTCGTAAC CGCGGACACG GGGACGATGG CGCTGGTCAC CAGCGAGGGC AACGCCCGCA AGACCGTCGC CGTGCCCGAC ACCCACGTCG CCGTGGCGGG CGTCGAGAAG ATCATTCCGA CGTTCGAGGA CCTCCAGCCG TTCGTCGAAC TGATCGCGCG CTCGGGGACG GGACAGGACA TCACGTCCTA CGTCTCGCTG TTCTCGCCGC CGGTCTCGAC GCCGCCGGTC GACTTCGACG ACGACGGGCC GATCGCCGAC GACTCGGCCG ACCGGGAGTT CCACCTCGTC TTGCTGGACA ACGGCCGAAT GGACATGCGC GAGGACGACC AGCTCCGGGA GACCCTGTAC TGCATCCGCT GCGGTGCCTG CTCGAACTCG TGTGCGAACT TCCAGTCGGT CGGCGGCCAC GCCTTCGGCG GCGAGACCTA CTCCGGGGGC ATCGCGACGG GCTGGGAGGC CGGCGTCCAC GGTCAGGAGT CGGCCGACGA GTTCAACGAC CTCTGTACCG GCTGCTCGCG GTGTGTCAAC CAGTGTCCGG TGAAGATCGA CATCCCGTGG ATCAACACGG TCGTCCGCGA TCGGCGCAAT CGCGGGGCCG AGGACGGTCG ACTCGACTTC CTGGTCGAGG GGCTCACGCC GGACGAAGAG CCGGCCGGAA TGGACCTGCA AAAGCGCTTC TTCGGCAACT TCGCGACGCT ATCGAAACTC GGCTCCGCGA CCGCGCCCGT GTCGAACTGG GTCGCGGACA CGCTTCCTTC GCGGCTGGCG ATGGAACGCG TCCTCGGGAT CGATCGCCGC CGCGACCTGC CCGAGTTCGA TCGCGAGACG TTCGTCGAGT GGTTCCGGAA CCGCGACGTT CCACGGCCCG TCGACGCCGA CTACCACGCC GTCGTCTATC CCGACCTCTA CACGAACTAC ATCCGGACCG ACCGCGGGAA GGCGACGGTG CGAACGCTCG AGGCGCTGGG CGTCGCGGTC GACGTCCCCG ACGTCGCCTC CTCCGGCCGC GCGCCGCTCT CGCAGGGGAT GATCGCCACC GCGGAAGACC ACGCTCGCGA GGTTTCCGCG GATCTCGAGC CCTACCTCGA GGCGGGGTAC GACGTCGTCG CCGTCGAGCC CAGCGACCTC GCGATGTTCC GCGGCGAGTA CGAGCGCCTC CTCGACGAGC GGCGCTACCG GGCCCTCGCC GAGCGCAGCT ACGAGGTCTT CGAGTATATT TACGGCCTGC TCGAGAACGG GGTCGATCCC GCGCCGTTGG GACACGCCAG CGACGGGGAC GGCGCTGGCT CCGCCCTCGC GTACCACTCC CACTGCCAGC AGCGGACGCT CGAACTCGAG GCGTACACGA CGAACGTCCT CGAACGACTG GGGTACGACG TCCTCGAGAG CGACGTCGAG TGTTGTGGCA TGGCGGGGAG TTTCGGCTAC AAGCGCGAGT ACTACGACCT GAGCGTGGAT GTCGGCGAGC GACTGGGCGA ACAGTTCGAG GCGCCCGACA CCGCCGATCG AACGGTCGTG GCGAGCGGGA CGTCGTGTCT CGAGCAGTTG GACGGCCTCC TCGCTCGGCA GCCGCGGCAT CCGATCTCGC TCGTCGAACC GGACGGGTCC GAGTGA
|
Protein sequence | MSSERSRKAE RIRQIMATEG DSVERNARGF NEGRYESVAR LEDYDAYKDR ARAIKADAIE RLPELIERVR ETVEENGGTV YVAEDAADAN RYVRELARER AAETVVKSKS MTTEEIDLNE ALAAEGCDVW ETDLGEFVLQ VADEAPSHLV APAIHQSRAE IAALFNEYFD PDTELETAEE LTAFAREYLG ERIEDADIGV TGANFVTADT GTMALVTSEG NARKTVAVPD THVAVAGVEK IIPTFEDLQP FVELIARSGT GQDITSYVSL FSPPVSTPPV DFDDDGPIAD DSADREFHLV LLDNGRMDMR EDDQLRETLY CIRCGACSNS CANFQSVGGH AFGGETYSGG IATGWEAGVH GQESADEFND LCTGCSRCVN QCPVKIDIPW INTVVRDRRN RGAEDGRLDF LVEGLTPDEE PAGMDLQKRF FGNFATLSKL GSATAPVSNW VADTLPSRLA MERVLGIDRR RDLPEFDRET FVEWFRNRDV PRPVDADYHA VVYPDLYTNY IRTDRGKATV RTLEALGVAV DVPDVASSGR APLSQGMIAT AEDHAREVSA DLEPYLEAGY DVVAVEPSDL AMFRGEYERL LDERRYRALA ERSYEVFEYI YGLLENGVDP APLGHASDGD GAGSALAYHS HCQQRTLELE AYTTNVLERL GYDVLESDVE CCGMAGSFGY KREYYDLSVD VGERLGEQFE APDTADRTVV ASGTSCLEQL DGLLARQPRH PISLVEPDGS E
|
| |