Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4691 |
Symbol | |
ID | 8745287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 278985 |
End bp | 281210 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646515195 |
Product | protein of unknown function DUF162 |
Protein accession | YP_003406142 |
Protein GI | 284172760 |
COG category | [C] Energy production and conversion |
COG ID | [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain |
TIGRFAM ID | [TIGR00273] iron-sulfur cluster-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0154904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACG CCGACGGCCG CCGTTCGAAG GCGGCGCACA TCCGCCGCCT GCTCGAGACG GAGGGCGACG CGGTCGAGGA GAACACCATC GGGTTCAACC GGGGCCGATA TGAGTCGGTG GCTGACCTCG AGGACTACGA GGAGCTCAAA TCCGAGGCGC GGGCGATCAA GGAGAACGCC ATCGAGCGGC TGCCGGAGCT GATCGACGAA CTAACCGCGA CCGTCGAGGA CAACGGCGGG ACCGTCTATC TCGCCGACGA CGCCGCCGAC GCCAACCGCT ACATCAGGGA GGTCGCGAGC GAGAAGGACG CCGATCGGCT CGTCAAATCG AAGTCGATGA CCACCGAGGA GCTCGAGGTC AACGAGGCCC TCGAGGCCGA CGGCGTCGAC GTCGTCGAGA CCGACCTCGG CGAGTGGGTG TTACAGGTGG CCGACGAGGC GCCGTCGCAC ATCGTCGCGC CCGCGATCCA CCGATCTGAA GCGGACATCG CCCGGCTGTT CAACGAGCGG TTCAATCCGG AGGAGCCCCT CGAGACCGCC GAGGAGCTGA CCGCGTTCGC CCGCGAGCGG CTGGGCAGGC TCATCCGCGA GGCGGACGTC GGGATGACCG GCGCGAACTT CATCACCGCC GACTCGGGGA CGATCGCGCT CGTCACGAGC GAGGGCAACG CCCGCAAGTC GGCCGTCGTC CCGGACACGC ACGTCGCGGT GGCGGGCGTC GAGAAGATCG TCCCCAGCGT CGAGGATCTG TCCCCGTTCA TCGAGTTGAT CGGGCGCTCG GGCACGGGCC AGGACATCAC CTCCTACATC TCCTTGCTGA CGCCACCGGT CGAGTCGCCC GTCGTCGACG TCGACGAGCC CGACGTCGCG TTCGCGGACC GCGACGACGA CCGGGAGTTC CACCTCGTGT TGATCGACAA CGGCCGCATG GCCATGCGCG AGGACGACCA GCTGCGGGAG ACGCTCTACT GCATTCGGTG TTCGGCCTGC GCCAACTCGT GTGCGAACTT CCAGTCAGTC GGCGGCCACG CCTTCGGCGG CGAGACCTAC TCGGGCGGCA TCGCGACCGG CTGGGAGGCC GGCGTCCACG GCTACGACAG CGCCGCCGAG TTCAACGACT TCTGTACGGG CTGTTCGCGC TGCGTCAACC AGTGTCCGGT GAAGATCGAC ATTCCGTGGA TCAACACCGT CGTCCGCGAC CGGCTCAACC GCGGAGGCGA AGCGGGACAG TTCGAGTTCC TGGTCGAGGG GCTCACGCCC GACGAGGAAC CGGGCGGGAT CGACCTGCAG AAACGACTGT TCGGCAACTA CGAGGCGCTC GCGAAACTCG GGAGCGCGAC CGCGCCCGTC TCCAACTGGC TCGCCGCCGC CGGACCGGCG CGAACGGTCC TCGAGCGAGT CGCCGGCGTC GACAGTCGGC GCGAACTGCC GGAGTTCAAG CGCGAAACGC TTCGAGACTG GTTCGAGAGT CGGGGGTCAC GGGTCGATGC GGCCGACGCG AAACGGGAGG TCGTCCTCTA TCCCGACACC TATACGAACC ACGTCGACGT CGACCGCGGC AAGGCGGCGG TCCGGGTCCT CGAGGCGCTG GACGTTCGCG TTCGGATCCC CGCGGTGCCC GAGAGCGGCC GCGCGCCGCT CTCGCAGGGG ATGATCGACA CTGCGGACGA GCGCGCCAGC CGGGTCTACG CCGCCCTCGC CGAACACATC GATGCGGGCC GGGACGTGGT CGTCGTCGAA CCGTCGGACC TCGCGATGTT CCGCCGGGAG TATGAGAAGC TGCTGCCCGA GGCGTCGTTC GAGCGCCTGC GCGAGGGCAG CTACGAGGTG CTCGAGTACG TCTACGGACT GCTCGAGAAC GGCGCCGACG CCGACGCGCT CGGCGGCGCC GACGCCGAGA TCGCCTACCA CTCCCACTGC CAGCAGCGGA CCCTCGGCCT CGAGCCGTAC ACGACGACGG TCCTCGAGGA GGTCGGCTAC GACGTCCTCG AGAGCGACGT CGAGTGCTGC GGGATGGCCG GCAGCTTCGG CTACAAGTCC GAGTACTACG AGCTGAGCGT GGACGTCGGC GACCGCCTCC GCGAGCAGTT CACCGAGCCC GAGGCGCGGG ATCGGATCGT TGCCGCGAGC GGCACCTCCT GTGAGGACCA GCTCGGCTCG CTGCTCGAGC GCGACGCCGT CCACCCGGTG GAACTGCTCG ATCCGCGGCG GCGAAGCGGT CGCTGA
|
Protein sequence | MSDADGRRSK AAHIRRLLET EGDAVEENTI GFNRGRYESV ADLEDYEELK SEARAIKENA IERLPELIDE LTATVEDNGG TVYLADDAAD ANRYIREVAS EKDADRLVKS KSMTTEELEV NEALEADGVD VVETDLGEWV LQVADEAPSH IVAPAIHRSE ADIARLFNER FNPEEPLETA EELTAFARER LGRLIREADV GMTGANFITA DSGTIALVTS EGNARKSAVV PDTHVAVAGV EKIVPSVEDL SPFIELIGRS GTGQDITSYI SLLTPPVESP VVDVDEPDVA FADRDDDREF HLVLIDNGRM AMREDDQLRE TLYCIRCSAC ANSCANFQSV GGHAFGGETY SGGIATGWEA GVHGYDSAAE FNDFCTGCSR CVNQCPVKID IPWINTVVRD RLNRGGEAGQ FEFLVEGLTP DEEPGGIDLQ KRLFGNYEAL AKLGSATAPV SNWLAAAGPA RTVLERVAGV DSRRELPEFK RETLRDWFES RGSRVDAADA KREVVLYPDT YTNHVDVDRG KAAVRVLEAL DVRVRIPAVP ESGRAPLSQG MIDTADERAS RVYAALAEHI DAGRDVVVVE PSDLAMFRRE YEKLLPEASF ERLREGSYEV LEYVYGLLEN GADADALGGA DAEIAYHSHC QQRTLGLEPY TTTVLEEVGY DVLESDVECC GMAGSFGYKS EYYELSVDVG DRLREQFTEP EARDRIVAAS GTSCEDQLGS LLERDAVHPV ELLDPRRRSG R
|
| |