Gene Information |
Locus tag | Htur_5195 |
Symbol | |
ID | 8745743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 85043 |
End bp | 86896 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646515552 |
Product | conserved hypothetical protein-like protein |
Protein accession | YP_003406499 |
Protein GI | 284176222 |
COG category | [S] Function unknown |
COG ID | [COG3390] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
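The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only. The `gc_percent` helper shows how a GC content figure is usually computed; it is not applied here to the full sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, ASSUMING the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values copied from the record above.
length = gene_length(85043, 86896)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the "Gene Length" (1854 bp) and "Protein Length" (617 aa) fields.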
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGA ACGGCAACGG CGAGGACGAC GAGGAGATTC CGGGACGCGA AGTCGCCTAC CGACTGTTCG CCGCCGAGTA CGACGACGCG TCGTTCTCCT ACGCCGAGAG CGACGAGGAA CGGGCACCAA ACTACGTCAT CTCGCCGACC GGCGCGCGGC TCAACCGCGT CTTCACCGTC GGAACACTCA CCGAGATCAC CGCGGTCAAC GACGAGATGG TCCGTGCCCG CGTCGTCGAT CCGACCGGCG CCTTCGTCGT CTACGCCGGT CAGTACCAGC CCGACGAACT CGCGACGCTC GAGCAACTCG AGCCGCCCGA ATTCGTCGCG GTAACCGGGA AGGCCCGAAC CTTCCAGCCC GACGACTCCG ATCAGGTCTA CACCTCGCTC CGTCCGGAGA GCATCGCGAC GGTCGATGCC GACACCCGGG ACCGCTGGGT CGTCAGCGCG GCCGAGCAGA CTGTCGAGCG CGTCGGCACC TACGCCGCGG CCGCAGAAAG CGACGCGAGT GGAGACGCCC TGACCGACGC GCTCCTCGAG GCCGGCGTCG ACAAGGGCCT CGCGGCGGGG ATCCCGCTCT CTCAGGACCA CTACGGGACG ACGCCGGACT ACCTCGCGGC CCTGCGCGAC TGCGCCCTCG AGGCCGTCGA GGTCGTCGCG GGCGAGCGCG ATCAGGTCGA GGCGTTCTCG CTCGCGCCCG ACGGCTCGGG TCCCGACGCC GACGCCTCGT TCGCGTCGCT GGCCGACCTC GTCGACCTCG ATCTCGGGGG TCTCGAGTCG GCCGCTGCCC CCGAGGCCGA ATCCGCAGCC GAATCCGTGC CCGAGCCGGA ACCGGCCGCC GCCACGGCCG GCACCGGTTC CGCCGCGGGG ACGGCGACGA GCACCGACCG CGAGTCCTCG TCGGCCGACG TCGACGTCGA CTCCGACAGG GCGGACGCCG CCGAGACGGA AACCGAGCCC GAGACCGAGA CGGAGACGGA GCCACCGACG GAGTCGACTC CTGAGAGCGC CGAGCCGGCC GCTGAGACCA CTGAGGCGAC TGCCACCGGT ACTGGGGTGA CGGACGTCGA AGCAGATGCG ACCGCTGACG TCGCCGATTC CGACGAACCG ACTACCTCCG ACTCCGAACC GGCCGAATCG ACCGCCGCGT CCGACGAGAC GGACGTCGGC GACTTCGAGA CGACGAGCGA CTCCACCGAG ACCGATGCCG ACCTCGAGAC CGGCACCGAA GCCGACGACG AGGAACTCGG CGACTTCGAA GCCGACGACG TCGACGACGG CGGGATGTAC GAGATGGACG AGGAGGAGCG CGAACAGCTC GAGGAGGAAT TCGGCGCCGA GTTCACGACC GGCGCGGAGG TCGAGGAACC CGGTGAGGCC GACATCGACG TGCCCGAACC CGACGACGAG CCCGTCGACG AGTTCGAGAC CGAAAGCGAG GCGGCTGCGG ACGCCGCGAC GACCGATGAA CTCGACGCGT CCGACGAAGC TGCAACCAAG TCGGCCGACG ACGACCTCGG TGCGCCGCCG GCGTCGGGAC TCGAGGCCGA CGCGGAACCC GCGGAACCGA ACGAAGACGA GGAACCCGAC GAACCGTCGG AGGGGGGATC CGAAGAGGCC ACGGCCGACG AACCCGCGGA CGAGGAACCC GCCGAGGACG TCGATCTCGA GGAGTACGTC GTCGAAACGA TGGAGGACAT GGACGACGGC GACGGCGCCG ACCGGACCGA ACTCGTCGAG CGAGTCGCCG ACGAGACCGG CGCGTCCGAG GACGAGGTCG AGGACGCGAT CCAGGACGCG 
CTGATGGGCG GGCAGTGTTA CGAGCCCAAC GACGAGACCC TGAAGGCGAT CTGA
|
Protein sequence | MSANGNGEDD EEIPGREVAY RLFAAEYDDA SFSYAESDEE RAPNYVISPT GARLNRVFTV GTLTEITAVN DEMVRARVVD PTGAFVVYAG QYQPDELATL EQLEPPEFVA VTGKARTFQP DDSDQVYTSL RPESIATVDA DTRDRWVVSA AEQTVERVGT YAAAAESDAS GDALTDALLE AGVDKGLAAG IPLSQDHYGT TPDYLAALRD CALEAVEVVA GERDQVEAFS LAPDGSGPDA DASFASLADL VDLDLGGLES AAAPEAESAA ESVPEPEPAA ATAGTGSAAG TATSTDRESS SADVDVDSDR ADAAETETEP ETETETEPPT ESTPESAEPA AETTEATATG TGVTDVEADA TADVADSDEP TTSDSEPAES TAASDETDVG DFETTSDSTE TDADLETGTE ADDEELGDFE ADDVDDGGMY EMDEEEREQL EEEFGAEFTT GAEVEEPGEA DIDVPEPDDE PVDEFETESE AAADAATTDE LDASDEAATK SADDDLGAPP ASGLEADAEP AEPNEDEEPD EPSEGGSEEA TADEPADEEP AEDVDLEEYV VETMEDMDDG DGADRTELVE RVADETGASE DEVEDAIQDA LMGGQCYEPN DETLKAI
|
| |
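The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it assumes the gene sequence is shown on the coding strand (it begins with ATG even though the strand field is "-"), and it builds the codon map from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares. Table 11's alternative start codons are not handled. `CODON_TABLE` and `translate` are names introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAGCGCGA ACGGCAACGG CGAGGACGAC"))
```

Translating the first 30 bases gives the first ten residues of the listed protein sequence, and the final 54 bases translate to its last 17 residues followed by a TGA stop codon.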