Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5243 |
Symbol | |
ID | 8745791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 139293 |
End bp | 140294 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646515600 |
Product | hypothetical protein |
Protein accession | YP_003406547 |
Protein GI | 284176270 |
COG category | [R] General function prediction only |
COG ID | [COG4026] Uncharacterized protein containing TOPRIM domain, potential nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.372333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCGT CAGCACCTGA CACGGACGCA TCGCCGTCGA ACAACCCTGA CCAGACCGCC CACAACGGCG CGTTCGAACG ACTTCGCGAG CGCCTCGAGT CCCTCGAGGC CGAACTCGAG CGCAAGGACG ACCGCATCGC GGACCTCGAG CACGACCGCG ATCGGCTCGC GGCGACCGTC GACGAACTCG TGAAGCGAAA CGCGGACCTC GAAGCGCGCA CCGATGATCT CGAGGACGAG GCCGCTGTTC TCGAGGAACG AACCACCGAC CTCGCTACCG AGACCACCGA ACTCGAGGGC CTCACCGAAG CCGCGTGCAA CAAGGCCAAC GCCAACAAGG AGCGCGTCGC GGAGCTCCAG TCCCGCGAAC TCGAGAAGGG CGCCCACCTC GAGACCGACA ACGTCGACGA GTGCGCGGTC ACCGTCGCCG ACGGCCGCCT CGAGCGCATC GCGAAGGACG ACGGGCACGC CTACTACCGC CTCCCCGAGA GCGCCGACCC GCTCGAGCGC GGCGGCGACG TCTCGCTGGC CTTCGGCGAC CTGCTCCCGA TCCAGCAGCT CGCGCGGATG GACGACGACC GCCGCCGCGC CGCGGCGAAC TCGATGCCCA CCAGGCTCGC CGCGAAGCTC TGGCGGGCCC GGACCGACCC GAGCGTCGGC GACGATCCGT GGGAGCGCGG CTGCAAGGAC GTCGCCGAGT ACGTGAAAGC CAGCGACCTC AGACACTGGA TCCGCCGCCG GGAGCCGGGC ATCTCCGAGA GCTACGCGAA GAAGCTCGTC TCGCGGACGA TCGACGCCGC CCTGGACCTC TCGAAGAACC GGCTCGCGGT GCGCCGGCAG ACCGAACGCA AGAACGGCCT CGAGTACACC GAACGACGGC TGCTCCTCCC CGCGGACGCC TCGATCCCCG GCGCGGGGAG CCGCGACGGC GCGACGGCGA CCCGAGACGA GGCCGGCGAG TCGGATCGAC CGGACCCGGA GACAACTGGC GTCCACGGCT AG
|
Protein sequence | MPSSAPDTDA SPSNNPDQTA HNGAFERLRE RLESLEAELE RKDDRIADLE HDRDRLAATV DELVKRNADL EARTDDLEDE AAVLEERTTD LATETTELEG LTEAACNKAN ANKERVAELQ SRELEKGAHL ETDNVDECAV TVADGRLERI AKDDGHAYYR LPESADPLER GGDVSLAFGD LLPIQQLARM DDDRRRAAAN SMPTRLAAKL WRARTDPSVG DDPWERGCKD VAEYVKASDL RHWIRRREPG ISESYAKKLV SRTIDAALDL SKNRLAVRRQ TERKNGLEYT ERRLLLPADA SIPGAGSRDG ATATRDEAGE SDRPDPETTG VHG
|
| |