Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1939 |
Symbol | |
ID | 8742537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2010997 |
End bp | 2012760 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646512521 |
Product | protein of unknown function DUF181 |
Protein accession | YP_003403497 |
Protein GI | 284165218 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTAC ACGTCGTCGG TGACGATCCG GTCCGCGAAG CGGTCGTCAC CGCGCTGGGA GACGTCGACG TCACCGTCGA AGACGCGACA GCCGACGACC TCGAGGACGC CCGCTTCGCG GTCGTCAGCG ACGTGGCCGG CGCGGCCACG TTCCGGCGAG CGAACGCCGC CGCGCGCACC GGCGGCACGC CGTGGATCGC CGTCGAGGTC GGCGGCGTCG GCGGCCAGCC ACTCGCCGGC GTCGACTCGG CCGTCTCGGG CTTCGCGCCC GCGACGGGCT GTTTCGACTG TCTCCACCAG CGCGTCGCCG CGAACCTCGA GGAGGGCGAG CGAAGCGACC GACCGCAGGC CGACCGCAGC ACGGCCCGTC TCGCCGGCGC GCTCGCGGGC CGGGAGTGCG TCCGGGTCCT GTCGGGCGAC GACCGGTCGG TGATCGGCCA CGTCGTCGAA CTGCCCCATG CGAGGCGGCG GGTCCTCCCC GTACCCGGCT GTGAGTGTCA GAACGAACCG CGGGATCGGA CCCTCGAGCG GGACGACGAC GCGCTCGCCC TCGATGCGGC CGTCGAGCAC GCCGAGGAGA CGATCGACGA CCGCGTCGGG ATCGTCAGGA CCATCGGCGA GATCGAATCG TTCCCCGCGC CCTACTACCT CGCGACGACG ACCGACACGC AGGGGTTCAG CGACGCGAGC GCGCCGACGC AGGCGGCCGG CGTCGCCGAC GACTGGAACG CGGCGCTGAT GAAAGCCGTC GGCGAGGGCC TCGAGCGCTA CTGCGCCGGC GTCTACCGGG ACAGCGAGTT CGTTCACGCA AGCGAGGACG ACCTCGAGAA CGCGGTGTCC CCGACGGATC TCGTGCGGCC GGACGACGCG CCCGCCTACG ACGCGAGTGA CGAGCACCGC TGGGTACCGG GCGAGAATCT CTCGACCGGC GACGACGTCC ACCTGCCGGC CGCGGCGGTC CAGTTCCCTC AGCCCGGCGA GGCGCTCGTT CCCGGCATCA CGACGGGACT CGGACTCGGC TCGTCGACCG TGGACGCCCT GCTGTCGGGC CTGACCGAGG TCATCGAGCG GGACGCGACG ATGCTCGCGT GGTACTCGAC GTTCGAGCCC CTCGGGCTGA CCGTCGACGA CGACGGCTTC GGCGTCCTCG AACGCCGCGC CCGCGGCGAG GGGCTGTCGG TGACACCGCT GTTGGTCACG CAGGACGTCG ACGTCCCCAT CGTCGCCGTC GCCGTCCACC GCGATCCGGA CGCTCTCGAG GGGGCCGTCG AGCCGACCGC CGACGAGTGG CCGGCGTTCG CCGTCGGCTC CGCCGCGGAC CTCGACGCCA CGGCCGCTGC CCGGTCGGCC CTCGAGGAGG CGCTGCAGAA CTGGATGGAG ATCCGGAACC TCGGCCCCGA AGACGTTTCC GACGCTTCGG GCGCCATCGG GGAGTACGCG GCGTTCCCCG AGGCCGTTCG CGGGTTCGTC GACGTCGAGC GGACGGTTCC AGCCGAAAGC GTCGGTCCGG AGACCGCGCC AGAGGGACGG GCGGCACTGA CGGAACTCCT CGAGCGGACG ACCGACGCCG ACCTGACGCC GTACGCGGCG CGGCTGACGA CCCGAGACGT CGCGGAGACC GGCTTCGAGG CGGCCAGAGT CGTCGTCCCC GGCGCCCAAC CGCTGTTCAC CGGCGAACCG TTCTTCGGCG AGCGGGCGCG GACGGTTCCG GCGGACCTCG GCTTCGAACC GCGCCTCGAG CGCGCGTTCC ATCCCTACCC CTGA
|
Protein sequence | MNVHVVGDDP VREAVVTALG DVDVTVEDAT ADDLEDARFA VVSDVAGAAT FRRANAAART GGTPWIAVEV GGVGGQPLAG VDSAVSGFAP ATGCFDCLHQ RVAANLEEGE RSDRPQADRS TARLAGALAG RECVRVLSGD DRSVIGHVVE LPHARRRVLP VPGCECQNEP RDRTLERDDD ALALDAAVEH AEETIDDRVG IVRTIGEIES FPAPYYLATT TDTQGFSDAS APTQAAGVAD DWNAALMKAV GEGLERYCAG VYRDSEFVHA SEDDLENAVS PTDLVRPDDA PAYDASDEHR WVPGENLSTG DDVHLPAAAV QFPQPGEALV PGITTGLGLG SSTVDALLSG LTEVIERDAT MLAWYSTFEP LGLTVDDDGF GVLERRARGE GLSVTPLLVT QDVDVPIVAV AVHRDPDALE GAVEPTADEW PAFAVGSAAD LDATAAARSA LEEALQNWME IRNLGPEDVS DASGAIGEYA AFPEAVRGFV DVERTVPAES VGPETAPEGR AALTELLERT TDADLTPYAA RLTTRDVAET GFEAARVVVP GAQPLFTGEP FFGERARTVP ADLGFEPRLE RAFHPYP
|
| |