Gene Htur_1939 details


Gene Information

Locus tag: Htur_1939
Symbol:
ID: 8742537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2010997
End bp: 2012760
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 73%
IMG OID: 646512521
Product: protein of unknown function DUF181
Protein accession: YP_003403497
Protein GI: 284165218
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
            [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
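
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2010997
END_BP = 2012760


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # prints "1764 587"
```

Under these assumptions the computed values agree with the listed Gene Length (1764 bp) and Protein Length (587 aa).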


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTAC ACGTCGTCGG TGACGATCCG GTCCGCGAAG CGGTCGTCAC CGCGCTGGGA 
GACGTCGACG TCACCGTCGA AGACGCGACA GCCGACGACC TCGAGGACGC CCGCTTCGCG
GTCGTCAGCG ACGTGGCCGG CGCGGCCACG TTCCGGCGAG CGAACGCCGC CGCGCGCACC
GGCGGCACGC CGTGGATCGC CGTCGAGGTC GGCGGCGTCG GCGGCCAGCC ACTCGCCGGC
GTCGACTCGG CCGTCTCGGG CTTCGCGCCC GCGACGGGCT GTTTCGACTG TCTCCACCAG
CGCGTCGCCG CGAACCTCGA GGAGGGCGAG CGAAGCGACC GACCGCAGGC CGACCGCAGC
ACGGCCCGTC TCGCCGGCGC GCTCGCGGGC CGGGAGTGCG TCCGGGTCCT GTCGGGCGAC
GACCGGTCGG TGATCGGCCA CGTCGTCGAA CTGCCCCATG CGAGGCGGCG GGTCCTCCCC
GTACCCGGCT GTGAGTGTCA GAACGAACCG CGGGATCGGA CCCTCGAGCG GGACGACGAC
GCGCTCGCCC TCGATGCGGC CGTCGAGCAC GCCGAGGAGA CGATCGACGA CCGCGTCGGG
ATCGTCAGGA CCATCGGCGA GATCGAATCG TTCCCCGCGC CCTACTACCT CGCGACGACG
ACCGACACGC AGGGGTTCAG CGACGCGAGC GCGCCGACGC AGGCGGCCGG CGTCGCCGAC
GACTGGAACG CGGCGCTGAT GAAAGCCGTC GGCGAGGGCC TCGAGCGCTA CTGCGCCGGC
GTCTACCGGG ACAGCGAGTT CGTTCACGCA AGCGAGGACG ACCTCGAGAA CGCGGTGTCC
CCGACGGATC TCGTGCGGCC GGACGACGCG CCCGCCTACG ACGCGAGTGA CGAGCACCGC
TGGGTACCGG GCGAGAATCT CTCGACCGGC GACGACGTCC ACCTGCCGGC CGCGGCGGTC
CAGTTCCCTC AGCCCGGCGA GGCGCTCGTT CCCGGCATCA CGACGGGACT CGGACTCGGC
TCGTCGACCG TGGACGCCCT GCTGTCGGGC CTGACCGAGG TCATCGAGCG GGACGCGACG
ATGCTCGCGT GGTACTCGAC GTTCGAGCCC CTCGGGCTGA CCGTCGACGA CGACGGCTTC
GGCGTCCTCG AACGCCGCGC CCGCGGCGAG GGGCTGTCGG TGACACCGCT GTTGGTCACG
CAGGACGTCG ACGTCCCCAT CGTCGCCGTC GCCGTCCACC GCGATCCGGA CGCTCTCGAG
GGGGCCGTCG AGCCGACCGC CGACGAGTGG CCGGCGTTCG CCGTCGGCTC CGCCGCGGAC
CTCGACGCCA CGGCCGCTGC CCGGTCGGCC CTCGAGGAGG CGCTGCAGAA CTGGATGGAG
ATCCGGAACC TCGGCCCCGA AGACGTTTCC GACGCTTCGG GCGCCATCGG GGAGTACGCG
GCGTTCCCCG AGGCCGTTCG CGGGTTCGTC GACGTCGAGC GGACGGTTCC AGCCGAAAGC
GTCGGTCCGG AGACCGCGCC AGAGGGACGG GCGGCACTGA CGGAACTCCT CGAGCGGACG
ACCGACGCCG ACCTGACGCC GTACGCGGCG CGGCTGACGA CCCGAGACGT CGCGGAGACC
GGCTTCGAGG CGGCCAGAGT CGTCGTCCCC GGCGCCCAAC CGCTGTTCAC CGGCGAACCG
TTCTTCGGCG AGCGGGCGCG GACGGTTCCG GCGGACCTCG GCTTCGAACC GCGCCTCGAG
CGCGCGTTCC ATCCCTACCC CTGA
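
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; the helper names `normalize` and `gc_fraction` are hypothetical, and only the first and last rows of the record are used here, not the full gene.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence shown above (not the full gene).
first_row = "ATGAACGTAC ACGTCGTCGG TGACGATCCG GTCCGCGAAG CGGTCGTCAC CGCGCTGGGA"
last_row = "CGCGCGTTCC ATCCCTACCC CTGA"

print(normalize(first_row)[:3])   # first codon of the gene
print(normalize(last_row)[-3:])   # last codon of the gene
print(f"{gc_fraction(normalize(first_row)):.0%}")  # GC of the first row only
```

Passing all rows of the gene sequence to `normalize` and then to `gc_fraction` would give a value to compare with the GC content field listed in the record.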
 
Protein sequence
MNVHVVGDDP VREAVVTALG DVDVTVEDAT ADDLEDARFA VVSDVAGAAT FRRANAAART 
GGTPWIAVEV GGVGGQPLAG VDSAVSGFAP ATGCFDCLHQ RVAANLEEGE RSDRPQADRS
TARLAGALAG RECVRVLSGD DRSVIGHVVE LPHARRRVLP VPGCECQNEP RDRTLERDDD
ALALDAAVEH AEETIDDRVG IVRTIGEIES FPAPYYLATT TDTQGFSDAS APTQAAGVAD
DWNAALMKAV GEGLERYCAG VYRDSEFVHA SEDDLENAVS PTDLVRPDDA PAYDASDEHR
WVPGENLSTG DDVHLPAAAV QFPQPGEALV PGITTGLGLG SSTVDALLSG LTEVIERDAT
MLAWYSTFEP LGLTVDDDGF GVLERRARGE GLSVTPLLVT QDVDVPIVAV AVHRDPDALE
GAVEPTADEW PAFAVGSAAD LDATAAARSA LEEALQNWME IRNLGPEDVS DASGAIGEYA
AFPEAVRGFV DVERTVPAES VGPETAPEGR AALTELLERT TDADLTPYAA RLTTRDVAET
GFEAARVVVP GAQPLFTGEP FFGERARTVP ADLGFEPRLE RAFHPYP