Gene Htur_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4237 
Symbol 
ID8744865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp506230 
End bp508761 
Gene Length2532 bp 
Protein Length843 aa 
Translation table11 
GC content56% 
IMG OID646514782 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003405729 
Protein GI284167451 
COG category[R] General function prediction only 
COG ID[COG3413] Predicted DNA binding protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.28007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCAT CGGGTTCTTC GGGGGATGTC TATGCTGAAA CGCTCGCCGT CTTCGACCAG 
CGCGACGATC CCTATGAACC ATTCACGACG CCAGAGGTGG CAGACTCGCT CAATACCGCC
AGACGAACTG TCTACAAACG CTTGGAGAAG CTGGTGAACC GCGGTGATCT AAAGACGAAG
AAGGTAGGTG CCAACGCCCG GATCTGGTGG AAATCAACAA GCCCGCCCGG AAATCACACA
CCAAAGAAGG CAAAGCACCA GCGTACCGAG TTTGAATCTA TCGTCGAAAA CGTGCTGGAA
CGAAATACGG ATGGATTCTA CAGCCTCGAC GAAGACCTTC AATTCACGTA TGTAAACACC
CGAACGGAAG AGCTACTGGA CTTGGAGGAG TCCGCTGTTC TTAGTCAGAA CATTCACGAT
ACACCCCTCT TGACTGATGC GTTCGAGGCT GCGCTTCACG AGGCGTCCGA GACCCGGGAG
CCTGTCATTG TAGAAGACTA CTACGCCCCA TTCGAGGTGT GGTTCGAGAA CGCCATTTAC
CCATCAGAAA CGGGTCTGTC GGTCTACTTC CGAGATATTT CGGACCGAAA GCGCTTGGAA
CACGACCTTC AAACGGAGCG AGACCACTTC CGCGTAGTAA TCGAAAATTC ACCGATCGTT
GCATTCCGAA TGGACTCGGA CCTGCGGTAC ACCTGGCTTT ACAACCCGGA CCAGGACTTC
GAAGATGTAG ACGTAACGGG CAAACGCGAT GATGAGCTAT TACCGCCTGA GACGGCAGAA
ATGCTCATGG CCCCGAAGCG AGCCGCTCTC GAAACGGGCG AGCAGGTCCG AGAGGAACGG
ACCTACGAGT TACCGAGTGG TGAGGTCACG TACGATCTCA CGGTTGAACC TGACCGCGAC
GAAACGGGTG AGATCGACGG CATATCGTGT GTCGCCGTGG ATATTACCGA CCAAAAGCAG
CGCGAACGAG ATCTCAAACG CTACGAGCGG ATTGTCGAGA CCGTCCCGGA CGGCGTCTAC
GCCCTGGATT CCGACGACCG GTTCATCCTC ATCAATCAGG CGTTCTGTGA GCTCGTGGGA
TACGACCGCG AGACGCTGCT GGGCGCTCAC TCCACGCTGA TCGAGAACCA GGCAGTGAAC
GATACTGCGA ACGCCCTCCA GGCGGAGATC CAGGCAGGTG AGCGCGATGT CGGCGTGATC
GAAGCAAAGT TCGAAACCGC CACCGGTGAG ACGGTTCCGG TGGAGAGCCG AATCGCCCCA
TTCGAGCACG CCGATGGCCG CGTCGGTCGG TGTGGGGTCG TCCGCGACGT CACCGAGCAG
ATACGACGAG AAGAGGAACT CACCGCGCTG AACCGCCTCA ACTCTGTGTT TCAGGACGTC
ACACACGCTA TTGTTGAGTC GTCCTCACGT GAGGAGATCG AGCAGACGAT CGCAGAGCAC
CTCACGAACT CCGACTCCTA TGAGTACGCC TGGATAGGCC ACCTCGACCG GCACGGCGAA
AGAATCCTCC CACAGATCGT CGGTACGAAC GATGTCGAGC TCCCCGAGAT ACCTCTCTCC
TCCGCGACCG ATAATTCGAC GAGTTACCCA CCGGCCGCTG AAGCAATGCG GACCGGAAGA
GTTCAGGTCA CAGACGATAG CATCGCCGAT CCCGTGCTTA ACCAGTGGAA GCAGTCCCAA
GACATCCGCC ACCGGGCGGG AATTTCAATT CCGATTGCCT ATGAAGAGCG CGTCCACGGC
GTCCTAAATG TCTACACGGC GCGAGAGAAC GCATTCGATG AAAACGAACG CCGCATCGTG
AGACGAATCA GTGAGGTCGT TGGGCACTCA ATCAGTGCAA TCGAACGGAA GCGGGCCCTA
CTGGAAGACC GCGTCCACGA AATCACGTTC CGATCTCACC GGTTTACAGA GACCCTCACC
GATGCTGCTG GTGACGAGTC GTTCACCGTC TCTATCGAGA AGTCCGTTGC GCTCCCTGAC
GACCAGAGCA TCGCATACTA CTCTCTCGAT GGGCTTGATC CGGCGGTTTT CATTGACGTC
ATCGAACAGT ACAATCCCGA CGGTGAGTAC CACGTGACCG ACGAGGAAGG GTCTCGGGCT
CGCGTCGAAG TCCAACAGAG TAACTTGACG CTTGCAGCCG AACTCGCCAA GTACAACAGC
TGGATTGCAG ACGGAACTCT CCAGAACGGG GAGTTTCGAC TCAGAGTCCA AGTGCCACAA
TACGCCCAAG TCCGAGAGGT GAAAGACATC GTTACGGAAG CGTATCCGGA CGTCGAGGTC
CTCGCACAAA TTGAAGTCGA GCGAGAAAGT ACACGTCTGA GTGATGTCTT TTCTGACCTA
GACGACCAAT TGACCGAGCG ACAGCGGACT GCGCTGGAAG TAGCGTACTA CTCGGGGTAT
TTCGATTGGC CACGCGCAAT TACGGGTGAA GAGTTAGCCG AGCGGTTAGA TGTAACTCCA
GGCACTGTCT CCCATCACCT TCGGCACGGA GAACACAAGT TACTGTCCGC ATTTTTTGAC
CTCGCAGAGT AG
 
Protein sequence
MSSSGSSGDV YAETLAVFDQ RDDPYEPFTT PEVADSLNTA RRTVYKRLEK LVNRGDLKTK 
KVGANARIWW KSTSPPGNHT PKKAKHQRTE FESIVENVLE RNTDGFYSLD EDLQFTYVNT
RTEELLDLEE SAVLSQNIHD TPLLTDAFEA ALHEASETRE PVIVEDYYAP FEVWFENAIY
PSETGLSVYF RDISDRKRLE HDLQTERDHF RVVIENSPIV AFRMDSDLRY TWLYNPDQDF
EDVDVTGKRD DELLPPETAE MLMAPKRAAL ETGEQVREER TYELPSGEVT YDLTVEPDRD
ETGEIDGISC VAVDITDQKQ RERDLKRYER IVETVPDGVY ALDSDDRFIL INQAFCELVG
YDRETLLGAH STLIENQAVN DTANALQAEI QAGERDVGVI EAKFETATGE TVPVESRIAP
FEHADGRVGR CGVVRDVTEQ IRREEELTAL NRLNSVFQDV THAIVESSSR EEIEQTIAEH
LTNSDSYEYA WIGHLDRHGE RILPQIVGTN DVELPEIPLS SATDNSTSYP PAAEAMRTGR
VQVTDDSIAD PVLNQWKQSQ DIRHRAGISI PIAYEERVHG VLNVYTAREN AFDENERRIV
RRISEVVGHS ISAIERKRAL LEDRVHEITF RSHRFTETLT DAAGDESFTV SIEKSVALPD
DQSIAYYSLD GLDPAVFIDV IEQYNPDGEY HVTDEEGSRA RVEVQQSNLT LAAELAKYNS
WIADGTLQNG EFRLRVQVPQ YAQVREVKDI VTEAYPDVEV LAQIEVERES TRLSDVFSDL
DDQLTERQRT ALEVAYYSGY FDWPRAITGE ELAERLDVTP GTVSHHLRHG EHKLLSAFFD
LAE