Gene Htur_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1541 
Symbol 
ID8742132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1600217 
End bp1603432 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content66% 
IMG OID646512117 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003403100 
Protein GI284164821 
COG category[R] General function prediction only 
COG ID[COG3413] Predicted DNA binding protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGA CTCCATCGAC CGATAGCGGA GGGGCCAACG AGCGCGCCGG CCTCGGTATC 
CGCACCGAAC TGGGCGACGG CGAACGATTC GTCCGGACCG CGATCGACGC CGCGCCGATC
GATGCGATCG TCGTCGACGG CGAGGGGACC GTCGTGTTCG CCACCGAATC GGTCGCGGAC
GTGCTCGGCT ACTCGCCGGA CGAGCTCGCG GACGAACCGT TCGCGGACTT CGTCGTCGAC
GCGGACGGGT CGGCTATCGC GCCCGCTGAC GATACCCGCG ACCGACTGGA CGGGCGCCTG
CGTTGCGCGG ACGGCGAGAC GGTCCGCGCC CGCATCGACG TTCGCGGGTT CGAGTTCGAC
GGCGAACGGT ACTATACCGC GGTGATTCGC GATTCGTCGG CGCGCGAGCC GGAGGAGCGA
ACGATCGACC GCTACGACGC CATCACCGGG ATCCCGGAGT ACGGCGTCTA TCACTTGGAT
CGCGACGGGC GCTTCGAGAT GGTCAACGAC ACGATCGTCG ACGCGCTCGG CTACTCTCGA
GACGAACTGC TCGGCGAGCA CGCGTCGGCA GTCGTCGACG AGGACGACCT CCCGGAGTGT
CGGAGCGCCG TCGAGGGACT GCTCACTGAC GGGGAACCGC GGACGGTCAC CGTCGAGTTC
GCCGCGCACA CCGCCGACGG TGACGCGATC CCCTGCGAGG CTCGCGTGAC CGCGATCGAG
CCCGAAGGGA CGGACTGCGG CACGGTCGGC ATCGTCCGCG ACGTCTCCGA TCGGAAGGAA
CGCGCCGAGG AACTCAGGCA AGAACGGGGG TTGAACGAAC ACGTCCTCGA GACGAGTCCC
GTCGGCATCG GCGTGATCAC GCCCGACGGC GATATCTCCC GCGTCAACGA CCGGGCCGAG
GCGCTACTGG GGTTGACCAT GGAGGAACTC TCCGACCAGA CGTTCGACGT CTCCCAGCGG
AAGCTGTACG ATTCGGAGGG CCATCGGATC TCGCCGGAAG ACCTGCTGAG CCGGGTGTTC
GACGACCACG AAGAGGTTAT CGACACCGAA TTCGCCCTCG AGCGACCCGA CGGCGACCAG
GTCTGGGCCT CCATCAGCAT CGCACCGATG ACGGACGCCG CCGGCGACGT CGAGAAGGCG
GCGATCATCG CGACGGACAT CACGGATCGG AAGGAGCGCG AGGAGACGCT CCGGGAGGAA
CGCGACGTCA TCGAGCACAT CCTCGAGACG AGCCCCGTCG GTATCGGCGT GATCACGGCC
GACGGCGACA TCTCCCGCGT CAACGACCGG GCCGAGGAGC TGTTGGGGCT GACGATCGGG
GAGATCACGA ATCAGACGCT CGACGTCACT CAGCGCAGGT TCTACGGCGC GAGCGGCCAG
CAGGTACAGC CCGAGGACCT GCTGAGCCGG GTGTTCGAGG ACCGCGAACA CGTTCTCAAC
TCCGAGTTCA GACTGGAGCG TCCCGACGGG GAGCGCGTCT GGACGGCGCT CAGCATCGCG
CCCATCGAGA ACCAGGGCGG CGACGTCGAG AAAGCGGTCG TCATCGCGAC GGACATCTCC
GACCGCAAGG AGCGCGAGAA GCGGTTGCGA GAGAGCGAGG CGCGACTCCG TCAGATCGCC
GAGAACATCA ACAGCGCCAT CTGGATGGCG GACGCGGACC TGAGCGAGAT CCTGTACATC
AATCCCGCCT ACGAGAACAT CACCGGCCGC TCGCGGGACT CCGTGTACGA CAACCTGATG
AACCACCTCG ACGACGTCCA CCCGCAGGAC CGACACCGCG TCGAAACGGC GATGCAGGAA
GTGACGCAAA CGCCCCGGAA CGACGGCACC GCGATCCGGT TTCAGGAGAA GTACCGCATC
GTCCAGCCCG ACAGCAGCAT CCGGTGGGTC ACCAGCTTCG CGTTCCCGCT GCAGAACGAC
GACGGCGACG TGTACCGGTT CGTCGGCGTC ATCGACGACA TCACCGAGGT GAAAGAACAG
CAGCTGGAAC TCGGCCGTCA GCGCGACGAA CTGGAGACGC TCAACCAGAT CAATACGGTC
ATCCGACGCA TCAACCAGGG GGTAGTGCAG GCGGCCGACC GAGCGGCGAT CGAACGAGAG
GTCTGCGAGA CGCTCACCGA CTCGAAACTG TATCACGCGG CGTGGACCGG CGAAGTCGAT
ACCGGGACCC GCGAGGTCAC CCCGAAGACG GACGACGGGC TCGAGACGGC GTCGATCGAT
CGGTCGTTCG ACATCGACGC GGTCGACGCG ATCTCGATGG CGGTCGAATC GGGCGATATC
CAGCTCATCC GGAACGTCGC GGCGCTCCCC GAGGAACTTC CCGTGACCGA CGCCAGTCCC
AACGGTGCCT TCGAGAGCGA GCACTCGTCC GCAGCGGTGA TTCCGCTGAT CTACAAGGAG
ACGGTCTACG ACGTGCTCGT CGCGTACTCG TCGCGCGCGA ACGCGTTCAG CGTCAGGGAG
CAGGCCGTGC TGTTCGAACT CGGCAAGACG ATCGGACTCG CGATCAACGC CGTCGAACGG
AAGGCGGCGC TGCTCACCGA CGCGGTCGTC GAACTCGAGT TCGAGATCCG CGATCCGGAC
GTGTTCTTCG TGAGCGCGTC GGACGAGCTC GGGGTCGAAT TCGAAATGGA GGGCATCACG
TCGCAGTCCG ACGGGACGTA TCTCCAGTAT TTCACCGTCA CTGGGTGCAA GCCCGACCGC
GTGCTCGAAC GCGCCGGCGA CGAGCCCGGC ATCGAGCGCG CTCGCATCGT CGCCGAAGAC
GAGGACGAGA ACGGGGCGCT CGTCGAGTTC ATCGTCGGTG ACTCGTCGCT GGCGACCGCG
CTCGCCGAAT ACGGGGGCAC TGTCCGGTCC GCACGGTTCG CGGAGGGACG TGGGACCTGC
GTCTCGGCGT TCTCTCAGAC GGCCGACGTC AGGGAGGTCG TCGAGGCCGC CCGGTCGACG
TTCGCTCGGA CGGAACTCGT CGGCAAGCGG GAACGCGAGC GATCGGTCCA CACCGGTCGG
GAGTTCCGGA CCGCACTCGA GGAGTTGCTC ACGGAACGCC AGCGGACGGT ACTCGAGACG
GCGTACTACG CGGGCTACTT CGAGTGGCCG CGGGACAGCT CCGGCGAGAA CGTGGCCGAT
TCGCTCGACG TCGCACCGGC GACGTTCCAC CAGCACATTC GCGAGGGGGT TCAGAAGCTG
GTCGAAACGC TGATCGAAGG CGCCGCGGCC GCGTGA
 
Protein sequence
MTETPSTDSG GANERAGLGI RTELGDGERF VRTAIDAAPI DAIVVDGEGT VVFATESVAD 
VLGYSPDELA DEPFADFVVD ADGSAIAPAD DTRDRLDGRL RCADGETVRA RIDVRGFEFD
GERYYTAVIR DSSAREPEER TIDRYDAITG IPEYGVYHLD RDGRFEMVND TIVDALGYSR
DELLGEHASA VVDEDDLPEC RSAVEGLLTD GEPRTVTVEF AAHTADGDAI PCEARVTAIE
PEGTDCGTVG IVRDVSDRKE RAEELRQERG LNEHVLETSP VGIGVITPDG DISRVNDRAE
ALLGLTMEEL SDQTFDVSQR KLYDSEGHRI SPEDLLSRVF DDHEEVIDTE FALERPDGDQ
VWASISIAPM TDAAGDVEKA AIIATDITDR KEREETLREE RDVIEHILET SPVGIGVITA
DGDISRVNDR AEELLGLTIG EITNQTLDVT QRRFYGASGQ QVQPEDLLSR VFEDREHVLN
SEFRLERPDG ERVWTALSIA PIENQGGDVE KAVVIATDIS DRKEREKRLR ESEARLRQIA
ENINSAIWMA DADLSEILYI NPAYENITGR SRDSVYDNLM NHLDDVHPQD RHRVETAMQE
VTQTPRNDGT AIRFQEKYRI VQPDSSIRWV TSFAFPLQND DGDVYRFVGV IDDITEVKEQ
QLELGRQRDE LETLNQINTV IRRINQGVVQ AADRAAIERE VCETLTDSKL YHAAWTGEVD
TGTREVTPKT DDGLETASID RSFDIDAVDA ISMAVESGDI QLIRNVAALP EELPVTDASP
NGAFESEHSS AAVIPLIYKE TVYDVLVAYS SRANAFSVRE QAVLFELGKT IGLAINAVER
KAALLTDAVV ELEFEIRDPD VFFVSASDEL GVEFEMEGIT SQSDGTYLQY FTVTGCKPDR
VLERAGDEPG IERARIVAED EDENGALVEF IVGDSSLATA LAEYGGTVRS ARFAEGRGTC
VSAFSQTADV REVVEAARST FARTELVGKR ERERSVHTGR EFRTALEELL TERQRTVLET
AYYAGYFEWP RDSSGENVAD SLDVAPATFH QHIREGVQKL VETLIEGAAA A