Gene Htur_5264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5264 
Symbol 
ID8745812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp167773 
End bp170994 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content73% 
IMG OID646515621 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003406568 
Protein GI284176291 
COG category[R] General function prediction only 
COG ID[COG3413] Predicted DNA binding protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.255684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG GGACCGCGAC CGTCGCCGGC GACGTCCTTC GGGTGCTCGT CGTCGGCGAC 
GCGCGGCGGG TCGACGCCGC GACGGACGCG CTCTCCTCGC AGCTCGAGTC GATCTCGATC
GTTAGGGAAC GGACGCTCGC GACCGCCCTC GAGCGGCTCG CGCAACTCGC GATCCACTGC
GTCGTCTGTC CGTTCGAGAC GGGGGCCGAT CCGTCGCCGC TCGCGGCCGT CCGCGACCGG
GACGGCGAGG TGCCAGTCGT CGCCGTCGTC GACGGTGCGG CCGCCGACGG ACGCGCCGCC
GAAGCGGCCC TCGAGGCGGG CGCGACCGAC GTCGTCGAGG CCGATGATCC GCCGTCGCTC
GTCGCGACGC GGGTCAGGAA CCTGGCCGAT CGCTACCGTC TCGAGACCGC TCCGGAGCGC
CGCGACGGGT CGGTACTCGA GCGCTCCGAC GCGCTCGTCT GGGTGGTCGA CGCGGACGGC
GACCTCGAGA CCGTCAGTTC GGCGGTCGAA CCGCGACTGG GGTACACGCC GACCGAACTC
GAGCGGACGC CGTTGACGCG GCTCGTCCAC CCCGAAGACC GCGAGTCGGC GACCGACCTC
CTCGAGACCG CGGCGGCGAC CGCGTTCGGA ACGACCGAGC GCGGGACCGT CCGGATCGGC
CACGCCGACG GGACCTGGCG CGTCTACGAC CTGCGCTGTA CCAACCGGCT CGGTGACCGC
GACGTCGACG GGCTCGTCTG CACGCTCGAG CCCGCGTCGG TCCGCGAATC GGACGGCCCC
GCTCGACGGG CACTCGACCG GTTCGACGAG GCGGTGTTCT CCCTCGGGCC GGCCTGGGAG
CTCCGCTACG CCAACGCGGC CGCCGACCGA CTGTTCGACG CCGACGGATC GGCCGAGCCC
GGAACCATCG TCTGGGACCT CCTCGACGAC GCCGTCCGCG GCCGGTTCGC CGAACGGTTC
CAGGAGGCGG CCGCGACGGA GCAAGTCGTC ACCTTCGAGA CTCCGTATTC GTCGCTCGAG
AGCCGGCTGT CGGTGTCCGT CCATCCCGGC GCGAACGGCG TCACCGTGTA CGCGCGCGAG
GCGGACCCCG CGGCGTCTCC CGTCGACCGC GAGCGGCTCG ACCTCCTCGA GTCGGTCGTC
GACGCCGTCG AGGACGGACT CGTCGTGCTC GAGGGGTCGA CGATCCGCTT CGCCAGCGCC
GGTCTCTTCG AGTCCGCCGA CGCGGAGCCG CTGGTCGGCC GGGAACTCGA CGCGCTCTTC
GACGACGCGC TCGCCGCGGC GGTCCGCGAG CGGGCGTCGG CGACGGTCGC CAGGTGGATG
GAGCCGCTCT CGGGGACGCT CGCCCTCGAC GGACGAGCGG TCGACGTCTT CGTGACGCCG
CTCTCGGACG ACCGGGTCCT CTGTGTCGTC CGCGACAGGC GCCGTTCGGC GGCGGCCGCG
CTGTCGACCG TCGGCGAGAC GGTCGCGACG ATCCGGGCCG CCGACTCGCC GGGCGCCGTT
CGGCGGGCGA CCGTCGACGC GGCGCTGACC TGCGCGGGCG CCGACCTCGC CGCGTGGTAC
CTCCGCGAGG ACGACCGCCT CAGGCCGGCG GCGGTGGAGA CGGCGTCGAC CGCCGGCTCG
GTCGACCTGC CGCCGATCGA TCCCGCCGAG ACCGAGCTGC TCGAGCGCCT CGCCGAGGCC
GAGACCGCGA CCGAGGACGA GCCCGGATTC GAGACCGATA CCGACGCGGC CGGACCCGCC
GTCGCATTCG ACCGGTCGGA ACTCGAGTCC GTGCTCGCGA ACGCCGGAAT CCGTGCCGAA
CGGGTCGTCG CCGTTCCGGT CGGCGACCGC GGCGTGGTGC TCGCGACGAG CACCGAGCCG
ATGGCCTTCG GGGAGCGCGA CCGACTCCCG CTCGAGACCG TCGTCGCCGC GGCCGCGACG
GCCCTCGAGG CCCTCGAGGG CGCGGCGGCG GTGCGATCGT GTCGGACGGA CCTCGAACGC
CTCGAGTACG TCGTCGACCG CTGTCGCCGG CTCCGCGAGA TCGAGCGGAC GCTGCTCGCC
GGCGAGACGC GCCGCGAGAT CGAGTCCTCG CTCTGCGAGG CGCTCGTCTC CCTCTCGCTC
GACGAGGAAC CCGGGGCGAT CGATCTGGCC TGGATCGGTG ACGTCTCGGC CGGCTCCGAC
CACATTACGC CCGACGCCTG GGCCGGGCGG AACGGCGACG CGATCGAGTC GATGTCGGTT
CCGATGGACG GGGACGACGA GTCGACGCAT CCGACCGCGA GAGCGGCGAC GGCGCTCGAA
CCGACCGCCG CTGTGGATAT CGACGCCGAC GATCACGCGG ACGAGACGAC CGGCGCGTGG
GACCGCCGGA CCGCCGAGCG CGAGTTCCGA TCAGCGCTGA GCGTCCCGCT GGCGATCGAC
GACTTCTGTT ACGGGACCCT CACCGTCTAC GCGGAGCAGC CGGTGGCGTT CGACGACGCC
ACGCGAGCGG TCTGTACCCA TCTCGCGGCG GTCGCCAGTC ACGCGATCGC CGCCGTCGAG
CGCAAACGAG CGCTGCTCTC CGAGCGCGTC ACCGAACTCG AGATCGTCCT GCAGGGGGCC
GACGAGCCGC TGTCGGCGGT CGCCCACCGA CTCGAGCGCC GACTCGACGT CGAGGCCGTC
GTCCCGCGCT CCTCGGCCGG TTCGACGGTG TTCTGTACCG CGACCGACGT CACCGAGGAC
GCGCTCCGGG CGGCGGTCGA ACCGGTGTCG GGCGTCGAGT CCGGACGGCT CGTCGGCGAG
CGGCCGGACG CGTCGCTGCT CGAACTCGTC CTCACGACGT CGACGCTCGC GACGACCCTC
GCCGAGCACG GCGGCGTGTT GCGCTCCGTC GTTCCGGTCG ACGATCGCAC CCGACTCGTC
GTCGACCTCT CGAGCACGGT CGACGTCCGG TCGTTCGTCG GCCTGATCGA GCGCCGCCAA
CCGGGGGCGA ATCTGGTCGC CCGACGCGAA CGCGACCGAT CGGTTCAGCC CGCCCGCGCG
TTCGACACCG AACTCCGCGC GCGGCTCTCG GAGCGACAGC TCCGCACCCT CGAGACCGCC
TACTATGGCG GCTTCTTCGA GTGGCCCCGC GAGAGCACCG GCGAGGAGAT CGCCGATTCG
CTCGGCGTCT CCCAGCCGAC GTTCAGCCGC CACCTGCGGC TGGCCCAGCG GAAGGTCTTC
GCGTTGCTGT TCGACGAGCG ACCCGACGCT GCCGAGGAAT AG
 
Protein sequence
MSDGTATVAG DVLRVLVVGD ARRVDAATDA LSSQLESISI VRERTLATAL ERLAQLAIHC 
VVCPFETGAD PSPLAAVRDR DGEVPVVAVV DGAAADGRAA EAALEAGATD VVEADDPPSL
VATRVRNLAD RYRLETAPER RDGSVLERSD ALVWVVDADG DLETVSSAVE PRLGYTPTEL
ERTPLTRLVH PEDRESATDL LETAAATAFG TTERGTVRIG HADGTWRVYD LRCTNRLGDR
DVDGLVCTLE PASVRESDGP ARRALDRFDE AVFSLGPAWE LRYANAAADR LFDADGSAEP
GTIVWDLLDD AVRGRFAERF QEAAATEQVV TFETPYSSLE SRLSVSVHPG ANGVTVYARE
ADPAASPVDR ERLDLLESVV DAVEDGLVVL EGSTIRFASA GLFESADAEP LVGRELDALF
DDALAAAVRE RASATVARWM EPLSGTLALD GRAVDVFVTP LSDDRVLCVV RDRRRSAAAA
LSTVGETVAT IRAADSPGAV RRATVDAALT CAGADLAAWY LREDDRLRPA AVETASTAGS
VDLPPIDPAE TELLERLAEA ETATEDEPGF ETDTDAAGPA VAFDRSELES VLANAGIRAE
RVVAVPVGDR GVVLATSTEP MAFGERDRLP LETVVAAAAT ALEALEGAAA VRSCRTDLER
LEYVVDRCRR LREIERTLLA GETRREIESS LCEALVSLSL DEEPGAIDLA WIGDVSAGSD
HITPDAWAGR NGDAIESMSV PMDGDDESTH PTARAATALE PTAAVDIDAD DHADETTGAW
DRRTAEREFR SALSVPLAID DFCYGTLTVY AEQPVAFDDA TRAVCTHLAA VASHAIAAVE
RKRALLSERV TELEIVLQGA DEPLSAVAHR LERRLDVEAV VPRSSAGSTV FCTATDVTED
ALRAAVEPVS GVESGRLVGE RPDASLLELV LTTSTLATTL AEHGGVLRSV VPVDDRTRLV
VDLSSTVDVR SFVGLIERRQ PGANLVARRE RDRSVQPARA FDTELRARLS ERQLRTLETA
YYGGFFEWPR ESTGEEIADS LGVSQPTFSR HLRLAQRKVF ALLFDERPDA AEE