Gene Htur_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1083 
Symbol 
ID8741671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1132496 
End bp1135417 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content66% 
IMG OID646511662 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_003402648 
Protein GI284164369 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACG AGGTCGAGCG AAACGACCGT CGAAGCGCTT CGGATCGGGA GACGGAATTC 
GAGGCGCGAA GTCGGGGTTC GACGTTTCGC GGACCGACCG GGCCGGTCGG CGACCACGAC
CACTCGGACG ACCACCTCGC GCTTCTGTAC GGGAGTCGCG ACGAGCAGTT CGACGCCGTG
ATTCCGTTCG TCCGGCGGGG ACTCGAGGCC GACGAGCGGT GTCTGTACCT GGCTGACGAG
ACCGACCGGG ACGAGATTTT CGAAGCGCTT CGCGACGACG GCGTCGACGT CGACGAGGCG
CTGGAATCCG ACGCGCTCGT GATCCGTGAC ACCGAAGACG CCTATCTACG GGACGGGTCG
TTAGATCTCG AGGCGTCGCT GGATCTGCTC GAGACGTTCG TCGAAGAGTC GACCGCCGAT
TCCGAGGGGG CCCGGGTGAC TGCCGAGGAG ACGTGGCTGT TGCGAGCGGC CGAGGAGGCC
GACGAGTTCA TGGCGCTGGA AGCGCGGGTG AACGAGCGCC TCGGCGGCGA GGACTGCGCC
GTGCTCTGTC AGTACGACTG CGAGCGGTTT CCCGCGCACG TCCTCGAGGA CGTCATCAAG
ACGCACCCCT ATCTCGTCAC CGACAGGACG ATTTCGGAGA ATTTCTACTA CACGCCCCCC
GAGACGTTCT TCGACGGCGA GGAACCAGCG ACGACGGTCG ACCGGATGAT TCGAACGGTA
CGCGAGCGTA CCGACGCGAA GACGGCGGTC CGCGAACACA GGGACTATCT CCGGGACCTC
TACGAGGCGA CGGCGAACGC CGACCTGACC TTCGAGGAAC GGGTCGAACG ACTGCTGGAA
CTGGGCTGTG AGCGGTTCGA TCTCAGGGGC GGCGCGCTGG CCCACCTCCC GACGTGGGAC
GATAACTTCC GTGCGGAAGT GACGGTCGGC CCCGACATGG GCGACCTCGA GGGCGAGTTA
CCCATCCAGC CCACCGAGGG AAACTTCTGC CGGCAGGCGA TCGACTGGGA CGAGCCGACC
GCCGTCCCCG ACGTCGTCGC CGCCGGCTGG GACGACGATC CCGTCTTCGA GGAGTTCGGC
TTCGCGACCT ACTTCGGCAT CCGCGTCACC GCCGGCACCG AGCCCTACGG GACGTTCTGG
TTCTACGATA CGGAGCCCCG AGACCGCCCG TTCACCGAGG CCGAGCGGAC GTTCCTCGAG
CTGATGGGCC AGTGGATCAG CACCGAACTC GAGCGCCGCC GTCGCGAGGA GTTCCTCCGC
ACGAGCTACG AGATCACGTC GGATCCCGAC CTCACCTTCG CGGCGAAGAT CGAGCGGCTG
CTCGAGCACG GCCGGAAGTG GTTCGGCTGC GACGTCGGCT ACTTCACCGC CGTCGACGCC
GAGACCGATC GCTTCGAGAT CGTCGAAGCC GTCGGCTCGC ACGACCGGAT TCGGACGGGC
GGCGGCGGCT CGCTGTCGGG AACGTACTGC AAGAAGGTCG TCGAGGCGGG CGAGTCGATC
AGCGTCGCCG ACGCCGTCGA CGCGGGCTGG GAGGGCGACC GCGCCTACGA CACGTACGGG
CTCGACGCCT TCCTCGGGAC GATGCTGGAG GTCGACGGCG AACGATTCGG AACGCTGTGT
TTCGGCTCGG AGACGCCCCG AGAGGGATCG TTCACCGAGA CGGAGTACAC GTTCATCGAC
CTCATCAGCC AGTGGGTCAG CACCGAACTC GAGCGCCGGC GGGACGAGCG GACCCAGCGC
GAACTGTACG AGATCACCGC CGATCCGGAT CGGTCGTTCG ACGAGCAGCT CCTGGCGGTC
CTGGACCTGG GCTGTGAGCG GTTCGACATG GAACTGGGCG GCATCGCGAC GGTGGACCCG
GCGACCGATC GGTTCGAGGT CGAGACCACG AACGGCGACC ACGAGTACCT CACGCCGGGT
AAGCCGTATC CCCTCTCAGA GACGTACTGC CAGGCGCCCG TGGACGAGGA GGGGACCTGT
ACGATCACCG ATCCCGTCGA ACGGGGATAC GACGGCAAAC TGTGCTACGA GCGGTTCGGC
GTCCGGGCGT ATCTCGGCAC TCACCTCGAA ATCGAGGGCA GCCCCGATCG GACATTCTGG
TTCGTCTCGA CCGAGTCCCG CGAGGAGTTC TCGGAGGCCG AACGCACGTT ACACCACCTG
ATGGGCCAGT GGGTGAAGTA CGAACTCGAC CGCCAGCAGT ACGAACGAGA CCTAGAGGAG
ACGGTCGAGC GACTCCAGCA GTCCAACGAC CGGCTCAAGC AGTTCGCCTA CGCCGCCAGC
CACGACCTGC AGGAACCGCT GCGGATGGTC TCGAGCTATC TACAACTACT CGATAACCGG
TACAAAGGCG AACTAGACGA AGAGGCCCGG GAGTTCATCG ACTTCGCGGT CGACGGCGCC
GATCGGATGC GCGAGATGAT CGACGATCTG CTGGCCTTCT CGCGGGTCGA ACACGCCGAC
GGCGAGTTCG AGCCGGTCGA CTGTACCGAG GTGCTAGACC GCGTCCAGGA CGATCTGCAG
GTCCGGATCG CGGAGACCGA CGCCGAGATC CTCGTCGATT CGCTGCCGAC GGTCAGCGCC
GACGTCGAAC AGTTCGAGCA GCTGTTCAGC AATCTCGTCT CGAACGGGAT CAAGTACAAC
GAGAGCGCGG TGCCCCGAGT CGAGGTGTCC GCCGCGGACC GGGACGACCG CTGGGAGTTC
GCGGTCGCCG ACAACGGGAT TGGCATCGAG TCGGCGAAGA CCGACCGCAT CTTCGAGGTG
TTCAAGCGCC TCCACCACGA CGACGAGTAT CCGGGCACCG GGATCGGCCT CTCGCTGTGC
CAGGAGATCG CCGACAATCA CGGCGGGGAC ATCCGGGTCG AGTCCGAGCC CGGCGCGGGA
TCGACGTTCT ACATCACGCT CCCGAAACGG AACTTCGAGT GA
 
Protein sequence
MNNEVERNDR RSASDRETEF EARSRGSTFR GPTGPVGDHD HSDDHLALLY GSRDEQFDAV 
IPFVRRGLEA DERCLYLADE TDRDEIFEAL RDDGVDVDEA LESDALVIRD TEDAYLRDGS
LDLEASLDLL ETFVEESTAD SEGARVTAEE TWLLRAAEEA DEFMALEARV NERLGGEDCA
VLCQYDCERF PAHVLEDVIK THPYLVTDRT ISENFYYTPP ETFFDGEEPA TTVDRMIRTV
RERTDAKTAV REHRDYLRDL YEATANADLT FEERVERLLE LGCERFDLRG GALAHLPTWD
DNFRAEVTVG PDMGDLEGEL PIQPTEGNFC RQAIDWDEPT AVPDVVAAGW DDDPVFEEFG
FATYFGIRVT AGTEPYGTFW FYDTEPRDRP FTEAERTFLE LMGQWISTEL ERRRREEFLR
TSYEITSDPD LTFAAKIERL LEHGRKWFGC DVGYFTAVDA ETDRFEIVEA VGSHDRIRTG
GGGSLSGTYC KKVVEAGESI SVADAVDAGW EGDRAYDTYG LDAFLGTMLE VDGERFGTLC
FGSETPREGS FTETEYTFID LISQWVSTEL ERRRDERTQR ELYEITADPD RSFDEQLLAV
LDLGCERFDM ELGGIATVDP ATDRFEVETT NGDHEYLTPG KPYPLSETYC QAPVDEEGTC
TITDPVERGY DGKLCYERFG VRAYLGTHLE IEGSPDRTFW FVSTESREEF SEAERTLHHL
MGQWVKYELD RQQYERDLEE TVERLQQSND RLKQFAYAAS HDLQEPLRMV SSYLQLLDNR
YKGELDEEAR EFIDFAVDGA DRMREMIDDL LAFSRVEHAD GEFEPVDCTE VLDRVQDDLQ
VRIAETDAEI LVDSLPTVSA DVEQFEQLFS NLVSNGIKYN ESAVPRVEVS AADRDDRWEF
AVADNGIGIE SAKTDRIFEV FKRLHHDDEY PGTGIGLSLC QEIADNHGGD IRVESEPGAG
STFYITLPKR NFE