Gene Htur_4080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4080 
Symbol 
ID8744708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp332436 
End bp335681 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content62% 
IMG OID646514641 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003405588 
Protein GI284167310 
COG category[R] General function prediction only 
COG ID[COG3413] Predicted DNA binding protein 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTCGT CCGTACCATC GCCTGCCACG GTCCAAGCGG TCTTCGACCA GCTCGGTCCC 
CCTGGGACCC CCTTCACGAC GCCGGAGATC GCCGCGGAGT TCGACTGTTC CGACCGAACG
ATCTACAATC GACTCGACGC GCTCGTCGAC GAGGGCGTCA TTGAGACGAA GAAAGTCGGC
GCTCGCGGAC GGGTGTGGTG GAGGCCCGTC GACGGCGACA TTCGACGGAA CGGCGGCGCT
TTCAACGAAC GGAACCCCGT CTCATTCCGA GATGAGCAGG CACTATCCTT TCTCTCTGAT
AGCGAGATGG CCGAACGCAT CCGCAAGTTC GAGTGGGCCA AAACGCCGCT TGGCCCGATG
GACGGGTGGC CCCTAGAGTT GCGGGTCGCG GCCGACATCA TGCTGGGGGC GGACGAGGCC
ATCGGTCTCT ACTGGGGAGA GGACCTGACA CTGCTGTACA ATGACGCCTG GCGGGAGTTG
ATCGGTGACA AGCATCCGGA GGCGCTGGGG CGGCCTGCCC AAGAGGTGTT TCCCGAGATC
TGGGAGACGA TCGAGCCGAT GTTCGCTGAC GTGCTGGATG GGAATGGAGT TGGGTTCGAA
CGGGAACAGC GACTGTCGCT GGAACGTGAC GGCCAGATAG AGGACGCGTG GTTCGACTAC
AGCGCCAATC CGATTCTGAT GGCCGACGGC TCCGTCGGTG GCGTCTTCAA CATCGCGAAC
GAAATCACCG AGAGAAAAGA CGCCCAACAA ACCCTGCGCG ACAGAGAGGG GCGACTCAAC
GCATTCGTCA CAAGTACCTC GGAAATCGTC TATCGAATGA GTCCGGACTG GTCAGAGATG
TACTACTTGG ACGGCAAGGA TTTCATCGCC GACACGGAGG ATCCTCGGGA AACGTGGTTG
AAGGAGTACA TTCCACCGGA CGAGCAAGAG CGGGTGATGG CCGCCATTGA GGAGGCCATC
GAGACGAAAA GCATGTTCGA ACTGGAGCAC CAGGTTCACC AGGTCGATGG CACCCGAGGC
TGGACGCACT CACGGGCAGT GCCGATACTG GACGACGACG GCGACATCGT CGAGTGGTTC
GGGACGGCCA GTGACGTCAC CGAGCGAAGG CGCGCTGAAC AGGCTCTTCG TGAGTCAGAG
CAACGCTATC AAAGGCTGTT CGACTCCGTC AACGAGTCCA TCGAAGAGGC CTTTTTCATC
ATCGAGCGGG TGTCCGGCGA GGGCGAGACT GCCGCAAACG GGGACGACCC GGCAGCGTAT
CGCTTCGTGG AAACGAACCC GGTGTTCGAG AAATCCATCG GTGTGACCGA CGTAGTCGGG
AAACAAGTGA GCGATCTCGA TTTCGACGGC GACCTCCCCG GATCGGACGT GTGGGGAGAG
GTGGTCCGGA CAGGCGAATC GAGACGCTTC GAGATCGAGA CCACTGGTGG TCCGCTCGCC
GATGGCTGGT ACGATGTTCG CGTCTTCCCG TATGGCGGAG CGGACAGTCG AGCCGCTGCG
TGTCTCGCCG ACGATATCAC AGACCAAAAG GAAGTCCAAC GGTCTCTTGA ACGGCTCACC
GAGGCAAGCC GGGAACTGAT AGAGGCCGAT CCAGAGATGG TCCACGACCG CGTGGCTGAG
CTCACGATAG CGGTGCTTGA CGTCGAGTAC GCTGCGCTCT GGCGGTATGA CGAGGCAAGC
GGAGACTTGA TCGACGCCAT CAGCACTCTC GACACGGGGA TCGATGCCGA AACCGTCCGA
CACCCGGATG ATGCCTCTGA GCACGTCTGG CAGGCGTTTA TCGACGATGA GACCGCCGTC
ACAAACAACC TCCACGTCGA CGAGACCGTG CCGGACGCAG CGACCCTGCG GAGCCGCCTG
CTCATTCCAC TGGGCAGACA TGGCGTTGTC TGTATCGGAT CCTTCCAGCC GAATGGCTTC
GATGAGCGGA TAGTCGACCT TGCCGAAACT CTCGGGGCGA CGCTCGAAAC GGCGTGGGAC
CGTGCCGACA GCGAACGCCA ATTGCAAGTG CAGAACGAAG AACTGCAACG CCTCGACCGT
CTCAACACCC TCATACGAGA GATCGATCAG GCACTTGTGG GAGCCGATAC CCGCGAGGAG
ATCGATGAGG CAGTCTGCGA ACGACTGGCC AGCTCCGACC TCTACGAGTT CGCGTGGCTC
GGAGAGTACG ATCCGGGGAC CAACCGGATA GAACCGCGTG CGTGGGCTGG CGTCGACAGC
GGCTACGTAG AGGAGCTCAC GATCACGGTC GAGGACACAC CGACCAACCG AGACCCAATC
GCCCGTGCCC TCCGCACGGG CGACTTGCAG GTGGTTGCAG ACATCGCTAC GGATAGCGGC
TTCGCCCCCT GGCGGGAAGC GACGCTCGAA CGCGGCGCCC GGTCGCTGGT CTGCATTCCA
CTCGTTTACG ACGACGCGGC GTACGGCGTG TTGACAGTCT ACGCCGACCG TCCCCAGTCC
GACGAGGACA AACGGAATCG GGACGTGCTG TCGGAACTCG GTGATACGAC CGCCCATGCG
CTCAACGCGA GGGAGACGCG GGCGACGTTG CAGACCGACA GCGTCGTCGA ACTCACGCTC
CGATTCGAGG ACGCTGACAC GCCGCTGTAT CGTCTCTCCC GGGAGACGGA GTATACCATC
GAGCATCAGG GGTTCGTTCC CCGATCGAGC GGGCAGACCG ACGTCTTCTT CATCGTTCGT
GAAATCTCGC CGGAAGATCT CCGGGCCACA GCAGAACGCT CGCTCGCGTT CGAGGACCTA
GACTGTCTCA CCGAGAGGGC CGATGGAGCA CTGTTCAGGG CACGGGTGTC TGAGCCGACG
CTCGCCGCAC GGGTCACTGA CGAGGGGGCT GTCGTGCGTT CGATTACCAT CGATTCCGGG
GTTGCAACCG TTGTTCTCGA TATCCCCCAC ACGGCAGCGG TCCGCGAGTT CCTGAACCGA
CTCCGCCAGT GGCATCCGGA ATTGGAGCTA CGCGCCCGCC AGTCGCGCGA ACGGCCACTG
AAGACCCGGC AAACCTTCGT GACGGCGCTC GAGGACCGCC TGACGGATCG ACAGCGGGAA
GTCCTGCAGA CGGCCTACCT GAGCGGCTTC TTCGAGATGC CACGGGTCAG TAACGGACAG
GAGGTCACAC ACCTGGTCGG CGTCTCACAG CCGACGTTCT CTGAGCACCT GCGTGCCGCT
GAACGTACCC TGTGTGAGGT CCTATTCGAG ACCGAACCGT ATGCCGAGGA TATCGTCTCG
ACGTAG
 
Protein sequence
MGSSVPSPAT VQAVFDQLGP PGTPFTTPEI AAEFDCSDRT IYNRLDALVD EGVIETKKVG 
ARGRVWWRPV DGDIRRNGGA FNERNPVSFR DEQALSFLSD SEMAERIRKF EWAKTPLGPM
DGWPLELRVA ADIMLGADEA IGLYWGEDLT LLYNDAWREL IGDKHPEALG RPAQEVFPEI
WETIEPMFAD VLDGNGVGFE REQRLSLERD GQIEDAWFDY SANPILMADG SVGGVFNIAN
EITERKDAQQ TLRDREGRLN AFVTSTSEIV YRMSPDWSEM YYLDGKDFIA DTEDPRETWL
KEYIPPDEQE RVMAAIEEAI ETKSMFELEH QVHQVDGTRG WTHSRAVPIL DDDGDIVEWF
GTASDVTERR RAEQALRESE QRYQRLFDSV NESIEEAFFI IERVSGEGET AANGDDPAAY
RFVETNPVFE KSIGVTDVVG KQVSDLDFDG DLPGSDVWGE VVRTGESRRF EIETTGGPLA
DGWYDVRVFP YGGADSRAAA CLADDITDQK EVQRSLERLT EASRELIEAD PEMVHDRVAE
LTIAVLDVEY AALWRYDEAS GDLIDAISTL DTGIDAETVR HPDDASEHVW QAFIDDETAV
TNNLHVDETV PDAATLRSRL LIPLGRHGVV CIGSFQPNGF DERIVDLAET LGATLETAWD
RADSERQLQV QNEELQRLDR LNTLIREIDQ ALVGADTREE IDEAVCERLA SSDLYEFAWL
GEYDPGTNRI EPRAWAGVDS GYVEELTITV EDTPTNRDPI ARALRTGDLQ VVADIATDSG
FAPWREATLE RGARSLVCIP LVYDDAAYGV LTVYADRPQS DEDKRNRDVL SELGDTTAHA
LNARETRATL QTDSVVELTL RFEDADTPLY RLSRETEYTI EHQGFVPRSS GQTDVFFIVR
EISPEDLRAT AERSLAFEDL DCLTERADGA LFRARVSEPT LAARVTDEGA VVRSITIDSG
VATVVLDIPH TAAVREFLNR LRQWHPELEL RARQSRERPL KTRQTFVTAL EDRLTDRQRE
VLQTAYLSGF FEMPRVSNGQ EVTHLVGVSQ PTFSEHLRAA ERTLCEVLFE TEPYAEDIVS
T