Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1541 |
Symbol | |
ID | 8742132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1600217 |
End bp | 1603432 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646512117 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003403100 |
Protein GI | 284164821 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA CTCCATCGAC CGATAGCGGA GGGGCCAACG AGCGCGCCGG CCTCGGTATC CGCACCGAAC TGGGCGACGG CGAACGATTC GTCCGGACCG CGATCGACGC CGCGCCGATC GATGCGATCG TCGTCGACGG CGAGGGGACC GTCGTGTTCG CCACCGAATC GGTCGCGGAC GTGCTCGGCT ACTCGCCGGA CGAGCTCGCG GACGAACCGT TCGCGGACTT CGTCGTCGAC GCGGACGGGT CGGCTATCGC GCCCGCTGAC GATACCCGCG ACCGACTGGA CGGGCGCCTG CGTTGCGCGG ACGGCGAGAC GGTCCGCGCC CGCATCGACG TTCGCGGGTT CGAGTTCGAC GGCGAACGGT ACTATACCGC GGTGATTCGC GATTCGTCGG CGCGCGAGCC GGAGGAGCGA ACGATCGACC GCTACGACGC CATCACCGGG ATCCCGGAGT ACGGCGTCTA TCACTTGGAT CGCGACGGGC GCTTCGAGAT GGTCAACGAC ACGATCGTCG ACGCGCTCGG CTACTCTCGA GACGAACTGC TCGGCGAGCA CGCGTCGGCA GTCGTCGACG AGGACGACCT CCCGGAGTGT CGGAGCGCCG TCGAGGGACT GCTCACTGAC GGGGAACCGC GGACGGTCAC CGTCGAGTTC GCCGCGCACA CCGCCGACGG TGACGCGATC CCCTGCGAGG CTCGCGTGAC CGCGATCGAG CCCGAAGGGA CGGACTGCGG CACGGTCGGC ATCGTCCGCG ACGTCTCCGA TCGGAAGGAA CGCGCCGAGG AACTCAGGCA AGAACGGGGG TTGAACGAAC ACGTCCTCGA GACGAGTCCC GTCGGCATCG GCGTGATCAC GCCCGACGGC GATATCTCCC GCGTCAACGA CCGGGCCGAG GCGCTACTGG GGTTGACCAT GGAGGAACTC TCCGACCAGA CGTTCGACGT CTCCCAGCGG AAGCTGTACG ATTCGGAGGG CCATCGGATC TCGCCGGAAG ACCTGCTGAG CCGGGTGTTC GACGACCACG AAGAGGTTAT CGACACCGAA TTCGCCCTCG AGCGACCCGA CGGCGACCAG GTCTGGGCCT CCATCAGCAT CGCACCGATG ACGGACGCCG CCGGCGACGT CGAGAAGGCG GCGATCATCG CGACGGACAT CACGGATCGG AAGGAGCGCG AGGAGACGCT CCGGGAGGAA CGCGACGTCA TCGAGCACAT CCTCGAGACG AGCCCCGTCG GTATCGGCGT GATCACGGCC GACGGCGACA TCTCCCGCGT CAACGACCGG GCCGAGGAGC TGTTGGGGCT GACGATCGGG GAGATCACGA ATCAGACGCT CGACGTCACT CAGCGCAGGT TCTACGGCGC GAGCGGCCAG CAGGTACAGC CCGAGGACCT GCTGAGCCGG GTGTTCGAGG ACCGCGAACA CGTTCTCAAC TCCGAGTTCA GACTGGAGCG TCCCGACGGG GAGCGCGTCT GGACGGCGCT CAGCATCGCG CCCATCGAGA ACCAGGGCGG CGACGTCGAG AAAGCGGTCG TCATCGCGAC GGACATCTCC GACCGCAAGG AGCGCGAGAA GCGGTTGCGA GAGAGCGAGG CGCGACTCCG TCAGATCGCC GAGAACATCA ACAGCGCCAT CTGGATGGCG GACGCGGACC TGAGCGAGAT CCTGTACATC AATCCCGCCT ACGAGAACAT CACCGGCCGC TCGCGGGACT CCGTGTACGA CAACCTGATG AACCACCTCG ACGACGTCCA CCCGCAGGAC CGACACCGCG TCGAAACGGC GATGCAGGAA GTGACGCAAA CGCCCCGGAA CGACGGCACC GCGATCCGGT TTCAGGAGAA GTACCGCATC GTCCAGCCCG ACAGCAGCAT CCGGTGGGTC ACCAGCTTCG CGTTCCCGCT GCAGAACGAC GACGGCGACG TGTACCGGTT CGTCGGCGTC ATCGACGACA TCACCGAGGT GAAAGAACAG CAGCTGGAAC TCGGCCGTCA GCGCGACGAA CTGGAGACGC TCAACCAGAT CAATACGGTC ATCCGACGCA TCAACCAGGG GGTAGTGCAG GCGGCCGACC GAGCGGCGAT CGAACGAGAG GTCTGCGAGA CGCTCACCGA CTCGAAACTG TATCACGCGG CGTGGACCGG CGAAGTCGAT ACCGGGACCC GCGAGGTCAC CCCGAAGACG GACGACGGGC TCGAGACGGC GTCGATCGAT CGGTCGTTCG ACATCGACGC GGTCGACGCG ATCTCGATGG CGGTCGAATC GGGCGATATC CAGCTCATCC GGAACGTCGC GGCGCTCCCC GAGGAACTTC CCGTGACCGA CGCCAGTCCC AACGGTGCCT TCGAGAGCGA GCACTCGTCC GCAGCGGTGA TTCCGCTGAT CTACAAGGAG ACGGTCTACG ACGTGCTCGT CGCGTACTCG TCGCGCGCGA ACGCGTTCAG CGTCAGGGAG CAGGCCGTGC TGTTCGAACT CGGCAAGACG ATCGGACTCG CGATCAACGC CGTCGAACGG AAGGCGGCGC TGCTCACCGA CGCGGTCGTC GAACTCGAGT TCGAGATCCG CGATCCGGAC GTGTTCTTCG TGAGCGCGTC GGACGAGCTC GGGGTCGAAT TCGAAATGGA GGGCATCACG TCGCAGTCCG ACGGGACGTA TCTCCAGTAT TTCACCGTCA CTGGGTGCAA GCCCGACCGC GTGCTCGAAC GCGCCGGCGA CGAGCCCGGC ATCGAGCGCG CTCGCATCGT CGCCGAAGAC GAGGACGAGA ACGGGGCGCT CGTCGAGTTC ATCGTCGGTG ACTCGTCGCT GGCGACCGCG CTCGCCGAAT ACGGGGGCAC TGTCCGGTCC GCACGGTTCG CGGAGGGACG TGGGACCTGC GTCTCGGCGT TCTCTCAGAC GGCCGACGTC AGGGAGGTCG TCGAGGCCGC CCGGTCGACG TTCGCTCGGA CGGAACTCGT CGGCAAGCGG GAACGCGAGC GATCGGTCCA CACCGGTCGG GAGTTCCGGA CCGCACTCGA GGAGTTGCTC ACGGAACGCC AGCGGACGGT ACTCGAGACG GCGTACTACG CGGGCTACTT CGAGTGGCCG CGGGACAGCT CCGGCGAGAA CGTGGCCGAT TCGCTCGACG TCGCACCGGC GACGTTCCAC CAGCACATTC GCGAGGGGGT TCAGAAGCTG GTCGAAACGC TGATCGAAGG CGCCGCGGCC GCGTGA
|
Protein sequence | MTETPSTDSG GANERAGLGI RTELGDGERF VRTAIDAAPI DAIVVDGEGT VVFATESVAD VLGYSPDELA DEPFADFVVD ADGSAIAPAD DTRDRLDGRL RCADGETVRA RIDVRGFEFD GERYYTAVIR DSSAREPEER TIDRYDAITG IPEYGVYHLD RDGRFEMVND TIVDALGYSR DELLGEHASA VVDEDDLPEC RSAVEGLLTD GEPRTVTVEF AAHTADGDAI PCEARVTAIE PEGTDCGTVG IVRDVSDRKE RAEELRQERG LNEHVLETSP VGIGVITPDG DISRVNDRAE ALLGLTMEEL SDQTFDVSQR KLYDSEGHRI SPEDLLSRVF DDHEEVIDTE FALERPDGDQ VWASISIAPM TDAAGDVEKA AIIATDITDR KEREETLREE RDVIEHILET SPVGIGVITA DGDISRVNDR AEELLGLTIG EITNQTLDVT QRRFYGASGQ QVQPEDLLSR VFEDREHVLN SEFRLERPDG ERVWTALSIA PIENQGGDVE KAVVIATDIS DRKEREKRLR ESEARLRQIA ENINSAIWMA DADLSEILYI NPAYENITGR SRDSVYDNLM NHLDDVHPQD RHRVETAMQE VTQTPRNDGT AIRFQEKYRI VQPDSSIRWV TSFAFPLQND DGDVYRFVGV IDDITEVKEQ QLELGRQRDE LETLNQINTV IRRINQGVVQ AADRAAIERE VCETLTDSKL YHAAWTGEVD TGTREVTPKT DDGLETASID RSFDIDAVDA ISMAVESGDI QLIRNVAALP EELPVTDASP NGAFESEHSS AAVIPLIYKE TVYDVLVAYS SRANAFSVRE QAVLFELGKT IGLAINAVER KAALLTDAVV ELEFEIRDPD VFFVSASDEL GVEFEMEGIT SQSDGTYLQY FTVTGCKPDR VLERAGDEPG IERARIVAED EDENGALVEF IVGDSSLATA LAEYGGTVRS ARFAEGRGTC VSAFSQTADV REVVEAARST FARTELVGKR ERERSVHTGR EFRTALEELL TERQRTVLET AYYAGYFEWP RDSSGENVAD SLDVAPATFH QHIREGVQKL VETLIEGAAA A
|
| |