Gene Information |
Locus tag | Htur_5264 |
Symbol | |
ID | 8745812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 167773 |
End bp | 170994 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646515621 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003406568 |
Protein GI | 284176291 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
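The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information table (locus tag Htur_5264).
START_BP = 167773
END_BP = 170994
REPORTED_GENE_LENGTH = 3222      # bp
REPORTED_PROTEIN_LENGTH = 1073   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions both comparisons hold for this record: 170994 - 167773 + 1 = 3222 bp, and 3222 / 3 - 1 = 1073 aa.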
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.255684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG GGACCGCGAC CGTCGCCGGC GACGTCCTTC GGGTGCTCGT CGTCGGCGAC GCGCGGCGGG TCGACGCCGC GACGGACGCG CTCTCCTCGC AGCTCGAGTC GATCTCGATC GTTAGGGAAC GGACGCTCGC GACCGCCCTC GAGCGGCTCG CGCAACTCGC GATCCACTGC GTCGTCTGTC CGTTCGAGAC GGGGGCCGAT CCGTCGCCGC TCGCGGCCGT CCGCGACCGG GACGGCGAGG TGCCAGTCGT CGCCGTCGTC GACGGTGCGG CCGCCGACGG ACGCGCCGCC GAAGCGGCCC TCGAGGCGGG CGCGACCGAC GTCGTCGAGG CCGATGATCC GCCGTCGCTC GTCGCGACGC GGGTCAGGAA CCTGGCCGAT CGCTACCGTC TCGAGACCGC TCCGGAGCGC CGCGACGGGT CGGTACTCGA GCGCTCCGAC GCGCTCGTCT GGGTGGTCGA CGCGGACGGC GACCTCGAGA CCGTCAGTTC GGCGGTCGAA CCGCGACTGG GGTACACGCC GACCGAACTC GAGCGGACGC CGTTGACGCG GCTCGTCCAC CCCGAAGACC GCGAGTCGGC GACCGACCTC CTCGAGACCG CGGCGGCGAC CGCGTTCGGA ACGACCGAGC GCGGGACCGT CCGGATCGGC CACGCCGACG GGACCTGGCG CGTCTACGAC CTGCGCTGTA CCAACCGGCT CGGTGACCGC GACGTCGACG GGCTCGTCTG CACGCTCGAG CCCGCGTCGG TCCGCGAATC GGACGGCCCC GCTCGACGGG CACTCGACCG GTTCGACGAG GCGGTGTTCT CCCTCGGGCC GGCCTGGGAG CTCCGCTACG CCAACGCGGC CGCCGACCGA CTGTTCGACG CCGACGGATC GGCCGAGCCC GGAACCATCG TCTGGGACCT CCTCGACGAC GCCGTCCGCG GCCGGTTCGC CGAACGGTTC CAGGAGGCGG CCGCGACGGA GCAAGTCGTC ACCTTCGAGA CTCCGTATTC GTCGCTCGAG AGCCGGCTGT CGGTGTCCGT CCATCCCGGC GCGAACGGCG TCACCGTGTA CGCGCGCGAG GCGGACCCCG CGGCGTCTCC CGTCGACCGC GAGCGGCTCG ACCTCCTCGA GTCGGTCGTC GACGCCGTCG AGGACGGACT CGTCGTGCTC GAGGGGTCGA CGATCCGCTT CGCCAGCGCC GGTCTCTTCG AGTCCGCCGA CGCGGAGCCG CTGGTCGGCC GGGAACTCGA CGCGCTCTTC GACGACGCGC TCGCCGCGGC GGTCCGCGAG CGGGCGTCGG CGACGGTCGC CAGGTGGATG GAGCCGCTCT CGGGGACGCT CGCCCTCGAC GGACGAGCGG TCGACGTCTT CGTGACGCCG CTCTCGGACG ACCGGGTCCT CTGTGTCGTC CGCGACAGGC GCCGTTCGGC GGCGGCCGCG CTGTCGACCG TCGGCGAGAC GGTCGCGACG ATCCGGGCCG CCGACTCGCC GGGCGCCGTT CGGCGGGCGA CCGTCGACGC GGCGCTGACC TGCGCGGGCG CCGACCTCGC CGCGTGGTAC CTCCGCGAGG ACGACCGCCT CAGGCCGGCG GCGGTGGAGA CGGCGTCGAC CGCCGGCTCG GTCGACCTGC CGCCGATCGA TCCCGCCGAG ACCGAGCTGC TCGAGCGCCT CGCCGAGGCC GAGACCGCGA CCGAGGACGA GCCCGGATTC GAGACCGATA CCGACGCGGC CGGACCCGCC GTCGCATTCG ACCGGTCGGA ACTCGAGTCC GTGCTCGCGA ACGCCGGAAT CCGTGCCGAA 
CGGGTCGTCG CCGTTCCGGT CGGCGACCGC GGCGTGGTGC TCGCGACGAG CACCGAGCCG ATGGCCTTCG GGGAGCGCGA CCGACTCCCG CTCGAGACCG TCGTCGCCGC GGCCGCGACG GCCCTCGAGG CCCTCGAGGG CGCGGCGGCG GTGCGATCGT GTCGGACGGA CCTCGAACGC CTCGAGTACG TCGTCGACCG CTGTCGCCGG CTCCGCGAGA TCGAGCGGAC GCTGCTCGCC GGCGAGACGC GCCGCGAGAT CGAGTCCTCG CTCTGCGAGG CGCTCGTCTC CCTCTCGCTC GACGAGGAAC CCGGGGCGAT CGATCTGGCC TGGATCGGTG ACGTCTCGGC CGGCTCCGAC CACATTACGC CCGACGCCTG GGCCGGGCGG AACGGCGACG CGATCGAGTC GATGTCGGTT CCGATGGACG GGGACGACGA GTCGACGCAT CCGACCGCGA GAGCGGCGAC GGCGCTCGAA CCGACCGCCG CTGTGGATAT CGACGCCGAC GATCACGCGG ACGAGACGAC CGGCGCGTGG GACCGCCGGA CCGCCGAGCG CGAGTTCCGA TCAGCGCTGA GCGTCCCGCT GGCGATCGAC GACTTCTGTT ACGGGACCCT CACCGTCTAC GCGGAGCAGC CGGTGGCGTT CGACGACGCC ACGCGAGCGG TCTGTACCCA TCTCGCGGCG GTCGCCAGTC ACGCGATCGC CGCCGTCGAG CGCAAACGAG CGCTGCTCTC CGAGCGCGTC ACCGAACTCG AGATCGTCCT GCAGGGGGCC GACGAGCCGC TGTCGGCGGT CGCCCACCGA CTCGAGCGCC GACTCGACGT CGAGGCCGTC GTCCCGCGCT CCTCGGCCGG TTCGACGGTG TTCTGTACCG CGACCGACGT CACCGAGGAC GCGCTCCGGG CGGCGGTCGA ACCGGTGTCG GGCGTCGAGT CCGGACGGCT CGTCGGCGAG CGGCCGGACG CGTCGCTGCT CGAACTCGTC CTCACGACGT CGACGCTCGC GACGACCCTC GCCGAGCACG GCGGCGTGTT GCGCTCCGTC GTTCCGGTCG ACGATCGCAC CCGACTCGTC GTCGACCTCT CGAGCACGGT CGACGTCCGG TCGTTCGTCG GCCTGATCGA GCGCCGCCAA CCGGGGGCGA ATCTGGTCGC CCGACGCGAA CGCGACCGAT CGGTTCAGCC CGCCCGCGCG TTCGACACCG AACTCCGCGC GCGGCTCTCG GAGCGACAGC TCCGCACCCT CGAGACCGCC TACTATGGCG GCTTCTTCGA GTGGCCCCGC GAGAGCACCG GCGAGGAGAT CGCCGATTCG CTCGGCGTCT CCCAGCCGAC GTTCAGCCGC CACCTGCGGC TGGCCCAGCG GAAGGTCTTC GCGTTGCTGT TCGACGAGCG ACCCGACGCT GCCGAGGAAT AG
|
Protein sequence | MSDGTATVAG DVLRVLVVGD ARRVDAATDA LSSQLESISI VRERTLATAL ERLAQLAIHC VVCPFETGAD PSPLAAVRDR DGEVPVVAVV DGAAADGRAA EAALEAGATD VVEADDPPSL VATRVRNLAD RYRLETAPER RDGSVLERSD ALVWVVDADG DLETVSSAVE PRLGYTPTEL ERTPLTRLVH PEDRESATDL LETAAATAFG TTERGTVRIG HADGTWRVYD LRCTNRLGDR DVDGLVCTLE PASVRESDGP ARRALDRFDE AVFSLGPAWE LRYANAAADR LFDADGSAEP GTIVWDLLDD AVRGRFAERF QEAAATEQVV TFETPYSSLE SRLSVSVHPG ANGVTVYARE ADPAASPVDR ERLDLLESVV DAVEDGLVVL EGSTIRFASA GLFESADAEP LVGRELDALF DDALAAAVRE RASATVARWM EPLSGTLALD GRAVDVFVTP LSDDRVLCVV RDRRRSAAAA LSTVGETVAT IRAADSPGAV RRATVDAALT CAGADLAAWY LREDDRLRPA AVETASTAGS VDLPPIDPAE TELLERLAEA ETATEDEPGF ETDTDAAGPA VAFDRSELES VLANAGIRAE RVVAVPVGDR GVVLATSTEP MAFGERDRLP LETVVAAAAT ALEALEGAAA VRSCRTDLER LEYVVDRCRR LREIERTLLA GETRREIESS LCEALVSLSL DEEPGAIDLA WIGDVSAGSD HITPDAWAGR NGDAIESMSV PMDGDDESTH PTARAATALE PTAAVDIDAD DHADETTGAW DRRTAEREFR SALSVPLAID DFCYGTLTVY AEQPVAFDDA TRAVCTHLAA VASHAIAAVE RKRALLSERV TELEIVLQGA DEPLSAVAHR LERRLDVEAV VPRSSAGSTV FCTATDVTED ALRAAVEPVS GVESGRLVGE RPDASLLELV LTTSTLATTL AEHGGVLRSV VPVDDRTRLV VDLSSTVDVR SFVGLIERRQ PGANLVARRE RDRSVQPARA FDTELRARLS ERQLRTLETA YYGGFFEWPR ESTGEEIADS LGVSQPTFSR HLRLAQRKVF ALLFDERPDA AEE
|
| |