Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2128 |
Symbol | |
ID | 8742728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2198080 |
End bp | 2201259 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646512710 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003403684 |
Protein GI | 284165405 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGCG GCCTCGAGGT TCCCGAATCG ACCCCGATAC GAGCCCTCGC GGTCGGGACG TCGACGTGGT TGCAACGGGC GACGGCGGAG CTCGACGCCG ACGACGAGAT CACGATTCGC GGGCCGCTCG ATCCGGCGAC CGAACTCGAA GACGCCGCCC TCAACGAGAC GGACTGCCTG CTCACCGACG ACCGGGCGGT CCTCGAGCGC GTCGACGGGG AGTGTCCGAT CGTCTACGCG CTCGAACCCG CCGAGGACGA GTCGATCGAT CGGCTCCGCG CCGACGGGGC GACGGACGTG ATTCTCAAAG CCGCGGCCGA CCGATCGTCG CTGCTCGCCC ACCGTCTGCG ACGAACCGCC GAGTTCGCCG TCAGTCACAC GGCCGGACGT CAGGAAACGC GATTGCGAGC GCTGCTCGAG CACTCCTCGG ACGTGGTGCT CGTCGTCGAC GACGACGGCA CGGTCTCGGC CGTCGGTCCG TCGGCGGAAG GGATCGCCGG CTACGAGCCC GACGCGCTGG TCGGTTCCCA CTATCTGGAC TTCGTCCACC CGGACGACGT CGCGTCGATC CGAACGGCGT TCGACGACCT GTGTACCAGC GAGCCGGGGA CGACGACGAC CGTCGAGTAC CGCTGTCAGC ACGCCGACGA CGCGTGGTAC GTCCACGAGG CCGTGCTCAC GAATCGGCTG GCCGCCGAGA AACGCGCCGG TGGCGAGTCA GGCGCCGAGA TAGACGGTGT CGTCGCGTCG ATCCGAGACG TGACGGCCGT CCACCGAGTC GAGCGCGAAC TCGAGCGCTC GCTCGAACGG ATCAGCGACG CGTTCTACGC GCTCGATTCG GAGTGGCGGT TCACGTACGT CAACGACCGG GCGGGCTCCC TCCTCGGCGT CGATCCCGCC ACGGTGATCG GTCGCCCCAT CCTCGAGCTG TTTCCGGAGC TCCGAGAGAC GCCGTTCCGG ACGGCCGCGG TCGAGGCGAT GGAAACCCAG GAGCCCACGT CGGTCGAACA CTATTACGAA CCGACCGACC GCTGGTACGA CGTCCGCCTC TATCCGTCGC CGTCCGGACT CTCGGTGTAC TTCCAGGACG TGACCGACCG GGTCGAGCGC GAACGGAAAC TGCAGGACCG GACGGAACAG CTGGAGACGA TCATCCGGAA CGTCCCCGTC GTCCTCTTCG AACTCGACGC CGACGGGACG ATCACGCTGG CGGAGGGGCG CGCGTTCGAG CGCAGCGAGG CGATCGCCGA CGATATCGTC GGCGAGTCGG CGTTCGACGT CTTCGACGAC CATCCCGCGA TCCGCGCCGA CCTCAGAGCG GCCCTCGAGG GCGTGTCGAC CCACTCCTCG ATCGAACTCG ACGACCGAGT TCTCGAACAG TGGTGCCGGC CGATCGTCGA GGACGGCGGG GTCGACCGCG TGATCGGCAT CGCCGCCGAC GTCACCGAAC GCGCCCAGTA TCAGACGGCG TTGAACGCCC TCCACGAGGC GACGAGCCAC CTGCTGACCG TCGAATCGGA GCAGGCCGCC TGCGAGTACG TGGTCGACGT CGCGAACGAC GTTCTCGACC TCGAGACGGT CGTCTATCGG TTCGACGAGC GAGAAAACGA ACTCGTGCCG ACGGCCTACT CGTCCGGCCT CGAGTCGACG GTCGGATCGC CGCCGAGACT CCGGCCCGGC GAGAGCCTCG CCTGGCGGGC GTTCGTCGAC GACAGCCCGG CCCGGTTCGA CGACGTCAGG GACGCTCCGC AGGTGTTCGA CCCGACTACG GACGCTCGCA GCGGTCTCTA CGTCCCGATC GGCGAACACG GCGTCCTCGT GGCTCTCGAT CCCGAACCGG GGCGCTACGA CGAGGAGACG TTCGAACTCG CCACACTGTT CGCCAGAACG GCCGAGGCGG CGCTCGATCG CATCACCCGG ACGCGCCGGC TCCACGGCCA CGAATGGGAG CTCAAGCGCC AGAACGAACA CCTCGGGCGA CTCAACGAGG CCAGCCAGGT TCGCCAGGAT CTCGAGCAGT TGCTCCTGAT GGCCAACTCC CGCGCCGAGA TCGAACGCGG CATCTGCGAG CGCCTCGCCG ACCTCGAGTG CTGTTCGTTC GCCTGGATCG GCGAACCCGA TCCGGGCGGG AATCAGCTGT TACCTCGCCG ACAGGCGGGC CACGAGCGGG AGTATCTCGA CGCGGTCTCG GTGACGACCG TCGACGACTC CGCGACCGAA CCGGCGGGTC GAACGGCGCG GACGCGGTCG CCGGTCCTCG TCGAGAACGT CGCCGATTCG ATCCGCGAAG GGACTTGGCG CGGCGACGCC CTCTCCAGGA GCTTCCAATC GGTCTACGCC GTGCCGCTCG TCTACGACGA CTTCCTCTAT GGGGTCCTGA CGCTGTACAG CGACGACCGA GACGCGTTCG ACGAACCGCT CCGGTCCATG CTCGCCGAAC TCGGCGAAAC CATCGGCTAC TCGATCGACG CCGTCAAACG AAAGTCCCCG CTGGACGGCG AGAGCGTCCC CGTAGTCGAA CTCGAACTCG CTCTCGAGGG ACCGAATCCG CTGGGTCGGC TCGCCGATCG ACTGGACGCG CGCGCCGAAT TCGAGGGCGG GGCGATCCGC GACGACGGGA CGCCGACGGT GTTCGCGGTC GTCGACGAGA CGGATGACGT CGATCCGGCC GCCCTCGCCG AACTTGAGGG AATCGGCGAC GTCTCCGTGA TCGCCGACAC CGACTCGGAG ACGCTGTTGC AACTCCAGTA CACGGGTCCG TTTCTCGGCG CCGCCGTCGA CGCCCACGGC GGGACGCTGC GGTCGCTAGT CGCCGACGAT ACCGGGACGC GAGCGATCGT GGAGGTTCCG GAAACGGTCG AAGTCAGAAA CGTGCTGTCG CAACTCAATC GCCGCGAGTT CGCGGTCTCG CTCGTCGCCC GCCGCGAGGG TTCGACGCGG ACGCGGTCGA TGATCGACGC CGCCGCCCGT AACGCCTTAC TCGAGCAGCT GACCGATAGA CAACGCGAGG TCGTCCAGAC GGCCTACCAC GGCGGCTTCT TCGAGTGGCC CCGGGAAACG ACCGGGGAAT CGATCGCCGA CTCGCTTGGG ATCTCCTCCC CCGCCTTCCA GAAACACGTC CGAGCCACCG AGAGAAAGCT CTTCACCGCG TTGTTCGACG GTCGCTCGGT AGATGGTTAA
|
Protein sequence | MNRGLEVPES TPIRALAVGT STWLQRATAE LDADDEITIR GPLDPATELE DAALNETDCL LTDDRAVLER VDGECPIVYA LEPAEDESID RLRADGATDV ILKAAADRSS LLAHRLRRTA EFAVSHTAGR QETRLRALLE HSSDVVLVVD DDGTVSAVGP SAEGIAGYEP DALVGSHYLD FVHPDDVASI RTAFDDLCTS EPGTTTTVEY RCQHADDAWY VHEAVLTNRL AAEKRAGGES GAEIDGVVAS IRDVTAVHRV ERELERSLER ISDAFYALDS EWRFTYVNDR AGSLLGVDPA TVIGRPILEL FPELRETPFR TAAVEAMETQ EPTSVEHYYE PTDRWYDVRL YPSPSGLSVY FQDVTDRVER ERKLQDRTEQ LETIIRNVPV VLFELDADGT ITLAEGRAFE RSEAIADDIV GESAFDVFDD HPAIRADLRA ALEGVSTHSS IELDDRVLEQ WCRPIVEDGG VDRVIGIAAD VTERAQYQTA LNALHEATSH LLTVESEQAA CEYVVDVAND VLDLETVVYR FDERENELVP TAYSSGLEST VGSPPRLRPG ESLAWRAFVD DSPARFDDVR DAPQVFDPTT DARSGLYVPI GEHGVLVALD PEPGRYDEET FELATLFART AEAALDRITR TRRLHGHEWE LKRQNEHLGR LNEASQVRQD LEQLLLMANS RAEIERGICE RLADLECCSF AWIGEPDPGG NQLLPRRQAG HEREYLDAVS VTTVDDSATE PAGRTARTRS PVLVENVADS IREGTWRGDA LSRSFQSVYA VPLVYDDFLY GVLTLYSDDR DAFDEPLRSM LAELGETIGY SIDAVKRKSP LDGESVPVVE LELALEGPNP LGRLADRLDA RAEFEGGAIR DDGTPTVFAV VDETDDVDPA ALAELEGIGD VSVIADTDSE TLLQLQYTGP FLGAAVDAHG GTLRSLVADD TGTRAIVEVP ETVEVRNVLS QLNRREFAVS LVARREGSTR TRSMIDAAAR NALLEQLTDR QREVVQTAYH GGFFEWPRET TGESIADSLG ISSPAFQKHV RATERKLFTA LFDGRSVDG
|
| |