Gene Htur_2128 details


Gene Information

Locus tag: Htur_2128
Symbol:
ID: 8742728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2198080
End bp: 2201259
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 68%
IMG OID: 646512710
Product: putative PAS/PAC sensor protein
Protein accession: YP_003403684
Protein GI: 284165405
COG category: [R] General function prediction only
COG ID: [COG3413] Predicted DNA binding protein
TIGRFAM ID: [TIGR00229] PAS domain S-box
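
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below is a minimal check in Python. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2198080
end_bp = 2201259

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 3180 1059
```

The gene length is a multiple of three, as expected for an unspliced CDS.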


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGCG GCCTCGAGGT TCCCGAATCG ACCCCGATAC GAGCCCTCGC GGTCGGGACG 
TCGACGTGGT TGCAACGGGC GACGGCGGAG CTCGACGCCG ACGACGAGAT CACGATTCGC
GGGCCGCTCG ATCCGGCGAC CGAACTCGAA GACGCCGCCC TCAACGAGAC GGACTGCCTG
CTCACCGACG ACCGGGCGGT CCTCGAGCGC GTCGACGGGG AGTGTCCGAT CGTCTACGCG
CTCGAACCCG CCGAGGACGA GTCGATCGAT CGGCTCCGCG CCGACGGGGC GACGGACGTG
ATTCTCAAAG CCGCGGCCGA CCGATCGTCG CTGCTCGCCC ACCGTCTGCG ACGAACCGCC
GAGTTCGCCG TCAGTCACAC GGCCGGACGT CAGGAAACGC GATTGCGAGC GCTGCTCGAG
CACTCCTCGG ACGTGGTGCT CGTCGTCGAC GACGACGGCA CGGTCTCGGC CGTCGGTCCG
TCGGCGGAAG GGATCGCCGG CTACGAGCCC GACGCGCTGG TCGGTTCCCA CTATCTGGAC
TTCGTCCACC CGGACGACGT CGCGTCGATC CGAACGGCGT TCGACGACCT GTGTACCAGC
GAGCCGGGGA CGACGACGAC CGTCGAGTAC CGCTGTCAGC ACGCCGACGA CGCGTGGTAC
GTCCACGAGG CCGTGCTCAC GAATCGGCTG GCCGCCGAGA AACGCGCCGG TGGCGAGTCA
GGCGCCGAGA TAGACGGTGT CGTCGCGTCG ATCCGAGACG TGACGGCCGT CCACCGAGTC
GAGCGCGAAC TCGAGCGCTC GCTCGAACGG ATCAGCGACG CGTTCTACGC GCTCGATTCG
GAGTGGCGGT TCACGTACGT CAACGACCGG GCGGGCTCCC TCCTCGGCGT CGATCCCGCC
ACGGTGATCG GTCGCCCCAT CCTCGAGCTG TTTCCGGAGC TCCGAGAGAC GCCGTTCCGG
ACGGCCGCGG TCGAGGCGAT GGAAACCCAG GAGCCCACGT CGGTCGAACA CTATTACGAA
CCGACCGACC GCTGGTACGA CGTCCGCCTC TATCCGTCGC CGTCCGGACT CTCGGTGTAC
TTCCAGGACG TGACCGACCG GGTCGAGCGC GAACGGAAAC TGCAGGACCG GACGGAACAG
CTGGAGACGA TCATCCGGAA CGTCCCCGTC GTCCTCTTCG AACTCGACGC CGACGGGACG
ATCACGCTGG CGGAGGGGCG CGCGTTCGAG CGCAGCGAGG CGATCGCCGA CGATATCGTC
GGCGAGTCGG CGTTCGACGT CTTCGACGAC CATCCCGCGA TCCGCGCCGA CCTCAGAGCG
GCCCTCGAGG GCGTGTCGAC CCACTCCTCG ATCGAACTCG ACGACCGAGT TCTCGAACAG
TGGTGCCGGC CGATCGTCGA GGACGGCGGG GTCGACCGCG TGATCGGCAT CGCCGCCGAC
GTCACCGAAC GCGCCCAGTA TCAGACGGCG TTGAACGCCC TCCACGAGGC GACGAGCCAC
CTGCTGACCG TCGAATCGGA GCAGGCCGCC TGCGAGTACG TGGTCGACGT CGCGAACGAC
GTTCTCGACC TCGAGACGGT CGTCTATCGG TTCGACGAGC GAGAAAACGA ACTCGTGCCG
ACGGCCTACT CGTCCGGCCT CGAGTCGACG GTCGGATCGC CGCCGAGACT CCGGCCCGGC
GAGAGCCTCG CCTGGCGGGC GTTCGTCGAC GACAGCCCGG CCCGGTTCGA CGACGTCAGG
GACGCTCCGC AGGTGTTCGA CCCGACTACG GACGCTCGCA GCGGTCTCTA CGTCCCGATC
GGCGAACACG GCGTCCTCGT GGCTCTCGAT CCCGAACCGG GGCGCTACGA CGAGGAGACG
TTCGAACTCG CCACACTGTT CGCCAGAACG GCCGAGGCGG CGCTCGATCG CATCACCCGG
ACGCGCCGGC TCCACGGCCA CGAATGGGAG CTCAAGCGCC AGAACGAACA CCTCGGGCGA
CTCAACGAGG CCAGCCAGGT TCGCCAGGAT CTCGAGCAGT TGCTCCTGAT GGCCAACTCC
CGCGCCGAGA TCGAACGCGG CATCTGCGAG CGCCTCGCCG ACCTCGAGTG CTGTTCGTTC
GCCTGGATCG GCGAACCCGA TCCGGGCGGG AATCAGCTGT TACCTCGCCG ACAGGCGGGC
CACGAGCGGG AGTATCTCGA CGCGGTCTCG GTGACGACCG TCGACGACTC CGCGACCGAA
CCGGCGGGTC GAACGGCGCG GACGCGGTCG CCGGTCCTCG TCGAGAACGT CGCCGATTCG
ATCCGCGAAG GGACTTGGCG CGGCGACGCC CTCTCCAGGA GCTTCCAATC GGTCTACGCC
GTGCCGCTCG TCTACGACGA CTTCCTCTAT GGGGTCCTGA CGCTGTACAG CGACGACCGA
GACGCGTTCG ACGAACCGCT CCGGTCCATG CTCGCCGAAC TCGGCGAAAC CATCGGCTAC
TCGATCGACG CCGTCAAACG AAAGTCCCCG CTGGACGGCG AGAGCGTCCC CGTAGTCGAA
CTCGAACTCG CTCTCGAGGG ACCGAATCCG CTGGGTCGGC TCGCCGATCG ACTGGACGCG
CGCGCCGAAT TCGAGGGCGG GGCGATCCGC GACGACGGGA CGCCGACGGT GTTCGCGGTC
GTCGACGAGA CGGATGACGT CGATCCGGCC GCCCTCGCCG AACTTGAGGG AATCGGCGAC
GTCTCCGTGA TCGCCGACAC CGACTCGGAG ACGCTGTTGC AACTCCAGTA CACGGGTCCG
TTTCTCGGCG CCGCCGTCGA CGCCCACGGC GGGACGCTGC GGTCGCTAGT CGCCGACGAT
ACCGGGACGC GAGCGATCGT GGAGGTTCCG GAAACGGTCG AAGTCAGAAA CGTGCTGTCG
CAACTCAATC GCCGCGAGTT CGCGGTCTCG CTCGTCGCCC GCCGCGAGGG TTCGACGCGG
ACGCGGTCGA TGATCGACGC CGCCGCCCGT AACGCCTTAC TCGAGCAGCT GACCGATAGA
CAACGCGAGG TCGTCCAGAC GGCCTACCAC GGCGGCTTCT TCGAGTGGCC CCGGGAAACG
ACCGGGGAAT CGATCGCCGA CTCGCTTGGG ATCTCCTCCC CCGCCTTCCA GAAACACGTC
CGAGCCACCG AGAGAAAGCT CTTCACCGCG TTGTTCGACG GTCGCTCGGT AGATGGTTAA
 
Protein sequence
MNRGLEVPES TPIRALAVGT STWLQRATAE LDADDEITIR GPLDPATELE DAALNETDCL 
LTDDRAVLER VDGECPIVYA LEPAEDESID RLRADGATDV ILKAAADRSS LLAHRLRRTA
EFAVSHTAGR QETRLRALLE HSSDVVLVVD DDGTVSAVGP SAEGIAGYEP DALVGSHYLD
FVHPDDVASI RTAFDDLCTS EPGTTTTVEY RCQHADDAWY VHEAVLTNRL AAEKRAGGES
GAEIDGVVAS IRDVTAVHRV ERELERSLER ISDAFYALDS EWRFTYVNDR AGSLLGVDPA
TVIGRPILEL FPELRETPFR TAAVEAMETQ EPTSVEHYYE PTDRWYDVRL YPSPSGLSVY
FQDVTDRVER ERKLQDRTEQ LETIIRNVPV VLFELDADGT ITLAEGRAFE RSEAIADDIV
GESAFDVFDD HPAIRADLRA ALEGVSTHSS IELDDRVLEQ WCRPIVEDGG VDRVIGIAAD
VTERAQYQTA LNALHEATSH LLTVESEQAA CEYVVDVAND VLDLETVVYR FDERENELVP
TAYSSGLEST VGSPPRLRPG ESLAWRAFVD DSPARFDDVR DAPQVFDPTT DARSGLYVPI
GEHGVLVALD PEPGRYDEET FELATLFART AEAALDRITR TRRLHGHEWE LKRQNEHLGR
LNEASQVRQD LEQLLLMANS RAEIERGICE RLADLECCSF AWIGEPDPGG NQLLPRRQAG
HEREYLDAVS VTTVDDSATE PAGRTARTRS PVLVENVADS IREGTWRGDA LSRSFQSVYA
VPLVYDDFLY GVLTLYSDDR DAFDEPLRSM LAELGETIGY SIDAVKRKSP LDGESVPVVE
LELALEGPNP LGRLADRLDA RAEFEGGAIR DDGTPTVFAV VDETDDVDPA ALAELEGIGD
VSVIADTDSE TLLQLQYTGP FLGAAVDAHG GTLRSLVADD TGTRAIVEVP ETVEVRNVLS
QLNRREFAVS LVARREGSTR TRSMIDAAAR NALLEQLTDR QREVVQTAYH GGFFEWPRET
TGESIADSLG ISSPAFQKHV RATERKLFTA LFDGRSVDG
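
The protein sequence is the translation of the gene sequence, and this can be spot-checked on the first and last lines of the gene sequence. The sketch below is a minimal illustration in Python, not the database's own pipeline. It builds the standard codon assignments by hand; translation table 11 uses these same assignments for internal codons and differs only in which codons may start a gene. The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 60 bases of the gene sequence, copied from above.
first_bases = "ATGAATCGCG GCCTCGAGGT TCCCGAATCG"
last_bases = "CGAGCCACCG AGAGAAAGCT CTTCACCGCG TTGTTCGACG GTCGCTCGGT AGATGGTTAA"

print(translate(first_bases))  # prints: MNRGLEVPES
print(translate(last_bases))   # prints: RATERKLFTALFDGRSVDG
```

Both results match the start and end of the protein sequence listed above. The final line is in frame because it begins at offset 3120, which is a multiple of three.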