Gene Htur_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4089 
Symbol 
ID8744717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp346735 
End bp348963 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content61% 
IMG OID646514649 
Producttranscriptional regulator, TrmB 
Protein accessionYP_003405596 
Protein GI284167318 
COG category[K] Transcription 
COG ID[COG1378] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGGT ATGAGAACCT CGACGAATCC GAAATACGCA ATTCGCTGCA ACGCCACGTC 
GATATGTCGG AGTACGAGTC GCAGGTGTAT CTCGCGCTCG TTCAAAACGG AAAGCAATCG
ATGCGAGATC TATCCGAGGC GAGCGACGTC CCAAAACAGC GCGTATACGA CATCGTCGAG
GAACTCAGGG AGCAGGGCTT CGTCGAACTC GACGACAGTT ATCCCAAGAA GGCGTACGCG
GTCGATCCCA CGAAGACGCT TGGCCCGATC CAGACGCACG TCGAACAGGT CCAAAACGCG
CTCGAGGAGT TTCACAAGTC GGTGTCCGAC GTCGATAGCG GCGTCGCACA GTTCAGGAAC
CGGTCGACGA TCGAGAAGTA CATCTCCGAA CTCCTCGACA GCGCCGAACG GACGATCTTC
CTGATGACGT CCGTCGATCG ACTGCGGATC TTCGAGGACG CGCTACGTGA CAACTCAGAC
GTCCAGGTCC GCGTCGTACT CACTGGTCTC GACGAGGGGC ACGTCGTCGA CGACCGTATC
GAACTCAACA GTCCCATCCG CGAGTTTGCC GACTACGTCC GGGGGACCGT TCGTAGCGAA
CCGCTCGTAC TCAGTGTGGA TCGAACTGCT GGGTTCTTTT GGCCGAGTAC TACCGACGCA
CGTCGCCAGC CTCAGGAGGG ATTCTACGTC ACCGATGAGG AACTCGCGTT CATGTTCGAT
CGGTTCCTCT CGGACACGGT CTGGCCGCTC GGCTATCCGG TCAACCCCGA TCAGCGCCGT
TCCATCACGC TTCCGCAACG GTACTATCGA ATCCACGATT GCCTCTCCGA CCTGGAGGTA
CTCACCGACT CCGTTCCCCT CCGAACCCTG ACGGTCCGAT TCGAGGGGTA CGACAACGTG
TCCGGCCAGC AGGTCTCCCG AGAGGGCCGA CTCGCCGGGT ACTACGCACC GGAGTTCGAT
GACCAGGCGT ACCTCGAAGT CGACATCGTC GAGGGCGACG ATGAGCAGTC TCCGACGGTG
ACAGTCGGCG GCTGGCACTC GCGGCGGGAA GACTACATGG CGACGAGCAT CGATCTGGAG
AAACACGAAG ACTGGTCCGC TGAAGAACTC GATGACGAGA CGCTCGCACA CATCGAGACG
TGCCGGACGG AGCTCCCCGA GGAAATCGCC GGCGAGGTCA TCGTCGGCTT CGACGGTTAC
ATCGACTATA TCAGATCGCT GGTCGGGGAA CGGAAGAGTC CTCGGATGTA CGACGAGATC
AGCGAGTTCG ACACGTTGCG CGAGATGATC ACGAGGGCGT CGGCTCAGGA CAAGACGCTC
CAGTTCGAGT GGGTCGAGAG TAGGCGGTTG CCCGGCGGCC ACACCGCCCA CGTCGGACAG
GTGCTCGATA CGGCCGGATA CGATACGGAA CTCGTCGGGT TCTTCGGGCA GCCGATCCGG
GACGAGTTCA GCGACGCGTT CGACGAAAAC GCGCTCCTCA GCCTGGGACA GCCGACCGTG
ACGGAGTATC TACAGTTCGG CGACGGGAAG GTCCTGTTCA CCGACTCCGG TGGACATCAA
GCGTTGAACT GGGAAACGCT CAGAGAATAC GTGCCGCTCG AAGATATCGT CGATCGCCTC
GACGAGACTG ATCTCGTGAG CATCGGCGGC TGGGCGCTCA TCCCCGAGAT ATCGACGATC
TGGGAGGGGA TCTACGAGCA GGTGTATCCG CTGCTCTCGT CGCCGCCCGA CGACATCATC
GTCTGTACGA GCGACGTGCA TCGCCTAACG GAGACGACGC TCCGGTCGGA TCTGGAGTCG
TTGAGCATCC TCGACGATGC GATCCCGGTG ACGGTCGTGA CGACCAGCGA ACAGGCCGCA
CACTTGAGTG ACGCTCTTCT GTCCGGCGAC CGGGGGAAGC GAGCGCTCCA CGCAACGGCA
GAGTCGCTTT GCCGCGAGAT CGGCGTGTCT CGGGTCGCGG TGACCGCTGC GAAAGAGTCC
GTCCTCGCCG GCCCCCACGG GAGCCAACGG ATCCGATCGG CCCTGATTTC CGACCCGGCA
GAGGAAGGGA CGTTTGAGGA TCACTTCAGT GCAGGTATCG CCCTAGGACG CGTCGAGGCT
CTCTCGGACA CATCGACACT CGCCCTCGGA AGCGCAGTAG CGAGTTACTT CAAGCAGTAC
CAGGAGACGC CGTCTCTGTC TGATATTCGG ACGTTTCTCG ATACCTACGA GAATCAGGGC
CCGGCCTGA
 
Protein sequence
MSGYENLDES EIRNSLQRHV DMSEYESQVY LALVQNGKQS MRDLSEASDV PKQRVYDIVE 
ELREQGFVEL DDSYPKKAYA VDPTKTLGPI QTHVEQVQNA LEEFHKSVSD VDSGVAQFRN
RSTIEKYISE LLDSAERTIF LMTSVDRLRI FEDALRDNSD VQVRVVLTGL DEGHVVDDRI
ELNSPIREFA DYVRGTVRSE PLVLSVDRTA GFFWPSTTDA RRQPQEGFYV TDEELAFMFD
RFLSDTVWPL GYPVNPDQRR SITLPQRYYR IHDCLSDLEV LTDSVPLRTL TVRFEGYDNV
SGQQVSREGR LAGYYAPEFD DQAYLEVDIV EGDDEQSPTV TVGGWHSRRE DYMATSIDLE
KHEDWSAEEL DDETLAHIET CRTELPEEIA GEVIVGFDGY IDYIRSLVGE RKSPRMYDEI
SEFDTLREMI TRASAQDKTL QFEWVESRRL PGGHTAHVGQ VLDTAGYDTE LVGFFGQPIR
DEFSDAFDEN ALLSLGQPTV TEYLQFGDGK VLFTDSGGHQ ALNWETLREY VPLEDIVDRL
DETDLVSIGG WALIPEISTI WEGIYEQVYP LLSSPPDDII VCTSDVHRLT ETTLRSDLES
LSILDDAIPV TVVTTSEQAA HLSDALLSGD RGKRALHATA ESLCREIGVS RVAVTAAKES
VLAGPHGSQR IRSALISDPA EEGTFEDHFS AGIALGRVEA LSDTSTLALG SAVASYFKQY
QETPSLSDIR TFLDTYENQG PA