Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4089 |
Symbol | |
ID | 8744717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 346735 |
End bp | 348963 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514649 |
Product | transcriptional regulator, TrmB |
Protein accession | YP_003405596 |
Protein GI | 284167318 |
COG category | [K] Transcription |
COG ID | [COG1378] Predicted transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGGT ATGAGAACCT CGACGAATCC GAAATACGCA ATTCGCTGCA ACGCCACGTC GATATGTCGG AGTACGAGTC GCAGGTGTAT CTCGCGCTCG TTCAAAACGG AAAGCAATCG ATGCGAGATC TATCCGAGGC GAGCGACGTC CCAAAACAGC GCGTATACGA CATCGTCGAG GAACTCAGGG AGCAGGGCTT CGTCGAACTC GACGACAGTT ATCCCAAGAA GGCGTACGCG GTCGATCCCA CGAAGACGCT TGGCCCGATC CAGACGCACG TCGAACAGGT CCAAAACGCG CTCGAGGAGT TTCACAAGTC GGTGTCCGAC GTCGATAGCG GCGTCGCACA GTTCAGGAAC CGGTCGACGA TCGAGAAGTA CATCTCCGAA CTCCTCGACA GCGCCGAACG GACGATCTTC CTGATGACGT CCGTCGATCG ACTGCGGATC TTCGAGGACG CGCTACGTGA CAACTCAGAC GTCCAGGTCC GCGTCGTACT CACTGGTCTC GACGAGGGGC ACGTCGTCGA CGACCGTATC GAACTCAACA GTCCCATCCG CGAGTTTGCC GACTACGTCC GGGGGACCGT TCGTAGCGAA CCGCTCGTAC TCAGTGTGGA TCGAACTGCT GGGTTCTTTT GGCCGAGTAC TACCGACGCA CGTCGCCAGC CTCAGGAGGG ATTCTACGTC ACCGATGAGG AACTCGCGTT CATGTTCGAT CGGTTCCTCT CGGACACGGT CTGGCCGCTC GGCTATCCGG TCAACCCCGA TCAGCGCCGT TCCATCACGC TTCCGCAACG GTACTATCGA ATCCACGATT GCCTCTCCGA CCTGGAGGTA CTCACCGACT CCGTTCCCCT CCGAACCCTG ACGGTCCGAT TCGAGGGGTA CGACAACGTG TCCGGCCAGC AGGTCTCCCG AGAGGGCCGA CTCGCCGGGT ACTACGCACC GGAGTTCGAT GACCAGGCGT ACCTCGAAGT CGACATCGTC GAGGGCGACG ATGAGCAGTC TCCGACGGTG ACAGTCGGCG GCTGGCACTC GCGGCGGGAA GACTACATGG CGACGAGCAT CGATCTGGAG AAACACGAAG ACTGGTCCGC TGAAGAACTC GATGACGAGA CGCTCGCACA CATCGAGACG TGCCGGACGG AGCTCCCCGA GGAAATCGCC GGCGAGGTCA TCGTCGGCTT CGACGGTTAC ATCGACTATA TCAGATCGCT GGTCGGGGAA CGGAAGAGTC CTCGGATGTA CGACGAGATC AGCGAGTTCG ACACGTTGCG CGAGATGATC ACGAGGGCGT CGGCTCAGGA CAAGACGCTC CAGTTCGAGT GGGTCGAGAG TAGGCGGTTG CCCGGCGGCC ACACCGCCCA CGTCGGACAG GTGCTCGATA CGGCCGGATA CGATACGGAA CTCGTCGGGT TCTTCGGGCA GCCGATCCGG GACGAGTTCA GCGACGCGTT CGACGAAAAC GCGCTCCTCA GCCTGGGACA GCCGACCGTG ACGGAGTATC TACAGTTCGG CGACGGGAAG GTCCTGTTCA CCGACTCCGG TGGACATCAA GCGTTGAACT GGGAAACGCT CAGAGAATAC GTGCCGCTCG AAGATATCGT CGATCGCCTC GACGAGACTG ATCTCGTGAG CATCGGCGGC TGGGCGCTCA TCCCCGAGAT ATCGACGATC TGGGAGGGGA TCTACGAGCA GGTGTATCCG CTGCTCTCGT CGCCGCCCGA CGACATCATC GTCTGTACGA GCGACGTGCA TCGCCTAACG GAGACGACGC TCCGGTCGGA TCTGGAGTCG TTGAGCATCC TCGACGATGC GATCCCGGTG ACGGTCGTGA CGACCAGCGA ACAGGCCGCA CACTTGAGTG ACGCTCTTCT GTCCGGCGAC CGGGGGAAGC GAGCGCTCCA CGCAACGGCA GAGTCGCTTT GCCGCGAGAT CGGCGTGTCT CGGGTCGCGG TGACCGCTGC GAAAGAGTCC GTCCTCGCCG GCCCCCACGG GAGCCAACGG ATCCGATCGG CCCTGATTTC CGACCCGGCA GAGGAAGGGA CGTTTGAGGA TCACTTCAGT GCAGGTATCG CCCTAGGACG CGTCGAGGCT CTCTCGGACA CATCGACACT CGCCCTCGGA AGCGCAGTAG CGAGTTACTT CAAGCAGTAC CAGGAGACGC CGTCTCTGTC TGATATTCGG ACGTTTCTCG ATACCTACGA GAATCAGGGC CCGGCCTGA
|
Protein sequence | MSGYENLDES EIRNSLQRHV DMSEYESQVY LALVQNGKQS MRDLSEASDV PKQRVYDIVE ELREQGFVEL DDSYPKKAYA VDPTKTLGPI QTHVEQVQNA LEEFHKSVSD VDSGVAQFRN RSTIEKYISE LLDSAERTIF LMTSVDRLRI FEDALRDNSD VQVRVVLTGL DEGHVVDDRI ELNSPIREFA DYVRGTVRSE PLVLSVDRTA GFFWPSTTDA RRQPQEGFYV TDEELAFMFD RFLSDTVWPL GYPVNPDQRR SITLPQRYYR IHDCLSDLEV LTDSVPLRTL TVRFEGYDNV SGQQVSREGR LAGYYAPEFD DQAYLEVDIV EGDDEQSPTV TVGGWHSRRE DYMATSIDLE KHEDWSAEEL DDETLAHIET CRTELPEEIA GEVIVGFDGY IDYIRSLVGE RKSPRMYDEI SEFDTLREMI TRASAQDKTL QFEWVESRRL PGGHTAHVGQ VLDTAGYDTE LVGFFGQPIR DEFSDAFDEN ALLSLGQPTV TEYLQFGDGK VLFTDSGGHQ ALNWETLREY VPLEDIVDRL DETDLVSIGG WALIPEISTI WEGIYEQVYP LLSSPPDDII VCTSDVHRLT ETTLRSDLES LSILDDAIPV TVVTTSEQAA HLSDALLSGD RGKRALHATA ESLCREIGVS RVAVTAAKES VLAGPHGSQR IRSALISDPA EEGTFEDHFS AGIALGRVEA LSDTSTLALG SAVASYFKQY QETPSLSDIR TFLDTYENQG PA
|
| |