Gene Htur_1434 details


Gene Information

Locus tag: Htur_1434
Symbol:
ID: 8742025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1489871
End bp: 1491022
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 67%
IMG OID: 646512012
Product: chorismate synthase
Protein accession: YP_003402995
Protein GI: 284164716
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
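
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1489871
END_BP = 1491022


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based coordinates inclusive on both ends."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                   # matches "Gene Length"
    print(protein_length(length), "aa")   # matches "Protein Length"
```

Under these assumptions the coordinates give 1152 bp and 383 aa, in agreement with the listed values.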


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGCA ACCGCTTCGG TCGCCTCTTC CAGGTGACCA CGTTCGGCGA GAGCCACGGG 
GAGGCGATGG GCTGTACCAT CTCGGGCTGT CCCGCCGGCC TCGAGCTCTC GGAGGAGGAC
ATCCAGGAGG ACTTAGATCG GCGAAAGCCG GGCCAGTCGA TGATCACGAC CAGCCGCGGC
GAACCCGACG ACGTCTCGAT CAAGTCCGGG ATTCAGGACG GCTACACGAC CGGGACGCCG
ATCGGGCTGG TCATCCAGAA CAAGGACGCT CGTTCAGGCA AGTACGAGCC GTTCATCACC
GCACCCCGTC CGTCCCACGG CGACTTCACC TACTCGGCGA AGTTCGGTAC CCGTAACTGG
GGCGGCGGCG GCCGCTCGTC GGCCCGCGAG ACCGTCAACT GGGTCGCCGC GGGCGCGATC
GCAAAGAAGC TCCTCGCGCG CGAGGGAATC GAACTCAAGG CCCACGTCAA CCAGATCGGC
GACGTCGAGG CCCCCGAGGT AAGCTTCGAG CAGATTAAGG AACACTCCGA GGAGAACGAC
GTCCGCTGTG CCGATCCCGA GACCGCCGCG GAGATGCAGG AACTCATCGA GGAGTACCAG
GAGGAAGGCG ACTCCATCGG CGGCTCGATC TACTTCGAGG CCCAGGGCGT CCCCGTCGGC
CTCGGCGCAC CTCGGTTCGA CTCGCTGTCC GCGCGACTCG GACAGGCCAT GATGGCGGTC
CCGGCGACGA CGGCCTTCGA GTTCGGCCTC GGTCGCGAGG CCCGCGAGTG GACGGGCAAG
GAGCGAAACG ACGACTGGGA GTTCGACGAC GAGGGGAACC CGACGCCCGT CGAGAACGAC
CACGGCGGCA TCCAGGGCGG CATCTCGAGC GGCGAACCGA TCTACGGCGA GGTCACGCTC
CACGCACCTA CGTCGATCCC CAAGTCCCAG CAGACCGCCG ACTGGGAGAC CGGCGAAATC
AAGGAAGAGA AGGTTATCGG CCGCCACGAC CCCGTCCTCC CGCCGCGAGG CGTCCCGGTC
GTCGAGGCGA TGCTCGCGCT GACGCTCGTC GACTTCATGC TGCTGTCGGG CCGGCTCAAC
CCCGACCGCG TCGACGACCA GCCCGGCGAG TACGACACGG ACTACCACCC GAGCAACCCG
CAGAACGAGT GA
 
Protein sequence
MNGNRFGRLF QVTTFGESHG EAMGCTISGC PAGLELSEED IQEDLDRRKP GQSMITTSRG 
EPDDVSIKSG IQDGYTTGTP IGLVIQNKDA RSGKYEPFIT APRPSHGDFT YSAKFGTRNW
GGGGRSSARE TVNWVAAGAI AKKLLAREGI ELKAHVNQIG DVEAPEVSFE QIKEHSEEND
VRCADPETAA EMQELIEEYQ EEGDSIGGSI YFEAQGVPVG LGAPRFDSLS ARLGQAMMAV
PATTAFEFGL GREAREWTGK ERNDDWEFDD EGNPTPVEND HGGIQGGISS GEPIYGEVTL
HAPTSIPKSQ QTADWETGEI KEEKVIGRHD PVLPPRGVPV VEAMLALTLV DFMLLSGRLN
PDRVDDQPGE YDTDYHPSNP QNE
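
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method. It builds the codon table from the standard compact encoding, whose amino acid assignments are shared by translation tables 1 and 11, strips the spaces and line breaks used in the listing, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_content` are hypothetical. For brevity the example uses only the first and last 10-base groups of the gene sequence rather than all 1152 bp.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG order (shared by translation tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_line = ("ATGAACGGCA ACCGCTTCGG TCGCCTCTTC "
                  "CAGGTGACCA CGTTCGGCGA GAGCCACGGG")
    print(translate(first_line))      # start of the protein sequence
    print(translate("CAGAACGAGT GA"))  # end of the protein, before the stop
```

Passing the whole gene sequence to `translate` and `gc_content` in the same way allows a comparison with the listed protein sequence and GC content.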