Gene Hore_10350 details

Gene Information

Locus tag: Hore_10350
Symbol:
ID: 7314623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1122890
End bp: 1123909
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 40%
IMG OID: 643611474
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002508786
Protein GI: 220931878
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
            [TIGR01362] 3-deoxy-8-phosphooctulonate synthase
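The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1122890, 1123909)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.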


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000000384679
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGTGG TTATGAAGGA TAATGCCAGT AAAAATGATA TTGAAAAGGT AGTAAAGAGA 
ATAGAAGAGC TGGGATATAA AACACATATT TCACGGGGTA CTGAAATAAC ACTTATTGGA
ATAATAGGAG AATTAAACCG TGAGGAGCTA ATTGATTCTC TGGGGGCCTA TCCCGGAATT
GAAAGGTTGG TTCCTATTCA GGAGCCCTAT AAACTGGCCG GGAAATCCTT TAATGACTCC
AGATCCAGGA TTAAAATTGG TGAAGATGTC GTTATTGGTG GTAAAGAAGT TGTAATGATG
GCTGGTCCCT GTGCAGTTGA AAGTGAACAG CAGATTATTA ATACAGCCCG GGCAGTAAAA
AAGGCAGGTG CTAAAATTCT GAGGGGTGGG GCCTTTAAAC CAAGAACCTC ACCCTACAGT
TTTCAGGGTT TACATGAAAA GGGATTAAAA TACCTTAAAA AAGCAGCCGA AGAGACTGGT
TTAAAGGTAA TAACAGAAGT AATGGACCCC AGGGATGTTG AATTAGTGGC CAGATATGCT
GATATCTTTC AAATCGGGGC CAGGAATATG CAAAACTTTT TCCTGTTAAA GGAAGTTGGA
AAAACAGATA AACCGGTTAT GTTGAAGCGT GGTATGAATG CTACTTATAA GGAATTTTTA
ATGGCAGCAG AGTATATTAT GTCAGAAGGA AACCATGATG TCATATTATG CGAAAGGGGT
ATTAGAACTT TTGAAACATA TACCCGTAAT ACCCTGGATC TGGTTAGTGT TCCTGTTTTA
AATAAACTAA GCCACTTACC TGTTGTCATT GACCCCAGTC ATGGAACAGG TCAATGGGAC
CTGGTAGGCC CGGCAGCAAG AGGGGCAGTA GCTATAGGGG CAGATGGACT TATTATAGAA
GTTCATCCTG AGCCGATTAA TGCCTTAAGT GATGGACAGC AATCCCTTAA ATTTGATAAA
TTTGAAGAAC TGGTAGATGA TCTGAAAAAG ATTGCCAGGG CAATAGGTCG TGACCTATAA
 
Protein sequence
MIVVMKDNAS KNDIEKVVKR IEELGYKTHI SRGTEITLIG IIGELNREEL IDSLGAYPGI 
ERLVPIQEPY KLAGKSFNDS RSRIKIGEDV VIGGKEVVMM AGPCAVESEQ QIINTARAVK
KAGAKILRGG AFKPRTSPYS FQGLHEKGLK YLKKAAEETG LKVITEVMDP RDVELVARYA
DIFQIGARNM QNFFLLKEVG KTDKPVMLKR GMNATYKEFL MAAEYIMSEG NHDVILCERG
IRTFETYTRN TLDLVSVPVL NKLSHLPVVI DPSHGTGQWD LVGPAARGAV AIGADGLIIE
VHPEPINALS DGQQSLKFDK FEELVDDLKK IARAIGRDL
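The space-grouped sequences above can be checked against each other programmatically. The following is a minimal sketch, not the method used to generate this record: it strips the grouping whitespace, computes a GC fraction, and translates the reading frame up to the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard / table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three groups of the gene sequence listed above.
first_groups = "ATGATTGTGG TTATGAAGGA TAATGCCAGT"
print(translate(first_groups))  # MIVVMKDNAS
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content field of this record.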