Gene Hhal_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0996 
Symbol 
ID4709568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1068145 
End bp1069221 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content67% 
IMG OID639855467 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001002574 
Protein GI121997787 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAGA GCGATCACGT CAACGACGTC AATGTGGCCA CCGGTGAACG CCTTCCGTCT 
CCGGCCGAGA TCAAGTCCGA GGTCCCGCTG ACGGATGCCG CCCGGCAGAC CGTGCTGGAC
GGGCGGCAGG TCCTGCGGGA CATCCTCGAC GGCAAGGATC AGCGCATCTT CGCCGTGGTC
GGGCCGTGCT CCATCCACGA CCCGGAGGCG GCGCTCGACT ACGCGCGGCG GCTCAAGGCG
CTGCACGATG AGCTCAGCGA TCACATCTAC TTGGTGATGC GGGTTTACTT CGAGAAACCG
CGCACCACCA CGGGCTGGAA GGGGCTGATC AACGACCCGG ACATGGACGA CTCCTTCCGG
ATCGATAAGG GCCTGCGCAT GGGCCGCGAG CTGCTCCGCG AGATCGCCGC CATGGGGCTG
CCCACGGCGA CTGAGGCCCT CGACCCCTAC GCACCGCAAT ACTACGGCGA CCTGGTTTCG
TGGACCGCGA TCGGCGCGCG TACCACCGAG TCCCAGACCC ACCGCGAGAT GGCCAGCGGG
CTGTCCACGC CGGTGGGCTT CAAGAACGCC ACTGACGGCA GCCAGACGGT GGCGATCAAC
GCCCTGCAAT CGGCGGCGTC CCCCCACAGT TTCCTGGGCA TCGACCAGGA GGGCCGCATC
ACCGTCATCC GTACCCGGGG CAACCAGTAC GGCCACGTCG TGCTGCGGGG CGGGGCGCAG
CCCAACTACG ACTCGGTGAG TATCCGGCTG TGCGAGCAGG CGCTGGAGAA GGCCGGCATG
CCGCTGCGCG TGGTGGTCGA CTGCAGCCAC TCCAACTCCA ACAAGGATCC CGGGCTACAG
TCGATGGTGC TGGAGGACGT GATCCGTCAG CTCCGCGAGG GCAATCGCTC CATCGTCGGG
GTGATGCTGG AGAGCAACAT CGGTTGGGGC AGTCAGAAGC TCGGTGCCGA TCCGGGTGCC
CTGGACTACG GCATCTCCAT CACCGATGCT TGTATCGACT GGGAGACCAC CGAACAGGTT
CTCCGGGACG CCGCGGGGCA GCTGCGCGGC AGCCTGCGCG AGCGCGAGCT GCTGTAG
 
Protein sequence
MQESDHVNDV NVATGERLPS PAEIKSEVPL TDAARQTVLD GRQVLRDILD GKDQRIFAVV 
GPCSIHDPEA ALDYARRLKA LHDELSDHIY LVMRVYFEKP RTTTGWKGLI NDPDMDDSFR
IDKGLRMGRE LLREIAAMGL PTATEALDPY APQYYGDLVS WTAIGARTTE SQTHREMASG
LSTPVGFKNA TDGSQTVAIN ALQSAASPHS FLGIDQEGRI TVIRTRGNQY GHVVLRGGAQ
PNYDSVSIRL CEQALEKAGM PLRVVVDCSH SNSNKDPGLQ SMVLEDVIRQ LREGNRSIVG
VMLESNIGWG SQKLGADPGA LDYGISITDA CIDWETTEQV LRDAAGQLRG SLRERELL