Gene Haur_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4179 
Symbol 
ID5736041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5329582 
End bp5330607 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID641281334 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001546939 
Protein GI159900692 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0656088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGTTG TAATGAAGGC CCACGCTGAC CTCGCCGACC GTGACGCTGT ATTGGCTCGC 
TTAGCCGAAA ATAAGCTCAA AGGCCATTTA TCGGAAGGCG AGGAACGAAT TGTGATTGGA
GTTGTTGGTG CGAAAATTCC AGCGGGCTTG GAAGAACAAT TGCAATCGAT GAGCGGCGTG
CAAACAACCT TGCGAATTAC GCGCCCCTAT AAATTAGCTG GCCGCGAGTT TCAACAACAC
AATACCGTCA TTCGGATTGG CGATTTGGAA ATTGGCGGCG GTACTCCAGT CGTGATGGCT
GGGCCATGTT CGGTCGAAAG TGCTGAGCAA TTATTAAGTA CAGCACACGC GGTCAAAGCA
GCCGGAGCCA ATATCCTCCG TGGTGGCGCA TTCAAGCCAC GCACTTCACC CTACGCTTTC
CGTGGTTTGG GCGAAGAAGG CCTGAAAATT TTGGCCCAAG CTCGCGAAGA AACAGGTTTA
CCAATCATCA CCGAAGCCCT GAATACCCGT GATGTTGAAT TAGTCGCCCG CTACACTGAT
ATCATCCAAC TTGGCGCTCG CAATATGCAA AATTTCGCCC TCTTGGAAGA AGCAGGCCAA
ACGGGTAAGC CGATCATGGT CAAGCGTGGC CCTTCAGCAA CAGTCGAAGA ATGGTTGTTG
GCGGCTGAAT ATATTCTCGC GACTGGTAAT CGCAACGTTA TTTTGTGCGA ACGCGGTATT
CGCACCTACG AAACTGCCAC CCGTAACACC CTCGATTTGA ATGCTGTGGC AGTTGCCAAG
CGTCGGACTC ACTTGCCGGT GATTGCCGAC CCCAGCCATG GCACTGGCAA ATGGTACTTG
GTGCAGCCAA TGGCCTTGGC CGGCTTGGCC GCTGGCGCTG ATGGTCTGAT GATCGAAGTT
CACCACGATC CCGACCGTGC TTCTTCCGAT GGCCCTCAAT CACTCAACCA CCACAATTTT
GCCCAATTGA TGCAACAAGT TCGTCGCCTG ATCGCAGCTT TGGAGCCAGA ATTAGCAGTT
GCCTAG
 
Protein sequence
MIVVMKAHAD LADRDAVLAR LAENKLKGHL SEGEERIVIG VVGAKIPAGL EEQLQSMSGV 
QTTLRITRPY KLAGREFQQH NTVIRIGDLE IGGGTPVVMA GPCSVESAEQ LLSTAHAVKA
AGANILRGGA FKPRTSPYAF RGLGEEGLKI LAQAREETGL PIITEALNTR DVELVARYTD
IIQLGARNMQ NFALLEEAGQ TGKPIMVKRG PSATVEEWLL AAEYILATGN RNVILCERGI
RTYETATRNT LDLNAVAVAK RRTHLPVIAD PSHGTGKWYL VQPMALAGLA AGADGLMIEV
HHDPDRASSD GPQSLNHHNF AQLMQQVRRL IAALEPELAV A