Gene Anae109_0368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0368 
Symbol 
ID5377718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp421511 
End bp422533 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content71% 
IMG OID640841877 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001377567 
Protein GI153003242 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.511189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.239005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTCG CGATGAAGCC GCACGCGAGC CCGGCCGAGT TCCAGGCGGT GCTCGAGAAG 
ATCCGCTCGC TCGGCCTCAC CCCGCAGCCG ATCACCGGGA CGGAGCGCAA GGTGGTCGCC
GTGATCGGCC ACACCACCGG CCTCGATCCG GACGACCTGT TCGGCTCCAT GCCGGGCGTG
GCGGAGGCGC TGCGGGTGTC GCAGCCCTTC AAGCTCGTCT CGCGCGAGGT GAAGGAGGAG
GACACGATCA TCGACGTGGG CGGCGTCACG CTCGGCGGCA AGGCCATCGC GGTCATGGCC
GGCCCCTGCT CGGTCGAGTC GAAGGACCAG ATCCTCGAGG CGGCCCACGC CGTGAAGGCG
GCGGGCGCGA CCTTCCTGCG CGGCGGCGCC TTCAAGCCGC GCACCAGCCC CTACGAGTTC
CAGGGGCTGC GCGAGGAGGG GCTGAAGCTG CTCGCGCTGG CGCGCGAGGC CACGGGCCTC
AAGGTCGTCA CCGAGGTGAA GGACACCGAG ACCCTGCCGA TGGTGGCGGA GTACGCCGAC
GTGCTGCAGG TCGGCGCCCG CAACATGCAG AACTACTCGC TGCTCGAGCG GCTCGGCGCG
GTCGAGAAGC CCATCCTCCT GAAGCGCGGC CTCTCCGCCA CCCTGAAGGA GTGGCTCATG
GCGGCGGAGT ACATCGTCTC CAAGGGGAAC TTCCAGGTCG CGCTGTGCGA GCGCGGCATC
CGGACCTTCG AGACCATGAC CCGCAACACC CTCGACATCA ACGCCGTGCC GGTCCTGAAG
GCGCTCACGC ACCTGCCGGT GGTGGTGGAT CCGTCGCACG GCATCGGGCT CCGCCCGCAC
GTCCCCGCCA TCGCGCGGGC AGGGATCGCC GCCGGCGCGG ACGGCCTCAT CATCGAGGTC
CACCCGTGCC CGGAGAAGGC GCTCTCCGAC GGCCACCAGT CGCTCACGCC CGCCGAGTTC
GAGGAGCTCA TGCGCCAGGT CCGGGTGATC GCCGGCGCGA TCGGCCGGCC GGTCGGCGCC
TGA
 
Protein sequence
MFVAMKPHAS PAEFQAVLEK IRSLGLTPQP ITGTERKVVA VIGHTTGLDP DDLFGSMPGV 
AEALRVSQPF KLVSREVKEE DTIIDVGGVT LGGKAIAVMA GPCSVESKDQ ILEAAHAVKA
AGATFLRGGA FKPRTSPYEF QGLREEGLKL LALAREATGL KVVTEVKDTE TLPMVAEYAD
VLQVGARNMQ NYSLLERLGA VEKPILLKRG LSATLKEWLM AAEYIVSKGN FQVALCERGI
RTFETMTRNT LDINAVPVLK ALTHLPVVVD PSHGIGLRPH VPAIARAGIA AGADGLIIEV
HPCPEKALSD GHQSLTPAEF EELMRQVRVI AGAIGRPVGA