Gene Anae109_0367 details

Gene Information

Locus tag: Anae109_0367
Symbol:
ID: 5377717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 420477
End bp: 421493
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 72%
IMG OID: 640841876
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001377566
Protein GI: 153003241
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
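
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted as a residue. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(420477, 421493)
print(length, protein_length_aa(length))  # 1017 338
```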


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.719946
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.246047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTCG TCATGAAGCC CCACGCCAGC GAGGGGGAGA TCGCCGCGGT CGTCGAGCGG 
ATCGCCTCCC TCGGGCTCAC CGCCCACCCC ATCCCGGGCG CGCAGCGCGT CGCCATCGGG
ATCACCGGCA ACAAGGGCGG CCTCGAGGCG GAGCTCTTCG AGACGATGCC GGGCGTGCAG
GAGGCGCTCC GCGTCTCGCA GCCCTTCAAG CTCGTGTCGC GCGAGGTGAA GGCGGACGAC
ACGGTCCTCG ACGTCGGCGG CGTCCCGCTC GGCGGGAACG CGCTCGCCAT CATGGCGGGG
CCGTGCTCGG TCGAGTCCCG CGAGCAGCTG CTCGAGGCGG CGCACGCGGT CCGCGCCGCA
GGCGCGCGCT TCCTCCGCGG CGGCGCCTAC AAGCCGCGCA CGAGCCCCTA CGAGTTCCAG
GGGCTCGCCG AGGAGGGCCT GAAGCTGCTC GCCCTCGCGC GCGAGGAGAC CGGCCTCAAG
GTGGTGACCG AGGTGATGGA CGTCGAGACG CTGCCGATGG TGTCCGAGTA CGCCGACGTC
CTCCAGATCG GCGCCCGGAA CATGCAGAAC TTCTCGCTCC TGAAGCAGCT CGGCGAGCTC
CGCAAGCCGG TGCTCCTGAA GCGCGGCCCC TCCGCCACCG TCAAGGAGTG GCTCATGGCC
GCCGAGTACG TGGTCTCGCG CGGCAACTAC CAGGTGGCGC TGTGCGAGCG CGGGATCCGC
ACGTTCGAGA CCATGACGCG CAACACGCTC GACCTGAACG CCGTGCCGGT GCTGAAGGCG
CTCACCCACC TTCCCGTGGT GGTGGACCCG TCGCACGGCA TCGGCCTGCG GGCCCACGTC
GCCGCCATGG CGCGGGCCGG GGTCGCCGCC GGCGCGGACG GCCTCATCGT CGAGGTCCAC
CCGCACCCGG AGAAGGCCCT CTCCGACGGG CAGCAGTCGC TCACGCCGCG CGAGTTCGAG
GAGCTCATGC GGCAGGTGCG CGTCATCGCC GGCGCGGTCG GCCGCGCCAT CGCCTGA
 
Protein sequence
MLVVMKPHAS EGEIAAVVER IASLGLTAHP IPGAQRVAIG ITGNKGGLEA ELFETMPGVQ 
EALRVSQPFK LVSREVKADD TVLDVGGVPL GGNALAIMAG PCSVESREQL LEAAHAVRAA
GARFLRGGAY KPRTSPYEFQ GLAEEGLKLL ALAREETGLK VVTEVMDVET LPMVSEYADV
LQIGARNMQN FSLLKQLGEL RKPVLLKRGP SATVKEWLMA AEYVVSRGNY QVALCERGIR
TFETMTRNTL DLNAVPVLKA LTHLPVVVDP SHGIGLRAHV AAMARAGVAA GADGLIVEVH
PHPEKALSDG QQSLTPREFE ELMRQVRVIA GAVGRAIA
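
The gene and protein sequences above can be checked against each other by removing the grouping whitespace and translating codon by codon. The following is a minimal sketch in plain Python, not the database's own method. It assumes the codon assignments of translation table 11, which match the standard code for elongation codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped on the page.
fragment = "ATGCTCGTCG TCATGAAGCC CCACGCCAGC"
print(translate(fragment))  # MLVVMKPHAS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and the GC content reported in the record.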