Gene Anae109_1710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1710 
Symbol 
ID5375894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1924755 
End bp1925825 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID640843219 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001378898 
Protein GI153004573 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0929755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGA CCGAGAACCT GAACGTCGCG GCACTGGACG TGATGCCCTC GCCCGACGAG 
GTGAAGGCGC GCGTCCCGCT CGACGAGCCC GCCGCCCGCA CCGTCGTCGA GGGGCGCCGC
ACGCTCGAGG CGATCCTCGA CCGGCGAGAC CCGCGGCTCT TCGTGGTGGT CGGGCCGTGC
TCGATCCATG ACCCGGCCGC CGGGCTGGAC TACGCGCGCC GGCTGCGCGT GCTCGCCGAC
GAGGTCTCGC GGACGCTGTA CGTCGTGATG CGCGTCTACT TCGAGAAGCC GCGCACGTCC
ATCGGGTGGA AGGGGTTCAT CAACGACCCG CGCATGGACG ACTCCTTCCG GATCGACGAG
GGCATGGAGC GCGCGCGCCG CTTCCTGCTC GACGTGAACG AGCTCGGGCT CCCCGCGGCC
ACCGAGGCCC TCGACCCGAT CGCGCCGCAG TACTACGGCG ACCTCATCGC CTGGACCGCC
ATCGGCGCGC GCACCTCCGA GTCGCAGACG CACCGCGAGA TGTCGTCGGG CCTCTCCACG
CCGGTGGGCT TCAAGAACGG CACGGACGGC GACCTCGAGG CCGCCGTGAA CGGCATCCTC
TCGGCCGGGA GCCCGCACGC CTTCCTCGGC ATCAACGGCC AGGGCCGCTC CGCCATCGTC
CGCACGAACG GCAACCGCTA CGGGCACATC GTCCTGCGCG GCGGCGGCGG GCGCCCCAAC
TACGACACGG TGTCCATCGC GCTCGCGGAG GAGGCGCTCG CGAAGGCGAA GCTCCCGAAG
AACGTGGTGG TGGACTGCTC GCACGCGAAC TCGCGCAAGA AGCCGGAGCT GCAGCCGCTC
GTGCTGCGCG ACGTGGTCCA CCAGATCCGC GAGGGCAACC GCTCGGTGGT GGGCCTCATG
CTGGAGAGCT TCCTCGAGGC GGGGAGCCAG CCCATCCCGG CGGACCGGTC GCAGCTCCGC
TACGGCTGCT CCGTCACCGA CGCCTGCATC GGCTGGGACA CCACCGTCGA GGTGCTGCGC
TGGGCCGACG ACGTGCTGCG GGACGTGCTG CCGCGCCGCG CGGCGCCGTG A
 
Protein sequence
MQQTENLNVA ALDVMPSPDE VKARVPLDEP AARTVVEGRR TLEAILDRRD PRLFVVVGPC 
SIHDPAAGLD YARRLRVLAD EVSRTLYVVM RVYFEKPRTS IGWKGFINDP RMDDSFRIDE
GMERARRFLL DVNELGLPAA TEALDPIAPQ YYGDLIAWTA IGARTSESQT HREMSSGLST
PVGFKNGTDG DLEAAVNGIL SAGSPHAFLG INGQGRSAIV RTNGNRYGHI VLRGGGGRPN
YDTVSIALAE EALAKAKLPK NVVVDCSHAN SRKKPELQPL VLRDVVHQIR EGNRSVVGLM
LESFLEAGSQ PIPADRSQLR YGCSVTDACI GWDTTVEVLR WADDVLRDVL PRRAAP