Gene Anae109_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3094 
Symbol 
ID5374273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3617595 
End bp3619022 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content73% 
IMG OID640844618 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001380274 
Protein GI153005949 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.307275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGACG AACGACAGAA ACGAACCTCC CTGCCCGAGT GCATGACGCT GCGCGACGTG 
CGCACGTCGA TCGACGAGGT GGACCGGCGC ATCGTGGCGC TCCTCGCGGA GCGGCGCGGG
TACGCGCTGC AGGCCGCGCG CTTCAAGAGC GCCGCCGACG GCGTGAAGGA CCCTTCGCGC
GAGGAGCAGG TCATCGCGAA CGTGCGGGCG CTCGCCGGCG AGGAGGGCAT CGAGCCCGAC
CTCGTCGAGA TGCTCTACCG CGACATGATC GCGGGCTTCG TGCGCGTCGA GCTCGCGTCC
GGCGGCCACC GCGCGCCGCC GGTGATCGAG AACGTCAACG TCGCCGCCTT CGACGCGATG
CTCCCGCCGG AGGAGGTGAA GCTCCGCATC CCGGTGTCCG AGCGGGCCGC GCGGACGGTC
GTCGAGGGGC GCCGCACCGT GGAGGCGATC CTCGACCGGA CGGATCCGCG CCTGCTCGTG
GTGGTCGGCC CCTGCTCGAT CCACGACCCC GTCGCCGGGC TCGACTACGC CCACCGCCTG
CGCGCGCTGG CGGACGAGCT CTCCGACACG CTCTACCTCG TGATGCGCGT CTACTTCGAG
AAGCCGCGCA CGTCGGTGGG CTGGGAGGGG CTCACGAACG ATCCGCACAT GAACGACTCC
TTCCAGGTGA AGGAGGGCAT GGAGCGGGCG CGCCGGTTCC TGCTCGAGGT GAGCGATCTG
GGCCTGCCCA CCGGGACGGA GGCGCTCGAT CCCATCTCCC CGCACTACCG CGGCGACCTC
GTCACCTGGA CCGCCATCGG GGCGCGCACC TCGGAGTCGC AGACGCACCG CAACCTCGCC
TCCGGGCTCT CCACGCCCGT CGGGTTCAAG AACGGCACCG ACGGCGAGGT GGACGGCGCG
GTGAACGCCA TCCTGGCCGC GGCCCGGCCC CACGCTTTCC TGGGCATCAA CGACCAGGGA
CGCTCCGCCG TGATCCGCAC GCGCGGCAAC CGCCACGGCC ACCTGGTGCT GCGCGGCGGC
GGCGGCCGGC CCAACTTCGA CAGCGTCTCG GTGGCCATCG CCGAGCAGGC GCTCGCGAAG
GCGGGACTCC CGCAGACGAT CGTCATCGAC TGCTCGCACG CGAACTCCTG GAAGAAGCCG
GAGCTCCAGC CGCTCGTCCT GCGCGACGTG GCGAGCCAGC TCCGCCAGGG GAACCGGTCC
ATCGCGGGGA TCATGCTGGA GAGCTTCCTC GAGCAGGGGA GCCAGCCGAT GTCGGCCGAT
CCGGCGCAGC TCCGCTACGG CCGCTCGGTC ACGGACCCTT GCCTCGGCTG GGACGAGACC
GCCGCGGCGC TCCGCGAGGC GCGCAGCCTG CTGCGCGGCG TGGTGGAGGA GCGGCGTCGG
GCCGCCGACG CGCCGCCATC GGCGTCGCCT CGCGCCGCGG CGAGCTGA
 
Protein sequence
MNDERQKRTS LPECMTLRDV RTSIDEVDRR IVALLAERRG YALQAARFKS AADGVKDPSR 
EEQVIANVRA LAGEEGIEPD LVEMLYRDMI AGFVRVELAS GGHRAPPVIE NVNVAAFDAM
LPPEEVKLRI PVSERAARTV VEGRRTVEAI LDRTDPRLLV VVGPCSIHDP VAGLDYAHRL
RALADELSDT LYLVMRVYFE KPRTSVGWEG LTNDPHMNDS FQVKEGMERA RRFLLEVSDL
GLPTGTEALD PISPHYRGDL VTWTAIGART SESQTHRNLA SGLSTPVGFK NGTDGEVDGA
VNAILAAARP HAFLGINDQG RSAVIRTRGN RHGHLVLRGG GGRPNFDSVS VAIAEQALAK
AGLPQTIVID CSHANSWKKP ELQPLVLRDV ASQLRQGNRS IAGIMLESFL EQGSQPMSAD
PAQLRYGRSV TDPCLGWDET AAALREARSL LRGVVEERRR AADAPPSASP RAAAS