Gene GM21_3615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3615 
Symbol 
ID8138988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4195208 
End bp4196233 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID644871235 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_003023394 
Protein GI253702205 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones127 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTAG TCATGAAAGC AGGAGCGGCT AAAAAAGAAC GCGACGCGGT GACCAAGAGG 
ATCAAGGAAC TGGGGTACAC CCCGCACATC ATCCATGGCA CCACCCGCGA CGTGATCGGC
GCCGTCGGGG ACGAGCGTGG CAAGAGCGTC TTGCAGACGC TGGAGTCGAT GCAGGGGGTC
GAGAGCGTCG TCCCCATCCT GCAGCCCTAC AAGCTCGCCT CCAAGGAAGT GAAGAAAGAG
GCCAGCATCG TCCGCATCTC CGACACCCTC GCCATCGGCG GGAAAGAACT GGTGGTGATG
GCTGGGCCCT GCTCCGTCGA GAACGAGGAG CAGATCATCG AGTCGGCCCT CGCCGTCAAG
GCGGCCGGCG CCCAGATGCT GCGCGGCGGC GCCTTCAAGC CGCGTACCTC CCCCTACTCC
TTCCAGGGAC TGGAGGAAGA GGGGCTGAAG CTTTTGGCGA AGGCCCGTGA GATAACCGGC
CTCCCCTTCG TCACCGAGAT CATCGATCCC GATTCCGTCG ACCTCGTCGC CTCCTACTCA
GACATGCTCC AGATCGGGGC GCGCAACTCC CAGAACTTCG CCCTCCTGAA GAAAGTCGGG
CAGATCGACA AGCCGATCCT TTTGAAGCGC GGCATGTCGA TGACCATCCA GGAGTTCCTG
ATGAGCGCCG AGTACATCAT GAGCGAGGGG AACCAGTCCG TCATCCTCTG CGAGCGCGGC
ATCAGGACCT TCGAGACCGC CACCAGGAAC ACCCTCGATC TCTCCGCCAT TCCCGTTTTG
AAGTCGAAGA CGCATCTTCC CATCGTCATC GACCCCTCCC ACGGCACCGG CAACTACCAC
TACATAGCGC CCATGTCCTT CGCCGCGGTT GCCGCCGGCG CCGACGGTCT CATCATCGAA
GTGCACCCCG ATCCGGAGAA GGCCTCCTCC GACGGGCCGC AGTCGCTCAA ACCTAAGAAG
TTCCAGGCAT TGATGGACAA GCTGCGGCTC TTCGCGGAAG CCGCGGACAA GACACTCGGC
GCGTAG
 
Protein sequence
MIVVMKAGAA KKERDAVTKR IKELGYTPHI IHGTTRDVIG AVGDERGKSV LQTLESMQGV 
ESVVPILQPY KLASKEVKKE ASIVRISDTL AIGGKELVVM AGPCSVENEE QIIESALAVK
AAGAQMLRGG AFKPRTSPYS FQGLEEEGLK LLAKAREITG LPFVTEIIDP DSVDLVASYS
DMLQIGARNS QNFALLKKVG QIDKPILLKR GMSMTIQEFL MSAEYIMSEG NQSVILCERG
IRTFETATRN TLDLSAIPVL KSKTHLPIVI DPSHGTGNYH YIAPMSFAAV AAGADGLIIE
VHPDPEKASS DGPQSLKPKK FQALMDKLRL FAEAADKTLG A