Gene GM21_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1517 
Symbol 
ID8136846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1772254 
End bp1773606 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content64% 
IMG OID644869129 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003021331 
Protein GI253700142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.00684484 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTAGCC AGAAATGGAC CAAATCGAGC TGGCGTTCTT TTCGCGCCTT GCAGCAACCG 
GTATGGCCGG CGGGATCGGC GCTGGAGGAG ACGCAGAAGA CGCTGTCCCA GCTCCCGCCG
CTGGTGTTCG CAGGCGAGTG CCAGACCCTG AAGGCGCAGT TGGCCGACGC CGTGGAAGGC
CGCGCTTTCG TGCTGCAATG CGGCGACTGC GCCGAGGATT TTTCGCGCTG CACCGGCCCC
GACATCAGGG AGCTCTTGAA GGTGATCCTG CAGATGTCGG TGGTGCTCGC CTTTGCCGGC
GAGAAGCGGG TGATCAAGAT AGGGAGGATC GCCGGGCAGT ACGCGAAACC GCGCTCGTCC
GACACCGAGA TGGTGCACGG GATCGAGCTC CCCAGCTACC GCGGCGACAT GGTGAACAGT
CCCGAGCCGA CCCTGGAGGC GAGGACCCCC GACCCGCGCC GAATGCTGGA GGGGTATTAC
CGGGCAGCGG CGACGCTGAA CCTGGTTCGC TCCTTCACGC TGGGAGGATA CGCCTCGCTG
GAGCGGGTGC AGGCCTGGCA CCGCGCCTCG CTCGATGCCC TTCCGGCCGG GCAGAAGTAC
GAGGATCTGG TCCGGCAGAT CTGGAAGACC ATCAACTTCA TGACGGCCAT AGGCCTCGAC
CCTCAGCACA CCCCGCAGTT GAACCAGGTG ACGCTCTACA CCTCGCACGA GGCCCTCTTG
CTCGACTACG AGGAGGCGCT CACCCGTATG GATTCGACCA GCGGGGGGTG GTACGACTGC
AGCGCGCACA TGCTCTGGAT CGGCGACCGG ACGAGACAGC TGGACGGGGC GCATGTCGAG
TTCCTGCGCG GGGTGAAAAA CCCGCTGGGG ATGAAGGTGG GGCCAAGCTA CGACATCGAC
ACGGTGAAAG CGCTGGCGCA GCGGTTGAAC CCCGACAACG AGCCGGGGCG GCTCACTCTG
ATCACCCGCT TCGGCGCCGA CAAGATCGAT TCCTACCTCC CCAGGCTTTT GAAAGAGATG
AAGCAGGAGG GGTTCAAGGT GGTCTGGAGC TGCGACCCCA TGCACGGCAA CACCTACCAG
AACGAGTATG GCCAGAAATC CAGGAAGTTC GAGGACATCC TGCGGGAAAT CAAGAATTTC
CAGCAGATAC ACAAGGCGGA GGGGACCGTT GCCGGAGGGG TGCACCTGGA GCTCACCGGC
GACCACGTAA CCGAGTGCAC CGGGGGGAGC CGGCAGCTTT TGGACAAGCA TCTGCACCTG
AACTACCAGA CCAACTGCGA TCCGCGCCTG AACGCCGAGC AGAGCGTGGA GCTCGCCTTC
GAGCTTGCCG AGATGCTGCA TCCCTGCAAG TGA
 
Protein sequence
MPSQKWTKSS WRSFRALQQP VWPAGSALEE TQKTLSQLPP LVFAGECQTL KAQLADAVEG 
RAFVLQCGDC AEDFSRCTGP DIRELLKVIL QMSVVLAFAG EKRVIKIGRI AGQYAKPRSS
DTEMVHGIEL PSYRGDMVNS PEPTLEARTP DPRRMLEGYY RAAATLNLVR SFTLGGYASL
ERVQAWHRAS LDALPAGQKY EDLVRQIWKT INFMTAIGLD PQHTPQLNQV TLYTSHEALL
LDYEEALTRM DSTSGGWYDC SAHMLWIGDR TRQLDGAHVE FLRGVKNPLG MKVGPSYDID
TVKALAQRLN PDNEPGRLTL ITRFGADKID SYLPRLLKEM KQEGFKVVWS CDPMHGNTYQ
NEYGQKSRKF EDILREIKNF QQIHKAEGTV AGGVHLELTG DHVTECTGGS RQLLDKHLHL
NYQTNCDPRL NAEQSVELAF ELAEMLHPCK