Gene GM21_2255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2255 
Symbol 
ID8137595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2628299 
End bp2629387 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content62% 
IMG OID644869870 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003022062 
Protein GI253700873 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGA CCAACAATCT CAAGGTAAGA AGCATCACTC CGATCATTGC ACCTACCGAT 
CTCAGACAGG TCTTCCCCAT GTCCGAGAAA TCCGGGACCT GCGTGAGCAG CAGCCGCGCA
GCCATCACCA GGATCTTGAA GGGAGAGGAC AAGCGGCTCA TGGTGGTCGT CGGCCCCTGC
TCCATTCACG ACCCCAAGGG AGCCCTTGAG TACGCCGAGA AGCTCGCGGC GCTCGCCAAG
GAGGTCTCCG AGGAAATGCT GCTGATCATG CGCGTGTACT TCGAGAAGCC GCGCACCACC
ATCGGGTGGA AGGGGCTCAT CAACGACCCG GACATGAACG GCACCCACCA GATCTCCAAG
GGGCTCGGCA TCGCCCGCGG CCTTCTCTGC AAGATCACCG AGATGGGGCT GCCGGTCGCG
ACCGAGATGC TCGACCCGAT CACCCCCGAG TACCTGGCCG ACCTCCTCTC CTGGGGCGCC
ATCGGGGCCC GCACCACCGA ATCACAGACC CACCGCGAGA TGGCTAGCGG CCTCTCCTTC
GCGATAGGGT TCAAAAACGG CACCGACGGC AACCTCCAGA TCGCCATCGA CGCCATGAAG
GCGGCGCTTC ATTCCCACAG CTTCCTCGGC ATCAACCGCG ACGGCCTGAC CTCCATCATC
CAGACCACCG GCAACCCCGA CGTGCACATG GTCCTGCGCG GCGGGAGCAA GAAGCCGAAC
TACTCCCCCG AGGACATCGC CAAATCCGAG GAGATGATCG CCAAGGCGGG ACTGACCCCG
ACCATGATGG TCGACTGCAG CCACGGCAAC TCCGAGAAGA AGTACGAGCG GCAACCCGAG
GTCATGAAGA GCGTGATCGA CCAGATCGCT GCCGGCAACC GCAGCATCTC CGGCGTGATG
ATCGAGAGCT ACCTGAAGGA AGGGAACCAG CCGATGCCCA AGGACGGCGA TCCCTCCTCC
TTAGCCTACG GCGTATCGAT CACCGACAGC TGCATCAACT GGGAGACCAC CGAGGCCACC
CTGCGCGAAG CCCACCGCAG ATTGAAAGCC TGCGGCGGGA GAAAGATCTC TTATATAGTT
AAAGGCTAA
 
Protein sequence
MIKTNNLKVR SITPIIAPTD LRQVFPMSEK SGTCVSSSRA AITRILKGED KRLMVVVGPC 
SIHDPKGALE YAEKLAALAK EVSEEMLLIM RVYFEKPRTT IGWKGLINDP DMNGTHQISK
GLGIARGLLC KITEMGLPVA TEMLDPITPE YLADLLSWGA IGARTTESQT HREMASGLSF
AIGFKNGTDG NLQIAIDAMK AALHSHSFLG INRDGLTSII QTTGNPDVHM VLRGGSKKPN
YSPEDIAKSE EMIAKAGLTP TMMVDCSHGN SEKKYERQPE VMKSVIDQIA AGNRSISGVM
IESYLKEGNQ PMPKDGDPSS LAYGVSITDS CINWETTEAT LREAHRRLKA CGGRKISYIV
KG