Gene GM21_1889 details


Gene Information

Locus tag: GM21_1889
Symbol:
ID: 8137223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2195324
End bp: 2198314
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 65%
IMG OID: 644869503
Product: Phosphoribosylformylglycinamidine synthase
Protein accession: YP_003021700
Protein GI: 253700511
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
TIGRFAM ID: [TIGR01736] phosphoribosylformylglycinamidine synthase II
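
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes the start and end positions are inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2195324
end_bp = 2198314
gene_length_bp = 2991
protein_length_aa = 996

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp == gene_length_bp)                        # 2991 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 996 True
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.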


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCACA GGATTGAAAT CACCCTAAAG GACGAGGTCC GCGACCCTCG CGGCGAACGT 
GTCAAGCGGG AGATCGAGCA TTTTCTCCAT CTCAACGTGG ACCAGGTCCG GACCATAGAC
GTATATACGG TTGACGCGCA GCTGACCCCC GAGGAACTGG GGCAGGTTGC GGCCGGCCCC
TTCAGCGACC CGGTCATCCA GAACTACAGC GTCGGCAAAC CCGCCGCTTC CGGCTTCGAC
TATCTGGTGG AGGTCGGGTT CCGTCCCGGC GTCACCGACA ACGTGGGGCG CACCGCGGGC
GAGGCTATCG GATACCTCTT GGGCCGCCCG CTCGCTTTGG GCGAGGGGGT CTACACCTCG
GTGCAGTACC TGATCTCCGG GAAACTCTCC CGAGAGGACG CCGAGAAGAT CGCCACCGGT
CTTTTGTGCA ACACGCTGAT CCAGCGCTAC CAGATCCTCG ACGCCGCCTC CTTCAAGGCG
CAGGGGGGCG TGGCTCCCTT CGTGCCCAAG GTGCAGGGGG AGGCGAAGGT CGAGGTGAAC
AGCGTCAACC TCGAGGTCTC CGACGAGGAG TTGATGCGCA TCAGCCGCGA CGGCGTGCTG
GCGCTCACCC TGGACGAGAT GAAGATCATC CAGGCGCACT ACCGCGACCC GAAGGTGCTT
GAGGCGCGTA AAAACGTCGG CCTTGGCGCG GCGCCCACCG ACGTGGAGCT AGAGGCGCTG
GCGCAGACCT GGTCCGAGCA CTGCAAGCAC AAGATCTTCT CCGCCGACGT CTCCTACGAG
GACGGCGAGG GGAACCGCGA GGAGATCAAG TCGCTGTTCA AGAGCTTCAT CCAGAAGACC
ACCGCCGACG TGCGCGCCGC TCTCGGCGAC AAGGATTACT GCCTCTCCGT CTTCAAGGAC
AACGCCGGCG TCATCCGCTT CAACGACGAC TGGTCGCTGG TCTTCAAGGT CGAAACCCAC
AACTCCCCCT CCGCGCTCGA CCCCTACGGC GGGGCGCTGA CCGGCATCGT CGGCGTCAAC
CGCGACCCGT TCGGGACCGG CAAGGGTGCC AAGCTCATCT TCAACACCGA CGTCTTCTGC
TTCGCCGATC CTTTCTACGA AAAAGAGCTT CCCAAGCGCC TCCTGCACCC GCGCCGGATC
TACGAGGGGG TCGTGGAAGG GGTGGAGCAC GGCGGCAACA AAAGCGGCAT CCCGACCGTC
AACGGCTCGC TGGTCTTCGA CGACCGCTTC GCCGGAAAGC CGCTGGTATT TTGCGGCACC
GCCGGGATCA TGCCCGCGAA GATCAAGGAT GAGCCCTCGC ACCAGAAAAA GATCGTCCCG
GGCGACCTGA TCGTGATGAC CGGCGGCCGG ATCGGCAAGG ACGGCATCCA CGGCGCCACA
TTCTCCTCCG AGGAGCTGAA CGAGAATTCG CCGGTGTCCG CGGTGCAGAT CGGCGACCCG
ATCACCCAGA AGAGGATGTT CGACTTCCTG ATCCGCGCCC GCGACAAGGG GCTTTACCGC
TTCATCACCG ACAACGGGGC GGGGGGGCTT TCCTCCTCCA TCGGCGAGAT GTCCGAGGAG
TGCGGCGGCT GCCAGTTGGA TCTCTCCCTG GCGCCCCTTA AGTACCCGGG GCTCGCCCCC
TGGGAGATCC TGATCTCCGA GGCCCAGGAG CGCATGAGCC TCGCGGTCCC GCCGGAGAAC
ATCGACGCCT TCATGGAGAT GGCCAAGCGC TTCAACGTGG AGGCGACCGT GCTCGGGAGC
TTCACCGATT CCGGCATCTT CCACATGCTC TACGGCGAGA AGACCGTGGC CTACCTGCCG
CTTTCCTTCA TGCACGCGGG GCTGCCGCCG ATGCAGATCC CGGCCCGCTG GTCAAAGCCG
GTGCACGAAG AGCCGGCCGT CGCGGAAGCA GCCGACTACT CAGGCGATCT GAAGGGGCTT
CTCTCCTCGC TCAATATCTG CTCCAAGGAG AGCGTGGTGC GCCGCTACGA CCACGAGGTA
CAGGGGGGTA GCGTCGTGAA ACCCTTCACC GGCGTCGACA ACGACGGCCC GTCCGACGCC
GCCGTGGTGC GCCCGATCCT CGACTCCTTC GAGGCGGTCG TCGTGGGGCA CGGCATCTGC
CCGCGCTACA GCGACATCGA CGCCTACGAC ATGGCGGCCA ACGCCATCGA CGAGGGGCTC
AGGAACTACG TCGCGGTGGG GGGCTCCCTC GACCTTCTGG CCGGTCTGGA CAACTTCTGC
TGGTGCGACC CGGTCCTTTC CGACCGGACC CCGGACGGGC CCTACAAGAT GGCCCAGTTG
GTGCGCGCCA ACAAGGCGCT CTACGACTAC TGCACCGTCT TCTCCATGCC GCTCATCTCA
GGCAAGGACT CGATGAAGAA CGACTTCTAC GACGGCACCA CCAAGATCTC CATTCCGCCG
ACCCTGCTCT TCTCGGTCAT CGGCAAGATG GAGGATGCGC GTCTGGCCGT CACCATGGAC
GCGAAGCGCG CAGGCGACCA GGTGTACCTC CTGGGCGCTA CCGCGAACGA ACTTGGCGCT
TCCGAGTACC TGGCGCAGAA GGGATATGTC GGCAACAACG TCCCCAAGGT CGACGCGAAC
GCCGCCCTCG CCACCTACCG CGCCTACCAC GGCGCGCTTA ACGCCGGGCT CGTCGCCTCC
TGCCATGACC TCTCCGACGG CGGGCTCGCC GTGGCCGCAG CCGAGAGCGC CTTTGCAGGC
GGCTTCGGCA TGGAAATAGA TCTGGCCAAG GTCGCCTTCA AGGGGGACGC CGCCGACAAG
AAGGACGCGG TCCTTCTCTT CTCCGAGTCC GCTTCGAGGC TTCTTGTGAC GGTACGCCCC
GAGAAGGTCG AGGCGTTCGA GAAGGCGCTC TCCGGGACCA CCTTCGCGAA GATCGGCGCG
GTGACCGAGG CGAAGAACGT TTCGGTGAAG GGGCTTTCCG GTAAGGTGGT GGTCGACTCC
GTCATTTCCG AGCTGAAGGC CGCCTGGCAG GAGCCGCTGA AGGATCTGTA A
 
Protein sequence
MPHRIEITLK DEVRDPRGER VKREIEHFLH LNVDQVRTID VYTVDAQLTP EELGQVAAGP 
FSDPVIQNYS VGKPAASGFD YLVEVGFRPG VTDNVGRTAG EAIGYLLGRP LALGEGVYTS
VQYLISGKLS REDAEKIATG LLCNTLIQRY QILDAASFKA QGGVAPFVPK VQGEAKVEVN
SVNLEVSDEE LMRISRDGVL ALTLDEMKII QAHYRDPKVL EARKNVGLGA APTDVELEAL
AQTWSEHCKH KIFSADVSYE DGEGNREEIK SLFKSFIQKT TADVRAALGD KDYCLSVFKD
NAGVIRFNDD WSLVFKVETH NSPSALDPYG GALTGIVGVN RDPFGTGKGA KLIFNTDVFC
FADPFYEKEL PKRLLHPRRI YEGVVEGVEH GGNKSGIPTV NGSLVFDDRF AGKPLVFCGT
AGIMPAKIKD EPSHQKKIVP GDLIVMTGGR IGKDGIHGAT FSSEELNENS PVSAVQIGDP
ITQKRMFDFL IRARDKGLYR FITDNGAGGL SSSIGEMSEE CGGCQLDLSL APLKYPGLAP
WEILISEAQE RMSLAVPPEN IDAFMEMAKR FNVEATVLGS FTDSGIFHML YGEKTVAYLP
LSFMHAGLPP MQIPARWSKP VHEEPAVAEA ADYSGDLKGL LSSLNICSKE SVVRRYDHEV
QGGSVVKPFT GVDNDGPSDA AVVRPILDSF EAVVVGHGIC PRYSDIDAYD MAANAIDEGL
RNYVAVGGSL DLLAGLDNFC WCDPVLSDRT PDGPYKMAQL VRANKALYDY CTVFSMPLIS
GKDSMKNDFY DGTTKISIPP TLLFSVIGKM EDARLAVTMD AKRAGDQVYL LGATANELGA
SEYLAQKGYV GNNVPKVDAN AALATYRAYH GALNAGLVAS CHDLSDGGLA VAAAESAFAG
GFGMEIDLAK VAFKGDAADK KDAVLLFSES ASRLLVTVRP EKVEAFEKAL SGTTFAKIGA
VTEAKNVSVK GLSGKVVVDS VISELKAAWQ EPLKDL
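
Both sequences above are printed in blocks of 10 characters, 60 per full line, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, with helpers for splitting into codons and computing a GC fraction. The function names are hypothetical, and only the first and last lines of the gene sequence are used as an excerpt.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str):
    """Split a sequence into 3-base codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Excerpt: first and last lines of the gene sequence listed above.
first_line = "ATGCCCCACA GGATTGAAAT CACCCTAAAG GACGAGGTCC GCGACCCTCG CGGCGAACGT"
last_line = "GTCATTTCCG AGCTGAAGGC CGCCTGGCAG GAGCCGCTGA AGGATCTGTA A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))        # 60 bases on a full line
print(codons(head)[0])  # ATG, the first codon of the gene
print(tail[-3:])        # TAA, the last three bases of the gene
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field in the record.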