Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1889 |
Symbol | |
ID | 8137223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2195324 |
End bp | 2198314 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869503 |
Product | Phosphoribosylformylglycinamidine synthase |
Protein accession | YP_003021700 |
Protein GI | 253700511 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain |
TIGRFAM ID | [TIGR01736] phosphoribosylformylglycinamidine synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCACA GGATTGAAAT CACCCTAAAG GACGAGGTCC GCGACCCTCG CGGCGAACGT GTCAAGCGGG AGATCGAGCA TTTTCTCCAT CTCAACGTGG ACCAGGTCCG GACCATAGAC GTATATACGG TTGACGCGCA GCTGACCCCC GAGGAACTGG GGCAGGTTGC GGCCGGCCCC TTCAGCGACC CGGTCATCCA GAACTACAGC GTCGGCAAAC CCGCCGCTTC CGGCTTCGAC TATCTGGTGG AGGTCGGGTT CCGTCCCGGC GTCACCGACA ACGTGGGGCG CACCGCGGGC GAGGCTATCG GATACCTCTT GGGCCGCCCG CTCGCTTTGG GCGAGGGGGT CTACACCTCG GTGCAGTACC TGATCTCCGG GAAACTCTCC CGAGAGGACG CCGAGAAGAT CGCCACCGGT CTTTTGTGCA ACACGCTGAT CCAGCGCTAC CAGATCCTCG ACGCCGCCTC CTTCAAGGCG CAGGGGGGCG TGGCTCCCTT CGTGCCCAAG GTGCAGGGGG AGGCGAAGGT CGAGGTGAAC AGCGTCAACC TCGAGGTCTC CGACGAGGAG TTGATGCGCA TCAGCCGCGA CGGCGTGCTG GCGCTCACCC TGGACGAGAT GAAGATCATC CAGGCGCACT ACCGCGACCC GAAGGTGCTT GAGGCGCGTA AAAACGTCGG CCTTGGCGCG GCGCCCACCG ACGTGGAGCT AGAGGCGCTG GCGCAGACCT GGTCCGAGCA CTGCAAGCAC AAGATCTTCT CCGCCGACGT CTCCTACGAG GACGGCGAGG GGAACCGCGA GGAGATCAAG TCGCTGTTCA AGAGCTTCAT CCAGAAGACC ACCGCCGACG TGCGCGCCGC TCTCGGCGAC AAGGATTACT GCCTCTCCGT CTTCAAGGAC AACGCCGGCG TCATCCGCTT CAACGACGAC TGGTCGCTGG TCTTCAAGGT CGAAACCCAC AACTCCCCCT CCGCGCTCGA CCCCTACGGC GGGGCGCTGA CCGGCATCGT CGGCGTCAAC CGCGACCCGT TCGGGACCGG CAAGGGTGCC AAGCTCATCT TCAACACCGA CGTCTTCTGC TTCGCCGATC CTTTCTACGA AAAAGAGCTT CCCAAGCGCC TCCTGCACCC GCGCCGGATC TACGAGGGGG TCGTGGAAGG GGTGGAGCAC GGCGGCAACA AAAGCGGCAT CCCGACCGTC AACGGCTCGC TGGTCTTCGA CGACCGCTTC GCCGGAAAGC CGCTGGTATT TTGCGGCACC GCCGGGATCA TGCCCGCGAA GATCAAGGAT GAGCCCTCGC ACCAGAAAAA GATCGTCCCG GGCGACCTGA TCGTGATGAC CGGCGGCCGG ATCGGCAAGG ACGGCATCCA CGGCGCCACA TTCTCCTCCG AGGAGCTGAA CGAGAATTCG CCGGTGTCCG CGGTGCAGAT CGGCGACCCG ATCACCCAGA AGAGGATGTT CGACTTCCTG ATCCGCGCCC GCGACAAGGG GCTTTACCGC TTCATCACCG ACAACGGGGC GGGGGGGCTT TCCTCCTCCA TCGGCGAGAT GTCCGAGGAG TGCGGCGGCT GCCAGTTGGA TCTCTCCCTG GCGCCCCTTA AGTACCCGGG GCTCGCCCCC TGGGAGATCC TGATCTCCGA GGCCCAGGAG CGCATGAGCC TCGCGGTCCC GCCGGAGAAC ATCGACGCCT TCATGGAGAT GGCCAAGCGC TTCAACGTGG AGGCGACCGT GCTCGGGAGC TTCACCGATT CCGGCATCTT CCACATGCTC TACGGCGAGA AGACCGTGGC CTACCTGCCG CTTTCCTTCA TGCACGCGGG GCTGCCGCCG ATGCAGATCC CGGCCCGCTG GTCAAAGCCG GTGCACGAAG AGCCGGCCGT CGCGGAAGCA GCCGACTACT CAGGCGATCT GAAGGGGCTT CTCTCCTCGC TCAATATCTG CTCCAAGGAG AGCGTGGTGC GCCGCTACGA CCACGAGGTA CAGGGGGGTA GCGTCGTGAA ACCCTTCACC GGCGTCGACA ACGACGGCCC GTCCGACGCC GCCGTGGTGC GCCCGATCCT CGACTCCTTC GAGGCGGTCG TCGTGGGGCA CGGCATCTGC CCGCGCTACA GCGACATCGA CGCCTACGAC ATGGCGGCCA ACGCCATCGA CGAGGGGCTC AGGAACTACG TCGCGGTGGG GGGCTCCCTC GACCTTCTGG CCGGTCTGGA CAACTTCTGC TGGTGCGACC CGGTCCTTTC CGACCGGACC CCGGACGGGC CCTACAAGAT GGCCCAGTTG GTGCGCGCCA ACAAGGCGCT CTACGACTAC TGCACCGTCT TCTCCATGCC GCTCATCTCA GGCAAGGACT CGATGAAGAA CGACTTCTAC GACGGCACCA CCAAGATCTC CATTCCGCCG ACCCTGCTCT TCTCGGTCAT CGGCAAGATG GAGGATGCGC GTCTGGCCGT CACCATGGAC GCGAAGCGCG CAGGCGACCA GGTGTACCTC CTGGGCGCTA CCGCGAACGA ACTTGGCGCT TCCGAGTACC TGGCGCAGAA GGGATATGTC GGCAACAACG TCCCCAAGGT CGACGCGAAC GCCGCCCTCG CCACCTACCG CGCCTACCAC GGCGCGCTTA ACGCCGGGCT CGTCGCCTCC TGCCATGACC TCTCCGACGG CGGGCTCGCC GTGGCCGCAG CCGAGAGCGC CTTTGCAGGC GGCTTCGGCA TGGAAATAGA TCTGGCCAAG GTCGCCTTCA AGGGGGACGC CGCCGACAAG AAGGACGCGG TCCTTCTCTT CTCCGAGTCC GCTTCGAGGC TTCTTGTGAC GGTACGCCCC GAGAAGGTCG AGGCGTTCGA GAAGGCGCTC TCCGGGACCA CCTTCGCGAA GATCGGCGCG GTGACCGAGG CGAAGAACGT TTCGGTGAAG GGGCTTTCCG GTAAGGTGGT GGTCGACTCC GTCATTTCCG AGCTGAAGGC CGCCTGGCAG GAGCCGCTGA AGGATCTGTA A
|
Protein sequence | MPHRIEITLK DEVRDPRGER VKREIEHFLH LNVDQVRTID VYTVDAQLTP EELGQVAAGP FSDPVIQNYS VGKPAASGFD YLVEVGFRPG VTDNVGRTAG EAIGYLLGRP LALGEGVYTS VQYLISGKLS REDAEKIATG LLCNTLIQRY QILDAASFKA QGGVAPFVPK VQGEAKVEVN SVNLEVSDEE LMRISRDGVL ALTLDEMKII QAHYRDPKVL EARKNVGLGA APTDVELEAL AQTWSEHCKH KIFSADVSYE DGEGNREEIK SLFKSFIQKT TADVRAALGD KDYCLSVFKD NAGVIRFNDD WSLVFKVETH NSPSALDPYG GALTGIVGVN RDPFGTGKGA KLIFNTDVFC FADPFYEKEL PKRLLHPRRI YEGVVEGVEH GGNKSGIPTV NGSLVFDDRF AGKPLVFCGT AGIMPAKIKD EPSHQKKIVP GDLIVMTGGR IGKDGIHGAT FSSEELNENS PVSAVQIGDP ITQKRMFDFL IRARDKGLYR FITDNGAGGL SSSIGEMSEE CGGCQLDLSL APLKYPGLAP WEILISEAQE RMSLAVPPEN IDAFMEMAKR FNVEATVLGS FTDSGIFHML YGEKTVAYLP LSFMHAGLPP MQIPARWSKP VHEEPAVAEA ADYSGDLKGL LSSLNICSKE SVVRRYDHEV QGGSVVKPFT GVDNDGPSDA AVVRPILDSF EAVVVGHGIC PRYSDIDAYD MAANAIDEGL RNYVAVGGSL DLLAGLDNFC WCDPVLSDRT PDGPYKMAQL VRANKALYDY CTVFSMPLIS GKDSMKNDFY DGTTKISIPP TLLFSVIGKM EDARLAVTMD AKRAGDQVYL LGATANELGA SEYLAQKGYV GNNVPKVDAN AALATYRAYH GALNAGLVAS CHDLSDGGLA VAAAESAFAG GFGMEIDLAK VAFKGDAADK KDAVLLFSES ASRLLVTVRP EKVEAFEKAL SGTTFAKIGA VTEAKNVSVK GLSGKVVVDS VISELKAAWQ EPLKDL
|
| |