Gene Gmet_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_1941 
Symbol 
ID3741592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp2165657 
End bp2168647 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content63% 
IMG OID637779233 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_384895 
Protein GI78223148 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
[COG1828] Phosphoribosylformylglycinamidine (FGAM) synthase, PurS component 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGAC GAATTGAGAT TGCCCTCAAG GAAGGCGTCC GTGACGCCCG TGGCGAGCGG 
ATCAAACGCG AGATCGAGCA TTTTCTCCAT CTGCCGGTGG AAGCGGTGCG GACCATCGAC
GTCTACACCG TTGATGCGAA GCTTTCCGAG GAGGAGCTTG TCAAGGCCGC CTCGGAGCCC
TTCTGCGACC CGGTCATCCA GGTCTGGAGC ATCGACCGTC CCCTGGCTGC CGGTTTCGAT
TTTCTCGTGG AGGTGGGATA CCGTCCCGGG GTTACCGACA ACGTGGGGCG CACCGGCCGG
GAAGCAATCG AGTACATCAC CGGCCGGCCC ATGGAGGCGG GGGAGGGGGT CTACACCTCG
GTCCAGTACC TTCTGAGCGG GAAACTCTCC CGTGCCGACG TGGAGCGGAT CGCCCGGGAC
CTCCTCGGCA ACGCCCTTAT CCAGCGCTTC GTGGTTCTGG ACAGGGCCGA GTTCGCGGCC
CAGGGGGGAG TGCCGGTTTC GGTCCCCAAA GTTCAGGGCG AAACCAAAGC CCAGGTCAGG
GATATCGACC TGGAGGTTTC CGACGATGAG CTGCTGCGTA TCAGCAAGGA CGGGGTTCTG
GCCCTGACCC TGGAGGAGAT GAAGATCATC CAGGCCCACT ACCGGGATCC GAAGGTGCAG
GAGGAGCGCC GGAAGCAGGG TCTCGGCGTT AGACCGACCG ACGTGGAACT GGAGTGCCTG
GCCCAGACCT GGTCCGAACA CTGCAAACAC AAGATCTTCG CCGGCACGGT TCACTACGAG
GATGAGCAGG GGAACCGGCA GGAGATCAAG TCCCTCTTTA AGTCGTTCAT CCAGCGCACC
ACAAAGGACG TGCGGGAAAA GCTGGGCGAC AGCGACTTTT GCCTCTCGGT CTTCAAGGAC
AACGCCGGGG TCATTCGGTT CAACGACGAC TGGTCCTTGG TCTTCAAGGT GGAGACCCAC
AACTCTCCAT CAGCCCTCGA CCCCTACGGC GGGGCTCTCA CCGGCATCGT CGGCGTGAAC
AGGGATCCCT TCGGCACCGG CAAAGGTGCC AAGCTCATCT TCAACACGGA CGTTTTCTGC
TTCGCCGATC CCTTCTACGA TAAGCCCTTG CCGTCGCGGC TCCTGCATCC CCGGCGCATC
TACGAAGGGG TCGTGGAGGG AGTCGAGCAC GGTGGCAACA AGAGCGGCAT CCCCACGGTG
AACGGCTCTC TCGTTTTTGA CGAGCGATTT GCCGGCAAGC CCCTGGTTTT CTGCGGCACG
GCCGGGATCA TGCCGGCCAC CCTGAACGGC GAGTTGGGCC ACGAGAAAGC AATCAAGCCG
GGTGACCTGA TCGTCATGAC CGGCGGCCGC ATCGGCAAGG ACGGCATCCA CGGCGCCACC
TTCTCCTCCG AGGAGCTGAA CGAGAATTCT CCGGTTACGG CGGTCCAGAT CGGCGACCCC
ATCACCCAGA AGCGGATGTT CGATTTCCTC ATCCGTGCCC GGGACAAGGG GCTCTACCGA
TTTATCACCG ACAACGGCGC CGGCGGGCTC TCCTCATCCA TCGGGGAAAT GTCGGGCGAG
TGCGGCGGCT GCCGGATGGA CCTGGAAAAG GCCCCCCTCA AGTATCCGGG GCTTGATCCG
TGGGAGATCC TCATCTCCGA AGCTCAGGAG CGGATGAGCC TTGCGGTGCC CCCCGAGCAG
ATCGACGAGT TTCTGGCCAT GGCGAAGCGC TTCAACGTCG AGGCGACGGT ATTGGGTGAG
TTCACCGACA GCGGCTACTT CCATATCCTC TACGGTGACC GCACCGTGGC CTGGCTCCCC
ATGGAGTTCA TGCACGAGGG GCTTCCCCCC ATGCAGCTTC CGGCCAAATG GGTGCCACCT
CGCCATGCCG AACCGACCCT TCAGGTGAAA GGTGATTACA CCGCGGACCT GAAGGCGCTC
CTCGGTTCCC TCAACATCTG CTCCAAGGAG TCGGTGGTGC GCCGTTACGA CCATGAGGTG
CAGGGGGGAA GCGTCGTGAA GCCCTTCACT GGCGTGGCCA ACGACGGCCC CTCCGATGCC
GCGGTGGTCC GCCCAATCCT CGACTCCTTC GAGGGGGTGG TGGTTGCCCA CGGCATCTGT
CCCCGCTACT CGGACATCGA CACCTACCAC ATGACCGCCA ACGCCATTGA CGAAGCCCTG
CGCAACTACG TGGCCGTGGG TGGCGCCCTG GATCTCGTGG CGGGGCTCGA TAACTTCTGC
TGGTGCGATC CGGTGAGGTC GGAAAAGACT CCCGACGGCG AGTACAAGAT GGCCCAACTG
GTGCGGGCGA ACCAGGCCCT CTACGACGTC TGCCTCGCCT ACAACCTGCC GCTCATCTCG
GGCAAGGACT CCATGAAAAA CGACTTCTAC GACGGGGCCA CCAAGATCTC CATCCCGCCG
ACCATCCTCT TCTCGGTCAT CGGCAAGATA GAAGACGCCC GCAGAAGCGT CACCATGGAT
GCCAAGCGCC CCGGCGACGT CGTTTACCTC CTGGGGAGAA CCGGCAACGA ACTGGGGGGC
TCCGAATATC TGGCATTGCA GGGTGTCATC GGCAACAACG TGCCGACGGT GAACCCCGAA
AAGGCCCTCA AGCGCTACAA CGCCCTGCAC ACGGCCATCA CCGGCGGGCT TGTGGCCTCC
TGCCACGACC TCTCCGACGG CGGCCTCGCC GTGGCCCTGG CCGAGAAGGC CTTTGCCGGC
GGTTACGGTA TTTCCGCGGA TCTGGGCAAG GTTCTCTGGA CGGGCGACGA CGCCATGCGC
AATGATGCGG CTCTGCTCTT CTCCGAGTCG GCGTCACGGC ACCTGGTCAC GGTGCGTCCC
GGGAACCGCG ATGCTTTCGA GGCGATCATG GCGGGGAACT GCTTCTCTGC CATCGGTGTG
GTGACGGAAG AGGGGGCGGT GCGTATCACG GGCCTTTCCG GCCAGGCGGT GGTCGAAGCA
AGCATCGACG AACTGAAAGA AGCGTGGCAG AGCCCGCTGA GGGAGCTGTA A
 
Protein sequence
MARRIEIALK EGVRDARGER IKREIEHFLH LPVEAVRTID VYTVDAKLSE EELVKAASEP 
FCDPVIQVWS IDRPLAAGFD FLVEVGYRPG VTDNVGRTGR EAIEYITGRP MEAGEGVYTS
VQYLLSGKLS RADVERIARD LLGNALIQRF VVLDRAEFAA QGGVPVSVPK VQGETKAQVR
DIDLEVSDDE LLRISKDGVL ALTLEEMKII QAHYRDPKVQ EERRKQGLGV RPTDVELECL
AQTWSEHCKH KIFAGTVHYE DEQGNRQEIK SLFKSFIQRT TKDVREKLGD SDFCLSVFKD
NAGVIRFNDD WSLVFKVETH NSPSALDPYG GALTGIVGVN RDPFGTGKGA KLIFNTDVFC
FADPFYDKPL PSRLLHPRRI YEGVVEGVEH GGNKSGIPTV NGSLVFDERF AGKPLVFCGT
AGIMPATLNG ELGHEKAIKP GDLIVMTGGR IGKDGIHGAT FSSEELNENS PVTAVQIGDP
ITQKRMFDFL IRARDKGLYR FITDNGAGGL SSSIGEMSGE CGGCRMDLEK APLKYPGLDP
WEILISEAQE RMSLAVPPEQ IDEFLAMAKR FNVEATVLGE FTDSGYFHIL YGDRTVAWLP
MEFMHEGLPP MQLPAKWVPP RHAEPTLQVK GDYTADLKAL LGSLNICSKE SVVRRYDHEV
QGGSVVKPFT GVANDGPSDA AVVRPILDSF EGVVVAHGIC PRYSDIDTYH MTANAIDEAL
RNYVAVGGAL DLVAGLDNFC WCDPVRSEKT PDGEYKMAQL VRANQALYDV CLAYNLPLIS
GKDSMKNDFY DGATKISIPP TILFSVIGKI EDARRSVTMD AKRPGDVVYL LGRTGNELGG
SEYLALQGVI GNNVPTVNPE KALKRYNALH TAITGGLVAS CHDLSDGGLA VALAEKAFAG
GYGISADLGK VLWTGDDAMR NDAALLFSES ASRHLVTVRP GNRDAFEAIM AGNCFSAIGV
VTEEGAVRIT GLSGQAVVEA SIDELKEAWQ SPLREL