Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2091 |
Symbol | purC |
ID | 2686733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 2297786 |
End bp | 2298676 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637126782 |
Product | phosphoribosylaminoimidazole-succinocarboxamide synthase |
Protein accession | NP_953140 |
Protein GI | 39997189 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0152] Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase |
TIGRFAM ID | [TIGR00081] phosphoribosylaminoimidazole-succinocarboxamide synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.345869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAC TGGTGCTCAA GACCGATTTC CCCGACCTGA AACTTGCCGG CCGCGGCAAG GTACGCGACA TTTACGATCT GGGCGACGCG CTGCTCATCG TCACCACCGA CCGGATCTCG GCCTTCGACG TCATCATGAA CGAGGCGATT CCCGACAAGG GGTACGTCCT CACCCAGATA TCCTCTTTCT GGTTCCGCCA GATGGAGGAC ATCATCCCCA ACCACATCAT CTCCACCGAC GTGAAGGACT TCCCGGCCGA GTGCCAGAAG TATGCGGCCC AGCTCGAAGG CCGTTCCATG CTGGTCAAGA AGGCCAAGCC CCTGCCGGTG GAATGCATCG TGCGCGGCTA CATCTCCGGC TCCGGCTGGA AGGACTACAA GGCCACCGGC GCCATCTGCG GCATCACGCT CCCGGCCGGC CTCGTGGAAT CCGACAAGCT GGAGGAGCCC ATCTTCACCC CCTCCACCAA GGCGGAACTG GGTGAGCATG ACGAGAACAT CTCCTTCGAC AAGTGCGTGG AGCTGATCGG GCGTGAACTG GCCGAAAAGA TCCGCGACGT CACCATCGCC ATCTACAAGC GGGCCCGGGA CATCGCCGAC ACCAAAGGAA TCATCATCGC CGACACCAAG TTCGAGTACG GCATCTACAA CGGCGAACTG ATCATCATCG ACGAGTGCAT GACCCCGGAC TCCTCGCGCT TCTGGCCCAA GGACAGCTAC AAGCCGGGCG GCGCCCAGCC CTCCTTTGAC AAGCAGTTCC TGCGGGACTA CCTGGAGACC CTCGACTGGG GCAAAACCGC GCCGGCGCCG CCCCTGCCCG AGGAAATCGT CCGCAAGACC GGCGAGAAGT ACATGGAAGC CCTGGTCCGT CTCACCGGAA AAGGCAAGTA A
|
Protein sequence | MAELVLKTDF PDLKLAGRGK VRDIYDLGDA LLIVTTDRIS AFDVIMNEAI PDKGYVLTQI SSFWFRQMED IIPNHIISTD VKDFPAECQK YAAQLEGRSM LVKKAKPLPV ECIVRGYISG SGWKDYKATG AICGITLPAG LVESDKLEEP IFTPSTKAEL GEHDENISFD KCVELIGREL AEKIRDVTIA IYKRARDIAD TKGIIIADTK FEYGIYNGEL IIIDECMTPD SSRFWPKDSY KPGGAQPSFD KQFLRDYLET LDWGKTAPAP PLPEEIVRKT GEKYMEALVR LTGKGK
|
| |