Gene Information |
Locus tag | GM21_0143 |
Symbol | purT |
ID | 8135446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 174704 |
End bp | 175882 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644867762 |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | YP_003019986 |
Protein GI | 253698797 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) |
TIGRFAM ID | [TIGR01142] phosphoribosylglycinamide formyltransferase 2; [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
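The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. Both assumptions are consistent with the values in this record, but the record does not state them.

```python
# Values copied from the Gene Information table.
START_BP = 174704
END_BP = 175882
REPORTED_GENE_LENGTH = 1179      # bp
REPORTED_PROTEIN_LENGTH = 392    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length):
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)                     # True
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)  # True
```

Here 175882 - 174704 + 1 = 1179 bp, and 1179 / 3 = 393 codons, of which one is the stop codon, leaving 392 residues.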
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 100 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGGGA CGCCGCTGAA AAATAGTGCG ACGAGGATGA TGCTGCTCGG CTCCGGCGAG CTCGGCAAGG AAGTGGCCCT GGAGGCGCAG AGGCTGGGGA TCGAGGTGAT AGCGGTGGAC CGTTATGCCG ACGCTCCGGC GATGCAGGTA GCCCACAGAA GCCATGTGAT CGATATGCTG GACCGCGAGC AGCTGGACCG GGTGGTGCGC CTGGAGAACC CCTCGCTGAT CGTGCCGGAA ATCGAGGCGA TCAACACCGT TTACCTTTTG GAGCTTGAGA AGGAAGGTTT CAACGTCATT CCTACCGCAC GCGCCGCGAA CCTTACCATG AACCGCGAGG GGATACGCAG GCTCGCCGCC GAGGAACTGG GGCTTCCCAC CGCCGCCTAC CGTTTCGCCA CCAGCACGGA GGAGTTCCGC GAGGCCGTTG CCGCCATAGG TCTTCCCTGC GTGGTGAAGC CGATCATGAG TTCCTCGGGC AAGGGGCAGA GTGTGCTGCG CGACGAGGCT GACATGGAGC GCTGCTTCAA GTACGCCATT GAGGGGGCGC GCGGCGCCTC GAACAAGGTC ATCGTGGAGC AGTTCATCCC GTTCGACTAC GAGATCACGC TCCTCACCGT CCGGCACGTC GGCGGCACCA GCTTCTGTCC CCCCATCGGG CACCGCCAGA TCGACGGAGA CTACCACGAG TCGTGGCAGC CGACCCCGAT GGTCCCCGCG GTGCTGGCCG AAGCTCAGCG CCAGGCCGAG GCCGTCACCG GCTCTCTGGG CGGGCGCGGC ATCTTCGGCG TAGAGTTCTT CGTCACCGGC GACAAGGTCT GGTTCTCCGA GGTCTCCCCC CGGCCGCACG ACACCGGGAT GGTCACCATG ATCTCCCAGA ACCTCTCCGA GTTCGAGTTG CACGTGCGCG CCATCCTGGG TCTGCCGGTT CCGCAGGTGG AGACCCTGGG ATGCGCCGCT TCGCACGTCA TCCTGTCGGA GGGGGAGGCC GCGGAGGTCT CCTTCGAGGG AGTGGCAAAG GCCCTGGAGA TCCCTGGCAG CAAGCTGCGC CTTTTCGGCA AGCCCGACAC CAGGAAGGGG CGCCGCATGG GTGTTGCACT AGCCTTCGGC GCCGACTGCG ACGAGGCCCG CCGGAAGGCC GAGCAATCCG CCCACTGTGT GGGCATCGTA AAACGCTAG
|
Protein sequence | MIGTPLKNSA TRMMLLGSGE LGKEVALEAQ RLGIEVIAVD RYADAPAMQV AHRSHVIDML DREQLDRVVR LENPSLIVPE IEAINTVYLL ELEKEGFNVI PTARAANLTM NREGIRRLAA EELGLPTAAY RFATSTEEFR EAVAAIGLPC VVKPIMSSSG KGQSVLRDEA DMERCFKYAI EGARGASNKV IVEQFIPFDY EITLLTVRHV GGTSFCPPIG HRQIDGDYHE SWQPTPMVPA VLAEAQRQAE AVTGSLGGRG IFGVEFFVTG DKVWFSEVSP RPHDTGMVTM ISQNLSEFEL HVRAILGLPV PQVETLGCAA SHVILSEGEA AEVSFEGVAK ALEIPGSKLR LFGKPDTRKG RRMGVALAFG ADCDEARRKA EQSAHCVGIV KR
|
| |
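The sequence fields above are printed in space-separated blocks of 10 characters. A minimal sketch of how they could be checked against the rest of the record (GC content, translation) follows. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1; table 11 differs only in the alternative start codons it permits, and this sketch does not model those.

```python
# Standard codon-to-amino-acid assignments (shared by translation
# tables 1 and 11), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS codon by codon, dropping one trailing stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGATAGGGA CGCCGCTGAA AAATAGTGCG"))  # MIGTPLKNSA
```

Pasting the full gene sequence into `translate` and comparing the result with the protein sequence (after `clean`), or into `gc_fraction` and comparing with the reported 65%, gives a quick consistency check of the record.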