Gene GM21_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0143 
SymbolpurT 
ID8135446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp174704 
End bp175882 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content65% 
IMG OID644867762 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_003019986 
Protein GI253698797 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2
[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones100 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGGA CGCCGCTGAA AAATAGTGCG ACGAGGATGA TGCTGCTCGG CTCCGGCGAG 
CTCGGCAAGG AAGTGGCCCT GGAGGCGCAG AGGCTGGGGA TCGAGGTGAT AGCGGTGGAC
CGTTATGCCG ACGCTCCGGC GATGCAGGTA GCCCACAGAA GCCATGTGAT CGATATGCTG
GACCGCGAGC AGCTGGACCG GGTGGTGCGC CTGGAGAACC CCTCGCTGAT CGTGCCGGAA
ATCGAGGCGA TCAACACCGT TTACCTTTTG GAGCTTGAGA AGGAAGGTTT CAACGTCATT
CCTACCGCAC GCGCCGCGAA CCTTACCATG AACCGCGAGG GGATACGCAG GCTCGCCGCC
GAGGAACTGG GGCTTCCCAC CGCCGCCTAC CGTTTCGCCA CCAGCACGGA GGAGTTCCGC
GAGGCCGTTG CCGCCATAGG TCTTCCCTGC GTGGTGAAGC CGATCATGAG TTCCTCGGGC
AAGGGGCAGA GTGTGCTGCG CGACGAGGCT GACATGGAGC GCTGCTTCAA GTACGCCATT
GAGGGGGCGC GCGGCGCCTC GAACAAGGTC ATCGTGGAGC AGTTCATCCC GTTCGACTAC
GAGATCACGC TCCTCACCGT CCGGCACGTC GGCGGCACCA GCTTCTGTCC CCCCATCGGG
CACCGCCAGA TCGACGGAGA CTACCACGAG TCGTGGCAGC CGACCCCGAT GGTCCCCGCG
GTGCTGGCCG AAGCTCAGCG CCAGGCCGAG GCCGTCACCG GCTCTCTGGG CGGGCGCGGC
ATCTTCGGCG TAGAGTTCTT CGTCACCGGC GACAAGGTCT GGTTCTCCGA GGTCTCCCCC
CGGCCGCACG ACACCGGGAT GGTCACCATG ATCTCCCAGA ACCTCTCCGA GTTCGAGTTG
CACGTGCGCG CCATCCTGGG TCTGCCGGTT CCGCAGGTGG AGACCCTGGG ATGCGCCGCT
TCGCACGTCA TCCTGTCGGA GGGGGAGGCC GCGGAGGTCT CCTTCGAGGG AGTGGCAAAG
GCCCTGGAGA TCCCTGGCAG CAAGCTGCGC CTTTTCGGCA AGCCCGACAC CAGGAAGGGG
CGCCGCATGG GTGTTGCACT AGCCTTCGGC GCCGACTGCG ACGAGGCCCG CCGGAAGGCC
GAGCAATCCG CCCACTGTGT GGGCATCGTA AAACGCTAG
 
Protein sequence
MIGTPLKNSA TRMMLLGSGE LGKEVALEAQ RLGIEVIAVD RYADAPAMQV AHRSHVIDML 
DREQLDRVVR LENPSLIVPE IEAINTVYLL ELEKEGFNVI PTARAANLTM NREGIRRLAA
EELGLPTAAY RFATSTEEFR EAVAAIGLPC VVKPIMSSSG KGQSVLRDEA DMERCFKYAI
EGARGASNKV IVEQFIPFDY EITLLTVRHV GGTSFCPPIG HRQIDGDYHE SWQPTPMVPA
VLAEAQRQAE AVTGSLGGRG IFGVEFFVTG DKVWFSEVSP RPHDTGMVTM ISQNLSEFEL
HVRAILGLPV PQVETLGCAA SHVILSEGEA AEVSFEGVAK ALEIPGSKLR LFGKPDTRKG
RRMGVALAFG ADCDEARRKA EQSAHCVGIV KR