Gene Shewmr4_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3020 
SymbolpurT 
ID4253591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3606924 
End bp3608099 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID638119662 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_735148 
Protein GI113971355 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.190979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGAA CTCCCTACAC AGAGGGCGCT CGACGCGCCA TGTTGCTTGG CTGCGGTGAG 
CTAGGTAAAG AAGTCGCCAT CGAGCTCCAA CGCTTAGGTG TTGAAGTGAT TGGCGTCGAT
CGTTATCCCA ATGCCCCCGC CATGCAAATT GCCCATCGCT CCCATGTGAT CAATATGCTC
GATGCAAAAG CGCTTCGCGC CATTATCGAG CTAGAAAAGC CCCACTTAGT GATCCCCGAA
ATTGAAGCTA TTGCCACTCA AACCTTAGTT GAGATGGAAG CCGAAGGCCT CAATGTCGTG
CCGACAGCGC GCGCAACTCA GCTGACCATG GACAGAGAAG GCATTCGTCG CCTCGCCGCC
GAAACCTTAG GTCTGCCGAC CTCGCCCTAT TTCTTCTGCG ACACTGAAAC CGAATTTAAT
CAAGCCATTG GCAAGATTGG CGTGCCCTGC GTAGTCAAAC CCGTGATGAG CTCATCGGGC
AAGGGCCAAA GTGTTATCCG TGATGTATCC CAAAGCGCCA AAGCCTGGCA ATATGCCCAA
GAAGGCGGCC GCGCGGGCGG TGGTCGTGTG ATTGTCGAAG GCTTTATCCC CTTCGATTAC
GAAATTACCC TGCTGACCAT TAGCGCAGTC AATGGCATCC ACTTCTGCGC GCCAATTGGC
CACAGGCAAG AAGACGGCGA CTACCGCGAG TCATGGCAAC CTCAAGCCAT GTCGGCCGAC
GTGCTAGCAA AATCCCAAGC AATCGCCAGC AAAGTGGTGG AAGCCCTCGG CGGTTACGGC
TTATTTGGGG TCGAGCTGTT TGTGAAGGGC AGCGATGTGT ACTTCTCTGA AGTCTCGCCT
CGTCCGCACG ATACCGGTTT AGTCACCTTA ATTAGCCAAG ATTTATCCGA GTTTGCACTG
CATGTCAGGG CAATTCTTGG CCTGCCGATT CCGAATATCC ATCAACATGG CCCCAGCGCC
TCGGCGGTAG TATTGGTGGA AGGCAAATCG AAAAACATTC GCTATCAAGG TCTTGCCGAT
GCCTTGGCGG CGGAAAATAC TCAGCTCAGA TTATTCGCTA AGCCTGAAAT CGATGGTCGC
CGCCGTTTAG GGGTTGCCCT CGCCCGCGAT AAAGATATCG AAAGCGCAGT CAATAAAGCG
CTGGATAGTG CATCTAAGGT AAAAGTGATT TTCTAG
 
Protein sequence
MIGTPYTEGA RRAMLLGCGE LGKEVAIELQ RLGVEVIGVD RYPNAPAMQI AHRSHVINML 
DAKALRAIIE LEKPHLVIPE IEAIATQTLV EMEAEGLNVV PTARATQLTM DREGIRRLAA
ETLGLPTSPY FFCDTETEFN QAIGKIGVPC VVKPVMSSSG KGQSVIRDVS QSAKAWQYAQ
EGGRAGGGRV IVEGFIPFDY EITLLTISAV NGIHFCAPIG HRQEDGDYRE SWQPQAMSAD
VLAKSQAIAS KVVEALGGYG LFGVELFVKG SDVYFSEVSP RPHDTGLVTL ISQDLSEFAL
HVRAILGLPI PNIHQHGPSA SAVVLVEGKS KNIRYQGLAD ALAAENTQLR LFAKPEIDGR
RRLGVALARD KDIESAVNKA LDSASKVKVI F