Gene SNSL254_A2042 details

Gene Information

Locus tag: SNSL254_A2042
Symbol: purT
ID: 6484155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1981288
End bp: 1982466
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 642737398
Product: phosphoribosylglycinamide formyltransferase 2
Protein accession: YP_002041148
Protein GI: 194444508
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)
TIGRFAM ID: [TIGR01142] phosphoribosylglycinamide formyltransferase 2
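
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1981288, 1982466)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Under these assumptions the computed values agree with the listed Gene Length (1179 bp) and Protein Length (392 aa).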


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.489284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTAT TAGGCACTGC GCTGCGTCCG GCAGCAACGC GAGTGATGTT ATTAGGGGCA 
GGAGAATTAG GAAAAGAGGT GGCGATTGAA TGCCAACGCC TGGGGATCGA GGTTATCGCC
GTCGATCGCT ATCCTGATGC TCCCGCCATG CATGTGGCTC ACCGTTCACA CGTCATTAAT
ATGCTGGACG GCGAGGCGCT ACGTCATGTG ATTACAGAGG AAAAACCGCA TTATATCGTG
CCGGAAATAG AAGCGATCGC CACCGATACG CTGCGCGAGC TGGAGGGCGA AGGGCTGAAT
GTCGTGCCTT GCGCCCGTGC AACGCAGCTC ACGATGAACC GCGAAGGGAT CCGTCGCCTG
GCCGCAGAAG AATTAGGACT GCCGACATCG ACGTATCGCT TTGCCGACAG TGAGGCCAGT
TTTCATGATG CGGTAGCCGC AGTGGGTTTT CCTTGCATCG TCAAACCGGT CATGAGCTCT
TCCGGCAAAG GCCAGAGCTT TATCCGCTCG GCCGAACAGC TCGCGCAGGC ATGGGAGTAT
GCTCAACAGG GCGGACGCGC TGGCGCGGGT CGCGTGATTG TGGAAGGCGT GGTTAAATTT
GATTTTGAAA TTACGCTGCT CACCGTTAGC GCCGTCGATG GCGTGCATTT CTGCGCGCCG
GTCGGTCATC GTCAGCAAGA TGGTGACTAT CGCGAATCCT GGCAGCCACA GCAGATGAGC
GAACTGGCGC TGAAGCGGGC GCAAGAGATT GCTCGTCATG TGGTACTGGC GTTAGGCGGT
CATGGCCTGT TCGGCGTTGA ACTCTTCGTC TGTGGCGATG AAGTCATTTT CAGCGAAGTC
TCCCCTCGCC CGCACGATAC CGGAATGGTC ACGTTGATTT CTCAGGATCT CTCTGAGTTT
GCGCTGCATG TGCGCGCCTT TCTGGGAATG CCCGTAGGCG CTATTCGCCA GTATGGTCCC
GCTGCCTCGG CCGTGATTCT GCCGCAGCTT ACCAGTCAAA ATGTGACGTT TGATAATGTA
CACGCGGCGG TAGGAGCCGG AGTGCAGGTA CGGCTGTTTG GTAAGCCTGA GATCGACGGC
AGCCGTCGTC TTGGTGTAGC GTTAGCGACA GGTGAAAACG TTGAAGAAGC GGTGATAAGA
GCGAAAAAGG CCGCCAGCCG CGTGACGGTA AAAGGCTAA
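
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; how the listed 57% was actually computed and rounded is not stated on this page. The function name `gc_percent` is illustrative.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
print(gc_percent("ATGACGCTAT"))  # 40.0
```

Passing the complete gene sequence, with or without the spaces between 10-base blocks, gives a value that can be compared with the listed GC content.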
 
Protein sequence
MTLLGTALRP AATRVMLLGA GELGKEVAIE CQRLGIEVIA VDRYPDAPAM HVAHRSHVIN 
MLDGEALRHV ITEEKPHYIV PEIEAIATDT LRELEGEGLN VVPCARATQL TMNREGIRRL
AAEELGLPTS TYRFADSEAS FHDAVAAVGF PCIVKPVMSS SGKGQSFIRS AEQLAQAWEY
AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQQDGDY RESWQPQQMS
ELALKRAQEI ARHVVLALGG HGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF
ALHVRAFLGM PVGAIRQYGP AASAVILPQL TSQNVTFDNV HAAVGAGVQV RLFGKPEIDG
SRRLGVALAT GENVEEAVIR AKKAASRVTV KG