Gene SeHA_C2097 details


Gene Information

Locus tag: SeHA_C2097
Symbol: purT
ID: 6489481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2024685
End bp: 2025863
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 642742293
Product: phosphoribosylglycinamide formyltransferase 2
Protein accession: YP_002045936
Protein GI: 194448671
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)
TIGRFAM ID: [TIGR01142] phosphoribosylglycinamide formyltransferase 2
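
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not belong to any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(2024685, 2025863)
print(length, protein_length(length))
```

With the values in this record the sketch prints `1179 392`, which agrees with the listed gene and protein lengths.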


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.310007
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTAT TAGGCACTGC GCTGCGTCCG GCAGCAACGC GGGTGATGTT ATTAGGGGCA 
GGTGAATTGG GAAAAGAGGT GGCGATTGAA TGCCAACGCC TGGGGATCGA GGTTATCGCC
GTCGATCGCT ATCCTGATGC TCCCGCCATG CATGTGGCTC ACCGTTCACA CGTCATTAAT
ATGCTGGACG GCGAGGCGCT ACGTCATGTG ATTACAGAGG AAAAACCGCA TTATATCGTG
CCGGAAATAG AAGCGATCGC CACCGATACG CTGCGCGAGC TGGAGGGCGA AGGGCTGAAT
GTCGTGCCTT GCGCCCGTGC AACGCAGCTC ACGATGAACC GCGAAGGGAT CCGTCGCCTG
GCCGCAGAAG AATTAGGACT GCCGACATCG ACGTATCGCT TTGCCGACAG TGAGGCCAGT
TTTCATGATG CGGTAGCCGC AGTGGGTTTT CCTTGCATCG TCAAACCGGT CATGAGCTCT
TCCGGCAAAG GCCAGAGCTT TATCCGCTCG GCCGAACAAC TCGCGCAGGC ATGGGAGTAT
GCTCAACAGG GCGGACGCGC TGGCGCGGGT CGCGTGATTG TGGAAGGCGT GGTTAAATTT
GATTTTGAAA TTACGCTGCT CACCGTTAGC GCCGTCGATG GCGTGCATTT CTGCGCGCCG
GTCGGTCATC GTCAGCAAGA TGGTGACTAT CGCGAATCCT GGCAGCCACA GCAGATGAGC
GAACTGGCGC TGAAGCGGGC GCAAGAGATT GCTCGTCATG TGGTACTGGC GTTAGGCGGT
CATGGCCTGT TCGGCGTTGA ACTCTTCGTC TGTGGCGATG AAGTCATTTT CAGCGAAGTC
TCCCCTCGCC CGCACGATAC CGGAATGGTC ACGTTGATTT CTCAGGATCT CTCTGAGTTT
GCGCTGCATG TGCGCGCCTT TCTGGGAATG CCCGTAGGCG CTATTCGCCA GTATGGTCCC
GCTGCCTCGG CCGTGATTCT GCCGCAGCTT ACCAGTCAAA ATGTGACGTT TGATAATGTA
CACGCGGCGG TAGGAGCCGG AGTGCAGGTA CGGCTGTTTG GTAAGCCTGA GATCGACGGC
ACCCGTCGTC TTGGTGTAGC GTTAGCGACA GGTGAAAACG TTGAAGAAGC GGTGATAAGA
GCGAAAAAGG CCGTCAGCCG CGTGACGGTA AAAGGCTAA
 
Protein sequence
MTLLGTALRP AATRVMLLGA GELGKEVAIE CQRLGIEVIA VDRYPDAPAM HVAHRSHVIN 
MLDGEALRHV ITEEKPHYIV PEIEAIATDT LRELEGEGLN VVPCARATQL TMNREGIRRL
AAEELGLPTS TYRFADSEAS FHDAVAAVGF PCIVKPVMSS SGKGQSFIRS AEQLAQAWEY
AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQQDGDY RESWQPQQMS
ELALKRAQEI ARHVVLALGG HGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF
ALHVRAFLGM PVGAIRQYGP AASAVILPQL TSQNVTFDNV HAAVGAGVQV RLFGKPEIDG
TRRLGVALAT GENVEEAVIR AKKAVSRVTV KG
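
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the CDS. It is an illustration under stated assumptions, not the method the database uses: the helper names are hypothetical, and the codon table is the standard one, whose sense and stop codon assignments translation table 11 shares (table 11 additionally allows alternative start codons, which this sketch does not handle; this gene starts with ATG).

```python
from itertools import product

# NCBI ordering of codons (bases T, C, A, G) and the matching amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
FIRST_LINE = ("ATGACGCTAT TAGGCACTGC GCTGCGTCCG "
              "GCAGCAACGC GGGTGATGTT ATTAGGGGCA")
print(translate(FIRST_LINE))
```

Run on the first 60 bases, this prints `MTLLGTALRPAATRVMLLGA`, matching the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.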