Gene SeD_A1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1366 
SymbolpurT 
ID6873241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1342525 
End bp1343703 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content57% 
IMG OID642784534 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_002215204 
Protein GI198243510 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000000213864 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCTAT TAGGCACTGC GCTGCGTCCG GCAGCAACGC GGGTGATGTT ATTAGGGGCA 
GGTGAATTGG GAAAAGAGGT GGCGATTGAA TGCCAACGCC TGGGGATCGA GGTTATCGCC
GTCGATCGCT ATCCTGATGC GCCCGCCATG CATGTGGCTC ACCGTTCACA CGTCATTAAT
ATGCTGGACG GCGAGGCGCT ACGTCATGTG ATTACAGAGG AAAAACCGCA TTATATCGTG
CCGGAAATAG AAGCGATCGC CACCGATACG CTGCGCGAGC TGGAGGGCGA AGGGCTGAAT
GTCGTGCCTT GCGCCCGTGC AACGCAGCTC ACGATGAACC GCGAAGGGAT CCGTCGCCTG
GCCGCAGAAG AATTAGGTCT GCCGACATCG ACGTATCGCT TTGCCGACAG TGAGGCCAGT
TTTCATGATG CGGTAGCCGC AGTGGGTTTT CCTTGCATCG TCAAACCGGT CATGAGCTCT
TCCGGCAAAG GCCAGAGCTT TATCCGCTCG GCCGAACAGC TCGCGCAGGC ATGGGAGTAT
GCTCAACAGG GCGGACGCGC TGGCGCGGGT CGCGTGATTG TGGAAGGCGT GGTTAAATTT
GATTTTGAAA TTACGCTGCT CACCGTTAGC GCCGTCGATG GCGTGCATTT CTGCGCGCCG
GTCGGTCATC GTCAGCAAGA TGGTGACTAT CGCGAATCCT GGCAGCCACA GCAGATGAGC
GAACTGGCGC TGAAGCGGGC GCAAGAGATT GCGCGTCATG TGGTACTGGC GTTAGGCGGT
CATGGTCTGT TCGGCGTTGA ACTCTTCGTC TGTGGCGATG AAGTCATTTT CAGCGAAGTC
TCCCCTCGCC CGCACGATAC CGGAATGGTC ACGTTGATTT CTCAGGATCT CTCTGAGTTT
GCGCTGCATG TGCGCGCCTT TCTGGGAATG CCCGTAGGCG CTATTCGCCA GTATGGTCCC
GCTGCCTCGG CCGTGATTCT GCCGCAGCTT ACCAGTCAAA ATGTGACGTT TGATAATGTA
CACGCGGCGG TAGGAGCCGG AGTACAGGTA CGGCTGTTTG GTAAGCCTGA GATCGACGGC
ACTCGTCGTC TTGGTGTAGC GTTAGCGACA GGTGAAAACG TTGAAGAAGC GGTGATAAGA
GCGAAAAAGG CCGCCAGCCG CGTGACGGTA AAAGGCTAA
 
Protein sequence
MTLLGTALRP AATRVMLLGA GELGKEVAIE CQRLGIEVIA VDRYPDAPAM HVAHRSHVIN 
MLDGEALRHV ITEEKPHYIV PEIEAIATDT LRELEGEGLN VVPCARATQL TMNREGIRRL
AAEELGLPTS TYRFADSEAS FHDAVAAVGF PCIVKPVMSS SGKGQSFIRS AEQLAQAWEY
AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQQDGDY RESWQPQQMS
ELALKRAQEI ARHVVLALGG HGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF
ALHVRAFLGM PVGAIRQYGP AASAVILPQL TSQNVTFDNV HAAVGAGVQV RLFGKPEIDG
TRRLGVALAT GENVEEAVIR AKKAASRVTV KG