Gene SNSL254_A4508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4508 
SymbolpurD 
ID6482854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4381298 
End bp4382587 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content61% 
IMG OID642739737 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_002043423 
Protein GI194445679 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0778373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA 
CAGTCGCCGC TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG
CCTGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CAGCGCTGCT GAGCTTTGCC
CAGAACGAGA AGATTGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GATTGGCGTG
GTCGATGCGT TCCGCGCGGC GGGCCTGAAG ATCTTCGGCC CAACCGAAGG CGCCGCGCAA
CTGGAAGGCT CCAAAGCCTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG
GAATACCAGA ATTTCACCGA GATTGAGCCT GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG
CCGATCGTCA TCAAGGCCGA TGGTCTGGCC GCCGGTAAAG GCGTTATCGT GGCGATGACG
CTGGAAGAAG CAGAAGCTGC CGTACACGAT ATGCTGGCCG GTAACGCCTT TGGCGACGCG
GGTCACCGCA TCGTGATTGA AGAGTTCCTC GACGGCGAGG AAGCGAGCTT TATCGTGATG
GTCGACGGCG AGCACGTTCT GCCGATGGCT ACCAGCCAGG ACCACAAACG CGTGGGCAAT
GGCGATACCG GCCCGAATAC CGGCGGTATG GGGGCCTACT CACCGGCGCC GGTGGTGACT
GATGAAGTGC ACCAGCGCAC CATGGAACGC ATCATCTGGC CAACCGTGAA AGGCATGGCA
GCAGAAGGTA ACACGTACAC CGGCTTCCTG TATGCGGGTC TGATGATCGA CAAGCAGGGC
AACCCGAAAG TTATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG
CTGCGCATGA AATCGGATCT GGTGGATCTT TGCCTGGCCG CCTGCGACGG CAAGCTGGAT
GAGAAAACCT CCGAGTGGGA CGAACGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT
TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT
GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC
GGCGGACGCG TCCTGTGCGC CACCGCGCTG GGCCACACCG TCGCCGAAGC GCAGAAACGC
GCTTACGCCC TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC
TGGCGCGCCA TCGAACGCGA ACAGCGCTAA
 
Protein sequence
MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA 
QNEKIDLTIV GPEAPLVIGV VDAFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA
EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA
GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT
DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM
LRMKSDLVDL CLAACDGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA
DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG
WRAIEREQR