Gene SeD_A4582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4582 
SymbolpurD 
ID6873473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4421597 
End bp4422886 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content61% 
IMG OID642787489 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_002218091 
Protein GI198241778 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.213422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA 
CAGTCGCCGC TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG
CCAGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CGGCGCTGCT GAGCTTTGCC
CAGAACGAGA AAATAGATCT GACCATCGTC GGCCCGGAAG CGCCGCTGGT GATTGGCGTG
GTCAATGCGT TCCGCGCGGC GGGTCTGAAG ATCTTTGGCC CAACCGAAGG GGCCGCCCAA
CTGGAAGGCT CCAAAGCCTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG
GAATACCAGA ATTTCACCGA GATTGAGCCA GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG
CCGATCGTCA TCAAAGCTGA CGGTCTGGCT GCCGGTAAAG GCGTTATCGT GGCGATGACG
CTGGAAGAAG CCGAAGCCGC CGTTCATGAC ATGCTGGCAG GCAACGCCTT TGGCGACGCG
GGCCACCGTA TCGTGATTGA GGAGTTCCTC GACGGCGAAG AAGCGAGCTT TATCGTGATG
GTCGACGGCG AGCACGTGCT GCCGATGGCC ACCAGCCAGG ATCACAAACG CGTAGGCAAC
GGCGATACCG GCCCGAACAC CGGCGGCATG GGGGCTTACT CTCCGGCTCC AGTGGTGACC
GATGAAGTCC ACCAGCGCAC CATGGAACGG ATCATCTGGC CAACCGTGAA AGGCATGGCG
GCAGAAGGGA ATACCTATAC CGGCTTCCTG TACGCGGGTC TGATGATCGA CAAGCAGGGC
AACCCAAAAG TGATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG
CTGCGCATGA AGTCGGATCT GGTGGATCTT TGCCTGGCCG CCTGCGACGG CAAGCTGGAT
GAGAAAACCT CCGAGTGGGA CGAACGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT
TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT
GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC
GGCGGTCGCG TACTGTGCGC CACCGCGCTG GGCCACACCG TGGCTGAGGC GCAGAAACGC
GCTTACGCCT TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC
TGGCGCGCCA TTGAGCGTGA GCAAAACTAA
 
Protein sequence
MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA 
QNEKIDLTIV GPEAPLVIGV VNAFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA
EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA
GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT
DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM
LRMKSDLVDL CLAACDGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA
DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG
WRAIEREQN