Gene SeHA_C4506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4506 
SymbolpurD 
ID6491456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4386561 
End bp4387850 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content60% 
IMG OID642744580 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_002048160 
Protein GI194450383 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0278032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTT TAGTCATTGG TAACGGCGGG CGCGAACACG CGCTGGCCTG GAAAGCCGCA 
CAGTCGCCGT TGGTTGATAC CGTTTTTGTC GCACCGGGTA ACGCCGGTAC CGCGCTGGAG
CCAGCGTTGC AGAACGTGGC TATCGGCGTC ACCGATATTC CGGCGCTGCT GAGCTTTGCC
CAGAACGAGA AGATAGATCT GACCATCGTT GGCCCGGAAG CGCCGCTGGT GATTGGTGTG
GTGGATGAGT TCCGCGCGGC GGGTCTGAAG ATCTTTGGCC CAACCGAAGG GGCCGCCCAA
CTGGAAGGCT CCAAAGCGTT CACCAAAGAT TTCCTCGCTC GTCACCAGAT TCCGACGGCG
GAATACCAGA ATTTCACCGA GATTGAGCCA GCCCTGGCTT ATCTGCGTGA GAAAGGCGCG
CCGATCGTCA TCAAAGCTGA CGGTCTGGCT GCCGGTAAAG GCGTTATCGT GGCGATGACG
CTGGAAGAAG CCGAAGCTGC CGTTCATGAC ATGCTGGCGG GTAACGCTTT TGGTGATGCG
GGACATCGTA TCGTCATCGA AGAGTTCCTC GACGGCGAAG AGGCAAGCTT TATCGTGATG
GTCGACGGCG AGCACGTGCT GCCGATGGCC ACCAGCCAGG ACCACAAACG CGTAGGCAAC
GGCGATACCG GCCCGAACAC CGGCGGCATG GGGGCTTACT CTCCGGCTCC AGTGGTAACC
GATGAAGTGC ATCAGCGCAC CATGGAACGC ATCATTTGGC CAACCGTGAA AGGCATGGCG
GCGGAAGGTA ACACGTACAC CGGCTTCCTG TACGCGGGTC TGATGATCGA CAAGCAGGGT
AATCCGAAGG TTATCGAGTT CAACTGCCGC TTCGGCGATC CGGAAACCCA GCCGATCATG
TTGCGCATGA AGTCGGACCT GGTGGATCTT TGCCTGGCCG CCTGCGAAGG CAAGCTGGAT
GAGAAAACCT CCGAGTGGGA CGAGCGCGCT TCATTAGGCG TGGTGATCGC CGCGGGCGGT
TATCCGGGCA ACTACAACAC TGGCGATGAG ATCCACGGCC TGCCGCTGGA AGAAGTGGCT
GACGGTAAGG TTTTCCACGC GGGCACCAAA CTCGCCGATG ACGACCGTGT GCTGACCAGC
GGCGGACGCG TCCTGTGCGC CACCGCGCTG GGCCACACCG TCGCCGAAGC GCAGAAACGC
GCTTACGCCC TGATGACCGA CATCCGCTGG GACGGCAGCT TCAGCCGTAA CGACATCGGC
TGGCGCGCCA TCGAACGCGA ACAGCGCTAA
 
Protein sequence
MKVLVIGNGG REHALAWKAA QSPLVDTVFV APGNAGTALE PALQNVAIGV TDIPALLSFA 
QNEKIDLTIV GPEAPLVIGV VDEFRAAGLK IFGPTEGAAQ LEGSKAFTKD FLARHQIPTA
EYQNFTEIEP ALAYLREKGA PIVIKADGLA AGKGVIVAMT LEEAEAAVHD MLAGNAFGDA
GHRIVIEEFL DGEEASFIVM VDGEHVLPMA TSQDHKRVGN GDTGPNTGGM GAYSPAPVVT
DEVHQRTMER IIWPTVKGMA AEGNTYTGFL YAGLMIDKQG NPKVIEFNCR FGDPETQPIM
LRMKSDLVDL CLAACEGKLD EKTSEWDERA SLGVVIAAGG YPGNYNTGDE IHGLPLEEVA
DGKVFHAGTK LADDDRVLTS GGRVLCATAL GHTVAEAQKR AYALMTDIRW DGSFSRNDIG
WRAIEREQR