Gene YpsIP31758_3844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3844 
SymbolpurD 
ID5385221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4331184 
End bp4332470 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content52% 
IMG OID640866869 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001402795 
Protein GI153950785 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.000315171 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT TGATAATTGG TAACGGCGGT CGTGAACACG CTCTGGGCTG GAAAGCCGCC 
CAATCTCCTT TAGCGGACAA AATTTATGTT GCACCAGGTA ATGCGGGTAC AGCACTGGAA
CCGACCTTAG AAAATGTTGA TATCGCCGCC ACTGATATTG CCGGTTTACT GGCCTTTGCT
CAAAGTCATG ATATCGGCCT GACGATTGTT GGCCCAGAAG CCCCTTTGGT GATCGGCGTG
GTTGATGCGT TCCGCGCTGC TGGTTTAGCT ATTTTTGGCC CGACTCAGGC TGCGGCTCAA
TTAGAGGGTT CTAAAGCCTT CACCAAAGAT TTCCTGGCCC GTCACAACAT TCCCTCTGCG
GAATACCAAA ACTTTACAGA TGTCGAGGCC GCATTGGCCT ATGTGCGTCA AAAAGGTGCG
CCAATCGTTA TCAAAGCCGA TGGTCTGGCC GCCGGTAAAG GCGTGATTGT TGCGATGACG
CAGGAAGAAG CCGAAACCGC CGTGAATGAT ATGTTGGCCG GTAACGCTTT TGGTGATGCA
GGGCACCGTA TCGTGGTGGA AGAGTTCCTT GATGGCGAAG AAGCCAGCTT TATCGTGATG
GTTGATGGCG AAAATGTTTT GCCAATGGCG ACCAGTCAGG ATCATAAGCG AGTTGGCGAT
GGTGATACCG GGCCAAATAC CGGCGGAATG GGTGCTTATT CCCCAGCCCC CGTGGTAACA
GATGATGTTC ACCAACGGGT CATGGATCAG GTTATTTGGC CGACCGTGCG TGGTATGGCG
GCGGAAGGTA ATATTTACAC CGGTTTCCTC TATGCTGGCC TGATGATTTC AGCCGATGGG
CAACCCAAAG TCATTGAGTT CAACTGCCGC TTTGGCGATC CAGAAACGCA GCCAATCATG
TTGCGTATGC GCTCCGATTT GGTCGAACTG TGTTTAGCCG GTACACAAGG CAAACTAAAT
GAAAAAACCT CAGACTGGGA TGAGCGCCCA TCACTGGGGG TCGTTTTAGC CGCTGGCGGT
TATCCAGCAG ATTACCGCCA GGGTGATGTT ATTCATGGCT TACCACAGCA AGAAGTCAAG
GATGGAAAAG TCTTCCACGC GGGGACCAAG CTGAATGGGA ATCATGAAGT TGTCACCAAT
GGTGGCCGCG TCTTGTGTGT CACTGCACTC GGTGAAACCG TCGCGCAGGC GCAACAATAT
GCCTATCAGT TAGCTGAGGG GATCCAGTGG GAAGGGGTTT TCTGCCGTAA AGATATTGGT
TATCGAGCGA TTGCTCGCGG TAAGTAA
 
Protein sequence
MNILIIGNGG REHALGWKAA QSPLADKIYV APGNAGTALE PTLENVDIAA TDIAGLLAFA 
QSHDIGLTIV GPEAPLVIGV VDAFRAAGLA IFGPTQAAAQ LEGSKAFTKD FLARHNIPSA
EYQNFTDVEA ALAYVRQKGA PIVIKADGLA AGKGVIVAMT QEEAETAVND MLAGNAFGDA
GHRIVVEEFL DGEEASFIVM VDGENVLPMA TSQDHKRVGD GDTGPNTGGM GAYSPAPVVT
DDVHQRVMDQ VIWPTVRGMA AEGNIYTGFL YAGLMISADG QPKVIEFNCR FGDPETQPIM
LRMRSDLVEL CLAGTQGKLN EKTSDWDERP SLGVVLAAGG YPADYRQGDV IHGLPQQEVK
DGKVFHAGTK LNGNHEVVTN GGRVLCVTAL GETVAQAQQY AYQLAEGIQW EGVFCRKDIG
YRAIARGK