Gene Dshi_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1889 
SymbolpurM 
ID5712881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1966227 
End bp1967288 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content71% 
IMG OID641267813 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001533232 
Protein GI159044438 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0381482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.145753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG ACACACCGCC GCCGAAACCC GGTCTGACAT ATGCCGAGGC CGGCGTCGAC 
ATCGATGCGG GCAACACCCT GGTGGACCGG ATCAAGCCCG CCGCCAAGGC CACCTCTCGC
CCGGGCGTGA TGAGCGGTCT GGGCGGTTTC GGCGCACTGT TCGACCTTAG GGCCGCGGGC
TACGCCGACC CGGTGCTGGT GGCCGCCACG GACGGGGTCG GCACCAAGCT GCGGATCGCC
ATCGACACCG GCCATGTCGA CACGATCGGG ATCGACCTGG TAGCGATGTG CGTCAACGAC
CTCGTGTGCC AGGGGGCTGA ACCGCTGCTT TTCCTGGACT ATTTCGCCAC CGGAAAGCTC
GACGTGGCCG AGGCCGCGAC GATCGTCGAG GGTATCGCCC GGGGCTGCGC CACTTCCGGC
TGCGCGCTGA TCGGCGGCGA AACCGCCGAG ATGCCGGGCA TGTATGCCAA GGGCGATTTC
GACCTCGCGG GCTTTGCCGT CGGCGCGATG GAGCGGGGCG GCGCGTTGCC CGCGAATGTG
GCGGCAGGGG ACATGATCCT CGGGCTGGCC TCGGACGGGG TCCATTCCAA CGGCTACTCG
CTGGTGCGTC GGATCGTCGA GCGCTCCGGT CTGGGCTGGG GCGATCCCGC ACCGTTCGAG
GGCCGGACTC TCGGCGCGGC CCTGCTGACG CCCACGCGGC TCTACGTGCA ACCGGCGCTG
GCGGCGATCC GCGCGGGCGG GGTGCACGGG CTGGCCCATG TCACCGGCGG CGGGCTGACC
GAGAACCTGC CCCGGGTGCT GCCCGAGGGG CTGGGGATCG AGATCAACCT CGGCGCGTGG
GAATTGCCGC CGGTGTTCCG CTGGCTCGCC GCCGAGGGCG GGCTCGACGA GGCCGAACTG
CTCAAGACCT TCAACGCCGG GATCGGCATG GCCCTGATCG TGGCGCCCGA CCGGGCCGAG
GCGCTCGCGG ACCTGCTGGC CGGGGCGGGC GAGCGTGTGG CGGTGATCGG CCATGTCACC
GAAGGCGCGG GCGCCGTGCA CTATCGCGGG ACGCTTCTTT GA
 
Protein sequence
MTTDTPPPKP GLTYAEAGVD IDAGNTLVDR IKPAAKATSR PGVMSGLGGF GALFDLRAAG 
YADPVLVAAT DGVGTKLRIA IDTGHVDTIG IDLVAMCVND LVCQGAEPLL FLDYFATGKL
DVAEAATIVE GIARGCATSG CALIGGETAE MPGMYAKGDF DLAGFAVGAM ERGGALPANV
AAGDMILGLA SDGVHSNGYS LVRRIVERSG LGWGDPAPFE GRTLGAALLT PTRLYVQPAL
AAIRAGGVHG LAHVTGGGLT ENLPRVLPEG LGIEINLGAW ELPPVFRWLA AEGGLDEAEL
LKTFNAGIGM ALIVAPDRAE ALADLLAGAG ERVAVIGHVT EGAGAVHYRG TLL