Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0031 |
Symbol | purM |
ID | 3931933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 25144 |
End bp | 26112 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637900188 |
Product | phosphoribosylformylglycinamidine cyclo-ligase |
Protein accession | YP_505934 |
Protein GI | 88607996 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.647865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CTCTCTCCTA CAAAAGCTCC GGTGTGAATA TAGATACCGC GAACAGCTTT GTTGATTTTA TTAAAAAGAA CGCAGAGGTT GATAAGCACT GCGTTTCCGC TATTGGTGGG TTTGCTTCTT TATTCTCCGT AGGCAATTTG AATTACAAGG ACCCTGTAAT TGCTGCAGCA ACAGACGGTG TTGGAACTAA GCTATTGATA GCAAATGAGT GCAATAACCA TGCTAGTATT GGTATAGACC TAGTAGCCAT GTGCGCCAAC GACCTGATCT GTCACGGGGC AAAACCGCTC TTTTTTCTTG ATTATTACTC GACAGGGAAA CTTGAGCTCG ATGTAGCACG CGCAGTAATC TCAGGGATTC TAGTCGGGTG TAGAGAGGCT TCCATGTCAC TGGTAGGGGG AGAAACCGCT GAAATGCCCG GCTTATACAG TGCTGGTGAA TACGATTTAG CCGGCTTTGC TGTCGGAATA GTTGAAAAAG AGGAAATTCT CCCACAAAAT GTCACGAAAG GTGACATCCT TATTGGGCTG AAATCATCAG GCTTTCATGC GAATGGATTC TCACTTATCC GCAAAACTTT CTCCAGCTTA AGTATTAAGT ACTCAACACT GTTCGATAAA AGAACCTGGG GTGAGATACT CCTAACCCCA ACAAAAATAT ATGTGAATTC GTTTCTTTCT TTGAAAAAAT TCATAAAGGC GGCTGCACAC ATCACGGGTG GCGGACTACT AGAAAATCTT AGAAGGGTTC TTCCAGAAGA CCTAAAAATC TGTATTCAGC CTTATGAGTT TCCCGAAATT TTCAAGTGGC TCATGCTGAA CGGTAATATT CCGCAAGAAG AAATGTTAAC TACTTTTAAC TGTGGCATAG GAATGGTTTT GGTTGTCGCG GAGCAAAATG CTGAGTTTGT TTCCGAGACA CTAGGAGAGG AAGCTCTAAT TATCGGAAAC CTTAAGTAG
|
Protein sequence | MKKTLSYKSS GVNIDTANSF VDFIKKNAEV DKHCVSAIGG FASLFSVGNL NYKDPVIAAA TDGVGTKLLI ANECNNHASI GIDLVAMCAN DLICHGAKPL FFLDYYSTGK LELDVARAVI SGILVGCREA SMSLVGGETA EMPGLYSAGE YDLAGFAVGI VEKEEILPQN VTKGDILIGL KSSGFHANGF SLIRKTFSSL SIKYSTLFDK RTWGEILLTP TKIYVNSFLS LKKFIKAAAH ITGGGLLENL RRVLPEDLKI CIQPYEFPEI FKWLMLNGNI PQEEMLTTFN CGIGMVLVVA EQNAEFVSET LGEEALIIGN LK
|
| |