Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0185 |
Symbol | purH |
ID | 3931627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 154794 |
End bp | 156266 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637900341 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_506080 |
Protein GI | 88608740 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA GAGCGCTTAT ATCCGTTTAC GATAAGACGG ATCTCTTGCC TCTGGCACAC AAATTACAAG CTGCAAATGT TGAAATCATT GCAACTGGAA AGACACATAA ATATCTCCTA GAGAATCAAA TCAGAGCAAT CAATTTATCT GATTATACGC ACCAACCAGA AATACTAGGT GGTAGGGTGA AAACGCTGCA TCCCCTAATA CACGCAGGCT TGCTGGCAGA TCCGGCTTTA CACGAAGCTG AAATGCAAAC ACTGACAATA AAACAAATAG ATCTGGTAGT TGTAAATCTT TACCCATTTG AAAAGTGCGT AAATAGTGGT GCCCCAGAAA GTGAAATTAT AGAAAACATA GATATTGGTG GTGTTTCACT ATTAAGGTCT GCAGCTAAGA ATTTTAAAAA TGTTTGTGTC CTCTCTGACC CAAGCGATTA CGGTACATTC GACGTAAACC CAACTTTATC TTTCAGAACT AAAATGGCAA GAAAAGCATT TGCACGGGTC GCAAGATATG ACTGTAAGAT AGCAGAATGG TTCGCTTCCT CTACTTCTTT GCCAGAGGTG CTAAATCTAT CTGTCATGAA GAAAAATGAT TTCAGATACG GTGAAAATCC CCATCAGAAC GCATCTTTCT ACTCTAATGG AGAGTTTCCA CTCACAAAAC TTCAGGGTAA AGAACTAAGT TATAACAACC TGCTCGACCT AGACAGCGCA TTGTCGATTG TGACAAACTT CACTGAACCT ACTTGTGCAA TTATTAAACA CAGTAATCCC TGCGGTGTAG CCTCACACAA ATCCAGTCTG GAACAGGCAT ACGAGAAGGC ACTCGCTGCA GATTCGTTAA GTGCTTTTGG TGGTGTAGTT GCATTGAATC AAATTGTTAC TAGGAACGTG GCAGAAAAGC TTTTGAATAC CTTCTTTGAA GTCATCGTTG CTTATGGAAT CACACAGGAC GCTATATTTC TCTTTTCAAA CAAACCAAAT TTACGCATCC TTACATGCAA GGGTTATACT GCACCAGAAA AACACATGCT CAGTTTGCTC GGGGGACTCT TAGTACAATC TGGAAATACA AAATTATTTC AGGACTTTGA TATTGTCACA AAAAGACAAC CAACAAATGA AGAAGTTAAT CAACTTATTT TTGCCTGGAA GATCTGCAAA TATGTCAAGT CAAACGCAAT AGTGACTGCC CACGATTACA CAACTGTAGG AATAGGTGCC GGTCAAATGA GCCGCGTTAA GAGTGTAGAA ATAGCACTGG GAAAAAGCAA AGTCGGGCAG CCTCTTGCGA TGGCGTCGGA TGCGTTCTTT CCATTTGCGG ATAGTATAGA GTTAGCAGCA AAGAGTGGTG TAAAGTCAAT TATCCAACCA GGTGGCTCAA TCAGAGATAA AGAAGTAATA GAAGCAGCCG ATCATCATAA CATTGCGATG ATATTTACGC GAATGAGGCA TTTCAGACAC TGA
|
Protein sequence | MIKRALISVY DKTDLLPLAH KLQAANVEII ATGKTHKYLL ENQIRAINLS DYTHQPEILG GRVKTLHPLI HAGLLADPAL HEAEMQTLTI KQIDLVVVNL YPFEKCVNSG APESEIIENI DIGGVSLLRS AAKNFKNVCV LSDPSDYGTF DVNPTLSFRT KMARKAFARV ARYDCKIAEW FASSTSLPEV LNLSVMKKND FRYGENPHQN ASFYSNGEFP LTKLQGKELS YNNLLDLDSA LSIVTNFTEP TCAIIKHSNP CGVASHKSSL EQAYEKALAA DSLSAFGGVV ALNQIVTRNV AEKLLNTFFE VIVAYGITQD AIFLFSNKPN LRILTCKGYT APEKHMLSLL GGLLVQSGNT KLFQDFDIVT KRQPTNEEVN QLIFAWKICK YVKSNAIVTA HDYTTVGIGA GQMSRVKSVE IALGKSKVGQ PLAMASDAFF PFADSIELAA KSGVKSIIQP GGSIRDKEVI EAADHHNIAM IFTRMRHFRH
|
| |