Gene NSE_0185 details

Gene Information

Locus tag: NSE_0185
Symbol: purH
ID: 3931627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 154794
End bp: 156266
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 40%
IMG OID: 637900341
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_506080
Protein GI: 88608740
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
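
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 154794
END_BP = 156266
REPORTED_GENE_LENGTH = 1473    # bp
REPORTED_PROTEIN_LENGTH = 490  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches:", length == REPORTED_GENE_LENGTH)
    print("protein length matches:",
          protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported values agree: 156266 - 154794 + 1 = 1473 bp, and 1473 / 3 - 1 = 490 aa.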


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAA GAGCGCTTAT ATCCGTTTAC GATAAGACGG ATCTCTTGCC TCTGGCACAC 
AAATTACAAG CTGCAAATGT TGAAATCATT GCAACTGGAA AGACACATAA ATATCTCCTA
GAGAATCAAA TCAGAGCAAT CAATTTATCT GATTATACGC ACCAACCAGA AATACTAGGT
GGTAGGGTGA AAACGCTGCA TCCCCTAATA CACGCAGGCT TGCTGGCAGA TCCGGCTTTA
CACGAAGCTG AAATGCAAAC ACTGACAATA AAACAAATAG ATCTGGTAGT TGTAAATCTT
TACCCATTTG AAAAGTGCGT AAATAGTGGT GCCCCAGAAA GTGAAATTAT AGAAAACATA
GATATTGGTG GTGTTTCACT ATTAAGGTCT GCAGCTAAGA ATTTTAAAAA TGTTTGTGTC
CTCTCTGACC CAAGCGATTA CGGTACATTC GACGTAAACC CAACTTTATC TTTCAGAACT
AAAATGGCAA GAAAAGCATT TGCACGGGTC GCAAGATATG ACTGTAAGAT AGCAGAATGG
TTCGCTTCCT CTACTTCTTT GCCAGAGGTG CTAAATCTAT CTGTCATGAA GAAAAATGAT
TTCAGATACG GTGAAAATCC CCATCAGAAC GCATCTTTCT ACTCTAATGG AGAGTTTCCA
CTCACAAAAC TTCAGGGTAA AGAACTAAGT TATAACAACC TGCTCGACCT AGACAGCGCA
TTGTCGATTG TGACAAACTT CACTGAACCT ACTTGTGCAA TTATTAAACA CAGTAATCCC
TGCGGTGTAG CCTCACACAA ATCCAGTCTG GAACAGGCAT ACGAGAAGGC ACTCGCTGCA
GATTCGTTAA GTGCTTTTGG TGGTGTAGTT GCATTGAATC AAATTGTTAC TAGGAACGTG
GCAGAAAAGC TTTTGAATAC CTTCTTTGAA GTCATCGTTG CTTATGGAAT CACACAGGAC
GCTATATTTC TCTTTTCAAA CAAACCAAAT TTACGCATCC TTACATGCAA GGGTTATACT
GCACCAGAAA AACACATGCT CAGTTTGCTC GGGGGACTCT TAGTACAATC TGGAAATACA
AAATTATTTC AGGACTTTGA TATTGTCACA AAAAGACAAC CAACAAATGA AGAAGTTAAT
CAACTTATTT TTGCCTGGAA GATCTGCAAA TATGTCAAGT CAAACGCAAT AGTGACTGCC
CACGATTACA CAACTGTAGG AATAGGTGCC GGTCAAATGA GCCGCGTTAA GAGTGTAGAA
ATAGCACTGG GAAAAAGCAA AGTCGGGCAG CCTCTTGCGA TGGCGTCGGA TGCGTTCTTT
CCATTTGCGG ATAGTATAGA GTTAGCAGCA AAGAGTGGTG TAAAGTCAAT TATCCAACCA
GGTGGCTCAA TCAGAGATAA AGAAGTAATA GAAGCAGCCG ATCATCATAA CATTGCGATG
ATATTTACGC GAATGAGGCA TTTCAGACAC TGA
 
Protein sequence
MIKRALISVY DKTDLLPLAH KLQAANVEII ATGKTHKYLL ENQIRAINLS DYTHQPEILG 
GRVKTLHPLI HAGLLADPAL HEAEMQTLTI KQIDLVVVNL YPFEKCVNSG APESEIIENI
DIGGVSLLRS AAKNFKNVCV LSDPSDYGTF DVNPTLSFRT KMARKAFARV ARYDCKIAEW
FASSTSLPEV LNLSVMKKND FRYGENPHQN ASFYSNGEFP LTKLQGKELS YNNLLDLDSA
LSIVTNFTEP TCAIIKHSNP CGVASHKSSL EQAYEKALAA DSLSAFGGVV ALNQIVTRNV
AEKLLNTFFE VIVAYGITQD AIFLFSNKPN LRILTCKGYT APEKHMLSLL GGLLVQSGNT
KLFQDFDIVT KRQPTNEEVN QLIFAWKICK YVKSNAIVTA HDYTTVGIGA GQMSRVKSVE
IALGKSKVGQ PLAMASDAFF PFADSIELAA KSGVKSIIQP GGSIRDKEVI EAADHHNIAM
IFTRMRHFRH
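
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene for comparison with the protein sequence. It assumes the amino acid assignments of translation table 11 (identical to the standard code; the tables differ only in permitted start codons), and the helper names are hypothetical.

```python
from itertools import product

# Amino acids for codons enumerated with bases in the order T, C, A, G
# (NCBI layout); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    head = clean_sequence("ATGATAAAAA GAGCGCTTAT ATCCGTTTAC")
    print(translate(head))
    print(gc_fraction(head))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` allows a direct comparison with the listed protein sequence, and `gc_fraction` can be compared against the GC content field.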