Gene NSE_0965 details


Gene Information

Locus tag: NSE_0965
Symbol: purK
ID: 3931919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 851895
End bp: 852944
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 38%
IMG OID: 637901119
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_506829
Protein GI: 88607982
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
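
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes one stop codon; the record itself does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(851895, 852944)
print(length, protein_length(length))
```

Under these assumptions the results agree with the listed gene length (1050 bp) and protein length (349 aa).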


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.676938
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTTT ATAGTAGGAA AATGCAGCGC ATTGGAATAA TCGGAGGCGG ACAGCTAGGA 
AAAATGACAG CCATAGCTGC ATATAACCTG GGTTTTAAAG TTTGTGTCCT AGCAGAGAAG
GAGAACTCCC CAGCTATAGA TGTAGCCAGA GATTATGTTG TTTCACCTTT CCTCGACAGG
TCAGGGATTC TTAACTTTGT AGAACATGTA GACGCTATAA CATTTGAGAG CGAAAACATA
CCAACAGAGA CGCTTGATCT ACTCCACGAT AAGTTCGACG TTCCAAATAC TAAAGCGATT
AAGGTAGCAC AGGACAGGTT CTTAGAAAAA GAATTCTTAA GGAAAAATGG AATACCAACT
ACCGAATATT GGTACATCGA AAAGGAGGAG GACCTTGATA GTGTAGATTT TCCAGCAATA
TTAAAAACAA TCAATGGTGG CTATGATGGA AAAGGACAAT TCCTCCTAGA GGATCATGAT
TATGTAAGAA GAGAAGCCGG AAACCTCAAA TTCCCTCTAA TAGCAGAAAA GTTATTTAGG
ATAAGTAAAG AATTCTCCAT AATAGTTTCG AGAAATGAAA CTGGAAGTGT GTGTTTTCCG
ATAGCAGAAA ATGTTCATGT TAATGGAATA CTCAAAACAT CCAGCGTGCC AGCTGTACTT
CCTCATCATG TCGCACTTGA AATAAAAAAC ATAGGATTTC AAATAGCAGA TCTATTAGAA
ATAAAGGGTC TCTTGTGTGT AGAATTTTTT CTCGACGAAG ATAACAAGCT AGTTGTGAAT
GAAATCGCTC CTAGACCACA CAATTCTGGT CACTGGAGCA TGGATTGCTG CGACATTGAC
CAATTTGAAG AACTGGTTCT TGCAATTACA GGTAATAAAC TCAAAAAGCC TAATCTCGTA
GTTCCGTGCA CAATGAAGAA TATTCTTGGC AATGAAATAA ATACTTGGAA AGATTTATTC
CTACAAAAAA ATGTAAAACT CTACAACTAT GGTAAAGAAC AGCCTAAAAT CCTAAGGAAA
ATGGGGCATA TAAACATTCT GCATCCGTAA
 
Protein sequence
MDLYSRKMQR IGIIGGGQLG KMTAIAAYNL GFKVCVLAEK ENSPAIDVAR DYVVSPFLDR 
SGILNFVEHV DAITFESENI PTETLDLLHD KFDVPNTKAI KVAQDRFLEK EFLRKNGIPT
TEYWYIEKEE DLDSVDFPAI LKTINGGYDG KGQFLLEDHD YVRREAGNLK FPLIAEKLFR
ISKEFSIIVS RNETGSVCFP IAENVHVNGI LKTSSVPAVL PHHVALEIKN IGFQIADLLE
IKGLLCVEFF LDEDNKLVVN EIAPRPHNSG HWSMDCCDID QFEELVLAIT GNKLKKPNLV
VPCTMKNILG NEINTWKDLF LQKNVKLYNY GKEQPKILRK MGHINILHP
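
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method used to produce the record: it builds the codon table from the standard NCBI amino-acid ordering (translation table 11 assigns the same amino acids as the standard table and differs in its permitted start codons), it ignores alternative start codons, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
print(translate("ATGGACCTTT ATAGTAGGAA AATGCAGCGC"))  # MDLYSRKMQR
print(translate("ATGGGGCATA TAAACATTCT GCATCCGTAA"))  # MGHINILHP
```

The two outputs match the first ten and last nine residues of the protein sequence, and the final codon TAA is the stop codon that is counted in the 1050 bp gene length but not in the 349 aa protein length.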