Gene NSE_0189 details

Gene Information

Locus tag: NSE_0189
Symbol:
ID: 3931851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 158171
End bp: 159568
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 45%
IMG OID: 637900345
Product: hypothetical protein
Protein accession: YP_506084
Protein GI: 88608349
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
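
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record was built, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 158171
end_bp = 159568
gene_length_bp = 1398
protein_length_aa = 465

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
expected_aa = codons - 1

print(span, span == gene_length_bp)
print(expected_aa, expected_aa == protein_length_aa)
```

Under these assumptions both comparisons hold, so the listed gene length and protein length are consistent with the listed coordinates.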


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAACATTG TAAAGAAACT GATCATCTCT GGTTGGCCTG TGTGGCCCAT AATAGAAGGT 
GGTAAAGGTG TAGCTGTAAG TGATGGGGTT TCGTCAGGTG CTTTTGCTGC TGCAGGTTGT
GTAGGCACCT TTTCCGCCGT TAACGCAAAG CTTATAGATG ATAATGGGGA GATTGTCCCG
CTCGAGTATC GTAGTAAGAC CAGAAAAGGG CGCCATGACG AACTGATCGA GTACAGCATC
AAGAGCGCGA TAAGCCAGGC CAGGATAGCC CGTGAACGTT CAAAAGGAGA GGGAAGAATC
CACATGAATG TGTTGTGGGA AATGGGCGGA GTCGAAAGAG TACTGGATGG CGTTTTGTCA
AAGGTTAGTG GGTTAATTCA TGGGATCACT TGTGGTGCGG GTATGCCTTA TAAGCTCGCG
GAAATAGCCT CACGCTACAA ATTGTGTTAC TACCCAATCA TTTCTTCTGT TAAGGCTTTT
AGAATTCTGT GGAAGCGTTC GTACAGAAAG CTTAGTGAGT TTCTCGGTGG TGTTGTTTAC
GAAGATCCGT GGCTTGCTGG TGGGCACAAC GGTCTTAGCA ACACTGATAG ACCGGACGAT
ATACAGGATC CTTATCCAAG AGTTGTTGAG TTGCGTTCTT TTATGAATGA GAACGGATTG
AGCCAAGTTC CTATAGTTAT GGCAGGTGGT GTATGGTCGC TCTCAGAATG GAAGCATTTC
ATGGATAATG ATGAGGTGGG TGCAGTTGCG TTTCAGTTTG GTACGCGTCC TCTTGTGACA
AAGGAAAGCC CGATTCCTGC CATATGGAAA CAAAGGTTAT TGCAAGCTAA AAGGGGTGAC
GTCCTATTGC ACAAGTTCAG TCCCACTGGA TTTTATTCAT CTGCTCTAAA AAATAAATTC
ATACAGGCTC TTATAGATCG TTCTGAAAGA CAGATTCCCT ATTCCGAATC TCTGGAGGGG
GAGTTTGTGC TATCCTTCGA ATATGGTCCG CGTAAACGGC AGATCTTCAT AAGACATCCT
GACGAGTCTT TAGTACAGGG ATGGCTTTCT TCTGGGTATA CAGAGGTTGT TAAGACTCCT
GATCGTTCTG TTGTGTTTCT CACACCGGAT GAGTTTGCGT TGATTCGGGC AGATCAGATG
AATTGTATGG GCTGCCTTAG CCATTGTAAG TTCAGCAATT GGAAAGACCA TGATGATTAC
ACAACAGGTG AATTACCGGA TCCTAGAAGT TTTTGTATAC AGAAAACGCT TCAGAACATG
GTATACGGGG CTGATCCTGA TACAGAGTTA GCCTTCGCTG GGCATAATGC GTATAGGTTT
TCCACAGATC CTTTATACAG GGATGGACAC GTACCAACCG TAAAAGAACT GGTCGAGAAG
ATTCTTGCTG GTGAATAA
 
Protein sequence
MNIVKKLIIS GWPVWPIIEG GKGVAVSDGV SSGAFAAAGC VGTFSAVNAK LIDDNGEIVP 
LEYRSKTRKG RHDELIEYSI KSAISQARIA RERSKGEGRI HMNVLWEMGG VERVLDGVLS
KVSGLIHGIT CGAGMPYKLA EIASRYKLCY YPIISSVKAF RILWKRSYRK LSEFLGGVVY
EDPWLAGGHN GLSNTDRPDD IQDPYPRVVE LRSFMNENGL SQVPIVMAGG VWSLSEWKHF
MDNDEVGAVA FQFGTRPLVT KESPIPAIWK QRLLQAKRGD VLLHKFSPTG FYSSALKNKF
IQALIDRSER QIPYSESLEG EFVLSFEYGP RKRQIFIRHP DESLVQGWLS SGYTEVVKTP
DRSVVFLTPD EFALIRADQM NCMGCLSHCK FSNWKDHDDY TTGELPDPRS FCIQKTLQNM
VYGADPDTEL AFAGHNAYRF STDPLYRDGH VPTVKELVEK ILAGE
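
The relationship between the two sequences can be illustrated with a short script. This is a minimal sketch, not the tool that produced the record: it strips the 10-character blocking, computes a GC percentage, and translates codons using the codon assignments of translation table 11, which are the same as those of the standard code. The function names are hypothetical. As a simplification, the first codon is always reported as M, which assumes it is a valid initiator (TTG is one of the start codons allowed under table 11).

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a cleaned coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is emitted as M.
    """
    residues = []
    for index in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First line of the gene sequence and first 20 residues of the protein above.
first_line = clean(
    "TTGAACATTG TAAAGAAACT GATCATCTCT GGTTGGCCTG TGTGGCCCAT AATAGAAGGT"
)
listed_start = clean("MNIVKKLIIS GWPVWPIIEG")

print(translate(first_line))
print(translate(first_line) == listed_start)
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.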