Gene NSE_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0166 
Symbol 
ID3932058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp133504 
End bp134925 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content45% 
IMG OID637900322 
Productperiplasmic serine protease 
Protein accessionYP_506063 
Protein GI88608224 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.358427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAA ATAGGTTTTA TGAGATATTT TTCCTAGCTG TACTTTTGAA CTGTTCTTTT 
GCTGCTGGGG CTGTGCCATC TGAGGGATTT TCTGATGTTG TCTCAAGGCT TGCTCCTGCG
GTTGTGAATA TTTCCAGCGA ATACAGGTTA AGGATGGATA ACCAGGGTCT GTGTGGTAAC
CCGGCTATTT TGGAAGAGTT TTCTGAATTT TGTGAGCGCC TTAAGCCGTT TTTTCGCAAC
AAGAATCCGG GTAAGAAGTA TGGGACATCT TTGGGTTCTG GCTTTTTGAT TTCCGATGAT
GGGCTTATAG TTACGAATTA TCATGTCATT GCGAATGCTG ATAAAATCAG GGTCGTTCTG
AGTCAGTGTA GTGAAGCGTG CCAACAGTAT GAGGCTACGG TGATTGGGTA TGATAAAAAA
ACCGACCTTG CTGCGCTAAA AATTTCTGGA GTAAGTGGTC TTCCATATTT GCGTTTTGGT
GATTCGTCTA AAATGAGGCC CGGGGACTGG GTGATAGCAG TTGGTAATCC TTTTGGTTTA
GGGGGCTCTG TCAGTGCTGG AATTGTTTCA GCGATCAGTA GAGAGATCGG TCTATCTCAG
AACAGCGATT TTATACAGAC GGACGTTGTG CTCAATTCTG GTAATTCTGG AGGACCGCTT
TGTAATGCAA AGGGTGAAGT AATTGGTGTA AATACGGCTG CTGTGTATTC TAATGGTGGA
AGTGCAGGGA TTGGTTTTGC CGTGCCATCG AATGTTGCGA AGCCAGTGAT AGAAGCTCTA
GCTAAGGGTA AGCAGATTCA GCGTGGATGG ATAGGGATTG TCATCCAAGA GATCACAAAC
GAAACAAAAG ACTCACTTGG TGGAGACTTA TCCGGTGTTC TGGTGGCGAG TGTTGAAAAA
GATGGACCTG CGTATAAAGC TGGGATGAGG GTTGGAGACG TTATTACAGC TGTGAACGGG
GAGAAAATTA GCGGTTCAAG GAGATTAGTA AGGGAAGTTT CCGGGCGGAG AATAGGTGAT
ACAATAGAAC TGTCTGTAGT CAGGGATGCT CTTAAAAATA AGGAGACGGT GTCTTTGAAG
GTAAAAATTG AAAAAACACC GCAAAGGTAT GCAGACGATG GGGCATCGCA GTTAGAGGTC
ATAGGATTAG TGGTTTCCAA TCTGACTGAT ACGATTAGAA ATTCGTTTGG GCTTGGTGCT
AGCATTGAAG GGGTGGTGGT CTTGGCCGTT GATCCTGACA AAGAGAGTTT TCTCAAAGCT
GGGGATATAA TTATTGGGGT TGGTACCAAC AGGCAGATTT CTACTGTCCA GGAGTTTAAA
CAGCACATAG ACGAAGCAAA GAAAAAGGGG CAAAGGTCGC TCCTAATGCT GATAAATCGT
GGTAAGCAGA CGATCTTTGC GGCTGTAGGC TTGGATAACT AG
 
Protein sequence
MRRNRFYEIF FLAVLLNCSF AAGAVPSEGF SDVVSRLAPA VVNISSEYRL RMDNQGLCGN 
PAILEEFSEF CERLKPFFRN KNPGKKYGTS LGSGFLISDD GLIVTNYHVI ANADKIRVVL
SQCSEACQQY EATVIGYDKK TDLAALKISG VSGLPYLRFG DSSKMRPGDW VIAVGNPFGL
GGSVSAGIVS AISREIGLSQ NSDFIQTDVV LNSGNSGGPL CNAKGEVIGV NTAAVYSNGG
SAGIGFAVPS NVAKPVIEAL AKGKQIQRGW IGIVIQEITN ETKDSLGGDL SGVLVASVEK
DGPAYKAGMR VGDVITAVNG EKISGSRRLV REVSGRRIGD TIELSVVRDA LKNKETVSLK
VKIEKTPQRY ADDGASQLEV IGLVVSNLTD TIRNSFGLGA SIEGVVVLAV DPDKESFLKA
GDIIIGVGTN RQISTVQEFK QHIDEAKKKG QRSLLMLINR GKQTIFAAVG LDN