Gene NSE_0466 details


Gene Information

Locus tag: NSE_0466
Symbol:
ID: 3931776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 381860
End bp: 382969
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 36%
IMG OID: 637900622
Product: HK97 family phage major capsid protein
Protein accession: YP_506351
Protein GI: 88608773
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
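
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 381860
END_BP = 382969


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.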


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACAA GAGTAAATAA TTTACTTCTA AACCTCGAAA CAGAAATTAA ATCTTTTAAA 
GAGAAAGAAG ACGAAAGATT ACAGAGTTGT GAGAGAAGAT TAAAAAAACT AGAAGATGGC
TATGACTTGG CTGTGAAACG ACCTTTTACT TCGACCGAAA TAAGCGAAGA TTGTCATTAT
AAGAAGCAAC TCAATAGTTA TCTTAAGAAG GGATCAGCAA ATTCCATTGA AATCAAGAGT
AATGGCTTTC CACTAACAGA TACCTTAATT AACTCAATCC ATGAGCAAAT GAAGAAGCTC
TCCCCAATAA GGAAGCTAGC ATCTGTTAAT ACGATCTCAA CAGACTGTAT AACTTACATA
AGCAATGGTG ACGCGTTAGA GGCACAATGG GTTTCTGAGG ATGAGATAAC TAATAGCGAT
GCAAAAGAGC ACATAATCAC AACAGAAATA AATACCTATG AGCTCTACAC ACAACCAAAG
ACCACTCAAA AAGTCCTTGA TGATCCATCT ATAAACGTAG AAAAATGGTT AATTGAACAG
GCTGCTTTGG CATTTACACA ACTCGAAAAC GAAACCTTCA TTACTGGTGA TGGAAAAAAT
AAACCACACG GAATACTTCA ACATAATACA GATTCCACAG GAAAACAGAT TTGTAAGATC
ACAAGCAAGG AGAAAGACAA AATAACCCCT GCTGATATTT TAAGCCTCTA TTATGGATTA
CAAAGAGAGT TCTCTGTTAA TGGAACATTT CTTATGCACA GTACAACTAT CCAGCTAATA
AGAGGGCTAA AATATGAACA GACAGGACAC TATATGTTTC AGCCAGGAAT TTTAGTAGGT
CAACCCGACA CGTTAATGGG AATACCTGTT ATTGAGTGCT CAGAAATGCC TATAATACCA
ATATCTGATA AATCTACGAG TTCAACAAAG AAATATCCAA TCATTTTTGG AGATTTTAAG
CGTGGATATC AAATCGTCGA TAGAAGCGAA ATGCGGGTTC TAAGAGATCC ATACTCTGCA
AAACCTTATG TTTCTTTCTA TATTACCAAA AGGGTTGGCG CAAAGGTAAT TAATCCAAAG
GCTTTTGTTT TTTTGGAAAT GAAATCCTAA
 
Protein sequence
MDTRVNNLLL NLETEIKSFK EKEDERLQSC ERRLKKLEDG YDLAVKRPFT STEISEDCHY 
KKQLNSYLKK GSANSIEIKS NGFPLTDTLI NSIHEQMKKL SPIRKLASVN TISTDCITYI
SNGDALEAQW VSEDEITNSD AKEHIITTEI NTYELYTQPK TTQKVLDDPS INVEKWLIEQ
AALAFTQLEN ETFITGDGKN KPHGILQHNT DSTGKQICKI TSKEKDKITP ADILSLYYGL
QREFSVNGTF LMHSTTIQLI RGLKYEQTGH YMFQPGILVG QPDTLMGIPV IECSEMPIIP
ISDKSTSSTK KYPIIFGDFK RGYQIVDRSE MRVLRDPYSA KPYVSFYITK RVGAKVINPK
AFVFLEMKS
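
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the tool that produced the record: it builds the codon table by hand and assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACACAA GAGTAAATAA TTTACTTCTA"))  # MDTRVNNLLL
```

Passing the full gene sequence to `translate` should, under the same assumptions, yield the 369-residue protein sequence listed above.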