Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0466 |
Symbol | |
ID | 3931776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 381860 |
End bp | 382969 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637900622 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_506351 |
Protein GI | 88608773 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACAA GAGTAAATAA TTTACTTCTA AACCTCGAAA CAGAAATTAA ATCTTTTAAA GAGAAAGAAG ACGAAAGATT ACAGAGTTGT GAGAGAAGAT TAAAAAAACT AGAAGATGGC TATGACTTGG CTGTGAAACG ACCTTTTACT TCGACCGAAA TAAGCGAAGA TTGTCATTAT AAGAAGCAAC TCAATAGTTA TCTTAAGAAG GGATCAGCAA ATTCCATTGA AATCAAGAGT AATGGCTTTC CACTAACAGA TACCTTAATT AACTCAATCC ATGAGCAAAT GAAGAAGCTC TCCCCAATAA GGAAGCTAGC ATCTGTTAAT ACGATCTCAA CAGACTGTAT AACTTACATA AGCAATGGTG ACGCGTTAGA GGCACAATGG GTTTCTGAGG ATGAGATAAC TAATAGCGAT GCAAAAGAGC ACATAATCAC AACAGAAATA AATACCTATG AGCTCTACAC ACAACCAAAG ACCACTCAAA AAGTCCTTGA TGATCCATCT ATAAACGTAG AAAAATGGTT AATTGAACAG GCTGCTTTGG CATTTACACA ACTCGAAAAC GAAACCTTCA TTACTGGTGA TGGAAAAAAT AAACCACACG GAATACTTCA ACATAATACA GATTCCACAG GAAAACAGAT TTGTAAGATC ACAAGCAAGG AGAAAGACAA AATAACCCCT GCTGATATTT TAAGCCTCTA TTATGGATTA CAAAGAGAGT TCTCTGTTAA TGGAACATTT CTTATGCACA GTACAACTAT CCAGCTAATA AGAGGGCTAA AATATGAACA GACAGGACAC TATATGTTTC AGCCAGGAAT TTTAGTAGGT CAACCCGACA CGTTAATGGG AATACCTGTT ATTGAGTGCT CAGAAATGCC TATAATACCA ATATCTGATA AATCTACGAG TTCAACAAAG AAATATCCAA TCATTTTTGG AGATTTTAAG CGTGGATATC AAATCGTCGA TAGAAGCGAA ATGCGGGTTC TAAGAGATCC ATACTCTGCA AAACCTTATG TTTCTTTCTA TATTACCAAA AGGGTTGGCG CAAAGGTAAT TAATCCAAAG GCTTTTGTTT TTTTGGAAAT GAAATCCTAA
|
Protein sequence | MDTRVNNLLL NLETEIKSFK EKEDERLQSC ERRLKKLEDG YDLAVKRPFT STEISEDCHY KKQLNSYLKK GSANSIEIKS NGFPLTDTLI NSIHEQMKKL SPIRKLASVN TISTDCITYI SNGDALEAQW VSEDEITNSD AKEHIITTEI NTYELYTQPK TTQKVLDDPS INVEKWLIEQ AALAFTQLEN ETFITGDGKN KPHGILQHNT DSTGKQICKI TSKEKDKITP ADILSLYYGL QREFSVNGTF LMHSTTIQLI RGLKYEQTGH YMFQPGILVG QPDTLMGIPV IECSEMPIIP ISDKSTSSTK KYPIIFGDFK RGYQIVDRSE MRVLRDPYSA KPYVSFYITK RVGAKVINPK AFVFLEMKS
|
| |