Gene Information |
Locus tag | NSE_0007 |
Symbol | |
ID | 3931856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | + |
Start bp | 6618 |
End bp | 7808 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637900164 |
Product | HK97 family phage portal protein |
Protein accession | YP_505910 |
Protein GI | 88608802 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
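
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon (the gene sequence in this record ends in TAA). The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the span includes the stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(6618, 7808)
print(gene_len, protein_len)  # 1191 396
```

Under these assumptions the result matches the Gene Length (1191 bp) and Protein Length (396 aa) fields.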
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000540327 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACAA AAATATTTAA AAAATCAAGA AGCAAAGCTT GCCCCAATAG CAGAAGCCAA CTAGGCGGTG AGTGTCTATA TAATACATAT ACAGAACTCT GGAGCGGGAG AGACTACAGT GCATTTGCCC AAAAAGCGTA CATAAGAAAT GTGATCGCCT CACGTGCAAT CAGTATAGTA GCAGTGGCCG CCTCTTCTGT TCCTATACAA CTTTTTGAAT GTAGTGGGAC GGAGAAGAAA CGTTGCTTAG TCTCTCATCC GCTCAATGAG CTATTAAACG AGCCTAATCC CAATATGTCA AGAGTAACAT TGATAAAAAA TGCTGTTATG TACAAGCTAA TCAGCGGAAA TTTATACCTT CTTAAGATCG GGAACGGTTT ACCAAAAGAA CTGCACCTTC TAAGACCTGA TAGGGTCACA GTTATACCAG GAAGTGATTG TTTACCTTTG GGATACAGAT ATCGAGTTGG AAACTACGAA AGGGAATATT ATATGAACAA AATCACTGGG GACTGTGATG TTCTACACAT AAAGAATTTT CACCCGTATA ACGACTGGTA TGGATTATCG CCAGTAGAGG CGGCAATGTA CAGCATAGAT CAACATAATC AGGCTAGTCT TTGGAACCAA GCTATGCTCA AAAATGGTGC TAGGCCCAGC GGAGCGTTTA TATCAAAATC TAAGGAGCCA ATGCCACAAG AACAGTTCAA ACGTTTAAGC AGACAACTTG GAGACTGTTC CGGTGCTGAA AATGCAGGAA AAGCTATTCT TATAGAGGGA GGGATAGAGT GGAAAGAAAT GAGCATCTCT CCAAAGGAAA TGGATTTTCT TCAAAGTAAG TACAATTCCG CCAGAGAGAT AGCGTTGGCA TTTGGTGTAC CGCCACAATT ACTGGGAATA CCGGGAGACA ACACATATAG CAACTTAATA GAAGCTAGAC TATCCCTATG GGAAGAAACA GTTCTCCCTA TACTCGATGA AATAGTGCAT AGCTTAAATG TTTGGCTTAC TCCGGTTTTT GGAGATAATT TAGAGTTCGC GTACGAAAAA GATGGTATAG ACGCGCTATC CAAGAAGAGA GAAAAACTTT GGGATTGTGT AGAAAAGGCA TCTTTTTTGA CAATCAATGA AAAAAGGCAA GTATTTGGTT ATTCCACTAT GGAGGGTAAA GACGAAATAG TAGAGTCGTA A
Protein sequence | MLTKIFKKSR SKACPNSRSQ LGGECLYNTY TELWSGRDYS AFAQKAYIRN VIASRAISIV AVAASSVPIQ LFECSGTEKK RCLVSHPLNE LLNEPNPNMS RVTLIKNAVM YKLISGNLYL LKIGNGLPKE LHLLRPDRVT VIPGSDCLPL GYRYRVGNYE REYYMNKITG DCDVLHIKNF HPYNDWYGLS PVEAAMYSID QHNQASLWNQ AMLKNGARPS GAFISKSKEP MPQEQFKRLS RQLGDCSGAE NAGKAILIEG GIEWKEMSIS PKEMDFLQSK YNSAREIALA FGVPPQLLGI PGDNTYSNLI EARLSLWEET VLPILDEIVH SLNVWLTPVF GDNLEFAYEK DGIDALSKKR EKLWDCVEKA SFLTINEKRQ VFGYSTMEGK DEIVES