Gene NSE_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0007 
Symbol 
ID3931856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp6618 
End bp7808 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content40% 
IMG OID637900164 
ProductHK97 family phage portal protein 
Protein accessionYP_505910 
Protein GI88608802 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000540327 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACAA AAATATTTAA AAAATCAAGA AGCAAAGCTT GCCCCAATAG CAGAAGCCAA 
CTAGGCGGTG AGTGTCTATA TAATACATAT ACAGAACTCT GGAGCGGGAG AGACTACAGT
GCATTTGCCC AAAAAGCGTA CATAAGAAAT GTGATCGCCT CACGTGCAAT CAGTATAGTA
GCAGTGGCCG CCTCTTCTGT TCCTATACAA CTTTTTGAAT GTAGTGGGAC GGAGAAGAAA
CGTTGCTTAG TCTCTCATCC GCTCAATGAG CTATTAAACG AGCCTAATCC CAATATGTCA
AGAGTAACAT TGATAAAAAA TGCTGTTATG TACAAGCTAA TCAGCGGAAA TTTATACCTT
CTTAAGATCG GGAACGGTTT ACCAAAAGAA CTGCACCTTC TAAGACCTGA TAGGGTCACA
GTTATACCAG GAAGTGATTG TTTACCTTTG GGATACAGAT ATCGAGTTGG AAACTACGAA
AGGGAATATT ATATGAACAA AATCACTGGG GACTGTGATG TTCTACACAT AAAGAATTTT
CACCCGTATA ACGACTGGTA TGGATTATCG CCAGTAGAGG CGGCAATGTA CAGCATAGAT
CAACATAATC AGGCTAGTCT TTGGAACCAA GCTATGCTCA AAAATGGTGC TAGGCCCAGC
GGAGCGTTTA TATCAAAATC TAAGGAGCCA ATGCCACAAG AACAGTTCAA ACGTTTAAGC
AGACAACTTG GAGACTGTTC CGGTGCTGAA AATGCAGGAA AAGCTATTCT TATAGAGGGA
GGGATAGAGT GGAAAGAAAT GAGCATCTCT CCAAAGGAAA TGGATTTTCT TCAAAGTAAG
TACAATTCCG CCAGAGAGAT AGCGTTGGCA TTTGGTGTAC CGCCACAATT ACTGGGAATA
CCGGGAGACA ACACATATAG CAACTTAATA GAAGCTAGAC TATCCCTATG GGAAGAAACA
GTTCTCCCTA TACTCGATGA AATAGTGCAT AGCTTAAATG TTTGGCTTAC TCCGGTTTTT
GGAGATAATT TAGAGTTCGC GTACGAAAAA GATGGTATAG ACGCGCTATC CAAGAAGAGA
GAAAAACTTT GGGATTGTGT AGAAAAGGCA TCTTTTTTGA CAATCAATGA AAAAAGGCAA
GTATTTGGTT ATTCCACTAT GGAGGGTAAA GACGAAATAG TAGAGTCGTA A
 
Protein sequence
MLTKIFKKSR SKACPNSRSQ LGGECLYNTY TELWSGRDYS AFAQKAYIRN VIASRAISIV 
AVAASSVPIQ LFECSGTEKK RCLVSHPLNE LLNEPNPNMS RVTLIKNAVM YKLISGNLYL
LKIGNGLPKE LHLLRPDRVT VIPGSDCLPL GYRYRVGNYE REYYMNKITG DCDVLHIKNF
HPYNDWYGLS PVEAAMYSID QHNQASLWNQ AMLKNGARPS GAFISKSKEP MPQEQFKRLS
RQLGDCSGAE NAGKAILIEG GIEWKEMSIS PKEMDFLQSK YNSAREIALA FGVPPQLLGI
PGDNTYSNLI EARLSLWEET VLPILDEIVH SLNVWLTPVF GDNLEFAYEK DGIDALSKKR
EKLWDCVEKA SFLTINEKRQ VFGYSTMEGK DEIVES