Gene NSE_0242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0242 
Symbol 
ID3931999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp205891 
End bp207426 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content46% 
IMG OID637900398 
Product51 kDa major antigen 
Protein accessionYP_506136 
Protein GI88608614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000185861 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAC TTAGCAAGAT ATTACTTTTG ACGACGGCAC TAGCCAGCGT CGCAGGCGCG 
TCCGAGGTGC CTTTGACAGA GGATCAAGTT CCGGCTGTTG AGAAGACGAC ATCGAATAAA
CCATGTGTCT GCAATAAAGC AGGCCCTAAT CAGGTCAAGG CCCGATTGAG TAAATTTGCG
GACCATTGTG CAACTGGTAT GGGCTCGCGT GGATGCAGTT GTGATGGTTC GTCTGACCTA
AATGGGACCA GTAACTGCGA TGCAGTGAAT TTCGTGTTCA AAGTAAAGGG TAGTAACGAT
TTCTCGTTCG GTTACGCGAG CAATCAGGAC TTCTTCAAGT TAGCGAAAGG TCTTCCAAAA
ATCGATGTCC TTGTAGATGC AAGCGGTAAA GATATTGAAA GCTCATACAA TGGAGATAGT
AGCACTAGTG GTACTGTGAA CGGCGTTAAG GCCCTTTCTG ATGGTGGCGT TCTAGGTGAT
TACACACGTA GCACTGATTT GTTCAACGAG CACAAGTTAT CAATAGAAGC TAGACGTACA
CTAGGCAGCT TCGCTTATGG TGGTTTGCTA GAAGCAGAAT TTAGTAGGAA AGATGCGGTT
AGTGCCGATA ATGCATACGT TTTCTTTGAA ACCGGTTACG GAAGATTTGA AATGGGCCGC
ATCACTGACA GTGCTGTGGA ACCGCTCAGG ATTGATGCAT CTTCCATCGC TGCTGTTGGT
GGTGGTTTTG GTGATCTAAA TTGGACGACG CTAGCTAACC TTGAAGGACG CCCTATAGGT
GCTACGCATA GCACAACAGG GAATGGTGAT AGCCAGAAGA GCAGCAGCAC GCGTCATAGG
GATGCACAGC GCCCTTTCTT GGTGCATGCA AACTACTATA CCGCATATAA CAATCCACTA
AGGGCTAACT TCATTACTAC TGGGCTGGGC AATTTGCGGA TGGCATTGGG TTATACGAAC
TCTACTGCGG ATGGTACATA CCATGATATT ATTGATGTAG GTGCTGGCTA TGCTGGGAAG
AAAGGAAATC TGAAGTATGC TATTTCCTTC AGTGGTCAGG CTGGTCTCAG CACTCCAACT
GGTGATGAAC ATCACCCTCT CAGACGTTTT GAAGTCGGTG CATCGGTTCA GCTTCACACT
ATAAAGCTTG CTGGATCATG GGGTAGTACG TATCTCTCTG GAGTTAAAAA ATCGAAGGAT
ATGCAACTTG ATTTAACTAA GGCTTTTGCT GATAGCAGTC AACTCAAAAA AACAGACGGT
GATAGTACTT ACATGACTTT CGGTGCTACA TATGAAGAAG GTCCTGTGAT GTTTAGCCTT
GGCTATATGG AGAGTTATAA TACCTTCGTT AAAAGTGTCG GAGTGAATAC GCTAAGAGTT
GTTTCCCTTG GTACGCATTA TCGCATCACT GGAAGCACGT ACGAGCTTAC GCCTTACATT
AACACCAAGT GTTTCATGGC TCAGGAAGCT GGGATTAAGG CTGAGGACAA CAACAAAGGT
TTTGTTCTTG CTTCCGGTGT GAAGGTATCG TACTAA
 
Protein sequence
MYKLSKILLL TTALASVAGA SEVPLTEDQV PAVEKTTSNK PCVCNKAGPN QVKARLSKFA 
DHCATGMGSR GCSCDGSSDL NGTSNCDAVN FVFKVKGSND FSFGYASNQD FFKLAKGLPK
IDVLVDASGK DIESSYNGDS STSGTVNGVK ALSDGGVLGD YTRSTDLFNE HKLSIEARRT
LGSFAYGGLL EAEFSRKDAV SADNAYVFFE TGYGRFEMGR ITDSAVEPLR IDASSIAAVG
GGFGDLNWTT LANLEGRPIG ATHSTTGNGD SQKSSSTRHR DAQRPFLVHA NYYTAYNNPL
RANFITTGLG NLRMALGYTN STADGTYHDI IDVGAGYAGK KGNLKYAISF SGQAGLSTPT
GDEHHPLRRF EVGASVQLHT IKLAGSWGST YLSGVKKSKD MQLDLTKAFA DSSQLKKTDG
DSTYMTFGAT YEEGPVMFSL GYMESYNTFV KSVGVNTLRV VSLGTHYRIT GSTYELTPYI
NTKCFMAQEA GIKAEDNNKG FVLASGVKVS Y