Gene NSE_0023 details


Gene Information

Locus tag: NSE_0023
Symbol:
ID: 3931447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 18761
End bp: 21073
Gene Length: 2313 bp
Protein Length: 770 aa
Translation table: 11
GC content: 48%
IMG OID: 637900180
Product: hypothetical protein
Protein accession: YP_505926
Protein GI: 88608147
COG category:
COG ID:
TIGRFAM ID:
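
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(18761, 21073)
print(length)                     # 2313
print(protein_length_aa(length))  # 770
```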


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACACAA AGATATGTGA AGCAATTAAA AGTTCGGACA TTCTTACTTT AAGGTGTCTA 
ATGCAAGAGG CAGGATTGGA GGAAGCGAAC GGTACTGCCT CAGCACAAGG TCGACCTCTC
TGGTCCATGG TTGATTCTGA TGGGCGTAGT GTTCTGCAAG TTGCTGTAGA AAGTCAAAAT
ACAGGTGTGC TTAAATGCGT GATGGAGGCA CTACCCGGTA ACTGCTTAGT AGATCTGTGT
GCTAAGAAGA CAAGTACGAG CCATCCTTTT TTCCCTGAAA GCACCCCACT GCACACCGCA
ATTGCAGTTG GGCGCATGGA AGTCCTGGAG TCACTACTAC GCGGAATGTT AAGCGCGCGT
GGACAAAATG GTAAACCGCC ATCTAGTACT GTGGTTGTTT GGCATACGGC TGATGCAAAT
GGGCGAACAC CTTTAGCCGC CGCATTGGCC ACCGGCAAGC TAGAGATTGT GAAGCTAGTC
CTTGGAGCAA TAAAGACTCT GGAGCGAGAT ATGCAGACGG CTAGAGGAGA TCGTGTACCA
CTGGTCCCCG GTATTCTTCA AATTGATGAT ATCAGAGTTA GGGGTGATAA CGGACTTAAT
ATGGCTGTAT CTACAGGTAA TGTGGGTATA GTTGAGGTTC TGATCGATGC TCTGACTCCT
GAGGAGCTGG CACCTATACT AACAAGGTGT AATTTAGTGG GTGACACTGC ACTTGACCAG
GCCGCGCGTG CAGGAAATGT AGACATTGTG AGGCTTTTAG TCAAAAAGCT GGGGGATTTG
TATGAGCCGT GGGTTAGTAC ACCTTTCCAG GTTGTTGTGG AGTCCTGCGC GGAGAGTAAA
TACAAAGGGA ATGGAGCGCA AGTTCCAGAG CACTTTTCAA TGCGTCTTTT GGGTAACGCA
GTTTCGAGTG GTAATGCTCA AGTTGTTAAG GAAGTTTTAG GACCTCTGTC TATCGAAGAG
CGGTATAAGC TGTTGAGTCG TACCAGTTGG GGTTATCCTT ACCCTGCACT ACACCTGGCA
TTCTCCTCAG GGGCACACGA ATGCGTAAAA ATAATGCTTG ACTCGCTTGT TGCTTGTCCT
GGTGGAGGTA CGAAGTATGT GTCAAAAGTT CTTGCTCAAC GTTCCGGTGG GCTTACTCCG
CTGCATTGTG CCGTTGATGC CAAGTCAGTT GCGGTAGTAC AAGGGTACGG ACTACCGAAG
GGGGTCCTGT TCGGGCTACT TACGGCTGGA GGCTCGTCCT GTTCTCCTGA AGGACCTGTT
TACGTGCCTG ATGGTATGAA TCCACTTCAA GCTATGCTTG CTGGTCCTAC TGGTGATGGG
AATCCTGTTC CTGGGGCAGT TGGTGTAATT AGGGCTATGC TGGATCTTTT AGAAGGGGAT
GCAGTTTTGA TGCAGCGTGT GCTTTCTTCA ATGGATGCGG GAGGACGTAA CACTCTGTAC
ACTTTTGCAA GTCTTGTAGG TTCAAGGGGT GTTTCTGCCA CTGACTTTGT GACTATGCTC
AACTACCTTG AAGGCAGAGT AAACCTCAGG ACTTTACTCG AGCAAAAAAA CATGACGGAG
GATGTCTCTA CACTTGATGT TGTTCACGAA GCGGAACACA ATTTGTACAG TTGGGGTTTT
AGTCTGAACG ATCATGTTTC CCAAGCGTTA AATAGACAAA GAGGAGTTTC AAAGAGCCGG
GCTTCCAGGT TAAAGGTAGC GGGTGATGCT GCAGTGTTAT CCAGTTTTTT CTTGTCCTGC
ATATCAGGGC TTATTGTGGT ATGTAGCGCC CTTTATAATG TATGTTCTAC GGGAGTTAAA
AAGAAAAGCG CTGGATTTAC CGTATTTGAA ATAGCATACA TTGTATTTGC TTTCTCTGTC
GTTATGCTTC TCCTCACTTT TTTCTGTATA ACTCCAGGTA TGCACGGTGC AGCTAATCGG
TGTGCTGTGA TAACAGGCGA TCCTAGAGTA AACATCCCTG AGCCGAGTTT TAATGACGGG
ATATCGATAT GCGCAAACAC AGAGATCCAA TTCAGACCTG AAGACGTCGA GAAAATAGAA
ATGGAAAGGT TGCATCTTAG AAGCAAAAGA GACCCAATGG CGGTCTTTTC TTCATCTCCT
TCTCTTTCCT CCTCCGCTCT GAGTGCACGA CCTGTACCTG GTTTGCTTGT TGAACAGTTG
AGTGCCTTCC ATGTACGCGA TCCGGGCCAC CACTGTTGTT CTACATCTGA GAATGAGGGA
TCCTATGCGT TGGAGGAGTC TCCTGGTACA GGGATACAGG GTTTGGCAGC GGAGGGACTA
TGTGATGAAC AACAGAAAGG CGTTGCTGAG TAA
 
Protein sequence
MYTKICEAIK SSDILTLRCL MQEAGLEEAN GTASAQGRPL WSMVDSDGRS VLQVAVESQN 
TGVLKCVMEA LPGNCLVDLC AKKTSTSHPF FPESTPLHTA IAVGRMEVLE SLLRGMLSAR
GQNGKPPSST VVVWHTADAN GRTPLAAALA TGKLEIVKLV LGAIKTLERD MQTARGDRVP
LVPGILQIDD IRVRGDNGLN MAVSTGNVGI VEVLIDALTP EELAPILTRC NLVGDTALDQ
AARAGNVDIV RLLVKKLGDL YEPWVSTPFQ VVVESCAESK YKGNGAQVPE HFSMRLLGNA
VSSGNAQVVK EVLGPLSIEE RYKLLSRTSW GYPYPALHLA FSSGAHECVK IMLDSLVACP
GGGTKYVSKV LAQRSGGLTP LHCAVDAKSV AVVQGYGLPK GVLFGLLTAG GSSCSPEGPV
YVPDGMNPLQ AMLAGPTGDG NPVPGAVGVI RAMLDLLEGD AVLMQRVLSS MDAGGRNTLY
TFASLVGSRG VSATDFVTML NYLEGRVNLR TLLEQKNMTE DVSTLDVVHE AEHNLYSWGF
SLNDHVSQAL NRQRGVSKSR ASRLKVAGDA AVLSSFFLSC ISGLIVVCSA LYNVCSTGVK
KKSAGFTVFE IAYIVFAFSV VMLLLTFFCI TPGMHGAANR CAVITGDPRV NIPEPSFNDG
ISICANTEIQ FRPEDVEKIE MERLHLRSKR DPMAVFSSSP SLSSSALSAR PVPGLLVEQL
SAFHVRDPGH HCCSTSENEG SYALEESPGT GIQGLAAEGL CDEQQKGVAE