Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0121 |
Symbol | |
ID | 3931632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 99201 |
End bp | 100451 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637900277 |
Product | hypothetical protein |
Protein accession | YP_506021 |
Protein GI | 88608745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.651767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCTCTTT TTACATCTGT TATGAAGCAG AGATGGCTCT CTCTTGGTGT CCTTGCTTTG GGACTCTCGG GGATTTTCTC CCTATTTGTA GTCCTACTGA GATTACCCTT CTTCATTGGT TTAGTACCAA ATATTTTTGA TGTTTCACTC GTTATACATG TTACACTCGG CATCAACGCG TGGATGCTTG CTATCACAGC TGCTACCATA GCAGGAGAGG GGTATTTCCA AAAATTCAGT CACAACTTAT GTCTGCTAGG AGTAATTTTT ATTTTTATTG CAGGCTTTAT CCCGGAAACG CACGCGTTGA AAAATAACTA CATCCCCTTC TTAAATAATT TACCTTATGT TCTAGGCACA GCGCTACTTT CATTTGGACT TTCCGTAAGC GCTATCAACA CGCTGTGCAA TAGGACAGCA GGAGAGTGCA AACGAATCAC TGCACTAATT TTTTTAATCT CGCAACTATG TACTTATCTG GCATACGCGA GGATGCCAAG TGGCATCACG TTATATGACT TTTATGAGTA TCTCTTCTGG GGGGGCGGAC ACATACTGCA GTTTGTGTTT TGTCAGACAT TAATGCTAGT GTACGCAACA CTCCTCGGCA TAGGGACATC TGATAGGACC CTACGGGGAT TATCTCTATT TAACTTACTT GCTGTTTTGC CAACTCCGGT ATTATATTTC TTTTTCTCCC CTGATAGAGA AATCCTTATA CGGGTTTTTA CTTATCACAT GCGTGTTTTA GGAAGTGTGG CACCAATGAT TTTTATTTTT TACCTACTTT CTCTTCCAAA AAAGAGAGTC GATTGGTCCA ATATCGCTGC TTTCACTTTC TCAGCATGCC TCTTCATCTA TGGTGGACTT CTAGGCTTTC TTATCAGAGA GAGCAATGTC GTTATTCCAG CTCACTACCA TGGTTCAATC ATTGGAATAA CTCTTGCTTT CATGGCATTC GCTTACCATT TTACAGGAAT ATTATCTTTA AAACTTCCAA TGTTGCAGCT CACAACATAT AGCATTGGAC AATTTGCACA CATTACAGGG CTACTTCTCA TGGGTGGATA TGGTGCACTC AGAAAAAGCG CTGGAATGGT AGGAGGAAGT ACCACGCTGT TCAAGTCCAT CTTCTTCTTC GGAGGCAGCT TGAGTATCTT ATCCGGTGGA CTGTTCGTAA TTCTCCTTAC ATCGACTCTT CTAAAGAATG AAAAAGGAAA TGAGACTCTT AAGTCCAGAA ACTCTTTATA G
|
Protein sequence | MSLFTSVMKQ RWLSLGVLAL GLSGIFSLFV VLLRLPFFIG LVPNIFDVSL VIHVTLGINA WMLAITAATI AGEGYFQKFS HNLCLLGVIF IFIAGFIPET HALKNNYIPF LNNLPYVLGT ALLSFGLSVS AINTLCNRTA GECKRITALI FLISQLCTYL AYARMPSGIT LYDFYEYLFW GGGHILQFVF CQTLMLVYAT LLGIGTSDRT LRGLSLFNLL AVLPTPVLYF FFSPDREILI RVFTYHMRVL GSVAPMIFIF YLLSLPKKRV DWSNIAAFTF SACLFIYGGL LGFLIRESNV VIPAHYHGSI IGITLAFMAF AYHFTGILSL KLPMLQLTTY SIGQFAHITG LLLMGGYGAL RKSAGMVGGS TTLFKSIFFF GGSLSILSGG LFVILLTSTL LKNEKGNETL KSRNSL
|
| |