Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0236 |
Symbol | |
ID | 3931636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 200705 |
End bp | 201694 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637900392 |
Product | metalloendopeptidase glycoprotease family |
Protein accession | YP_506130 |
Protein GI | 88608749 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.232493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATC ATTTAATTTT AGGTGTAGAG ACAAGTTGCG ATGAAACCTC AGTCGCCATT GTTTCTGAAG AAGGGGAGGT TTGCTTTCAC GAAATTTTCA CCCAAGACCA TAGCAAATAT AATGGTGTCT ACCCGGAATT TGCATCCAGG GAGCATTTGA AAATTTTACC CCAGATACTA CGAAGGGCAG TTCAAGCGCA CGATCTTGAA AAATTAACAG CCATTGCTTG TACAGTTGGC CCAGGGTTGG TTGGATCGCT GATAGTTGGA GTGATGATGG CTCGCGGTCT TGCATTTTCA CTTAAAAAAC CTGTTTTCGG AGTAAACCAC CTCGAAGGCC ACCTACTTGC TGTGAGACTT GTAGAGAAAA TTAATTTCCC ATTTGTTTGT CTCGTGATTT CAGGAGGACA TTCTCAACTT ATCGATGCAA GAGGAATAGG TGACTATGTT CTTCTTGGAG AAACACTGGA TGATGCATTT GGTGAAGCAT TTGATAAACT AGCAACTATG CTTGGATTTA CATATCCAGG AGGAAAAACC GTAGAAAAGC TCGCAATCAA GGGTGACTCA GAACGTTTTC GTTTGCCGGC AGCAATGATA AATCAATCTG GTTGTAATTT TTCCCTATCA GGGATAAAAA CAGCTCTAAA AAAAATAATT ACTTCATTGC CCCAAATAAC AGAAAAAGAT AAGGCAGATA TTTGCGCATC ATTCCAGGCA TGCGTGGCAA GGATTATGGT CAACAAGTTG GAACAAGCCG TGAAAATTTG TGGTCATTCT AGGATCGTGT TAGCTGGGGG AGTTGGCTCC AATCGTTACA TAAGAGAAAC ACTAGAAGAG TTTGCAAAGA ATCACAACTT GTCGCTGCAC TTTCCAGAAG GTATTCTATG TACAGATAAC GCAGCAATGA TAGCTTGGGC AGCTATAGAA AGACTTAAAG CAGGCTGCAC AGAACTATCT CTGGAACCAC AACCAAGATT ATGTTGGTAG
|
Protein sequence | MNNHLILGVE TSCDETSVAI VSEEGEVCFH EIFTQDHSKY NGVYPEFASR EHLKILPQIL RRAVQAHDLE KLTAIACTVG PGLVGSLIVG VMMARGLAFS LKKPVFGVNH LEGHLLAVRL VEKINFPFVC LVISGGHSQL IDARGIGDYV LLGETLDDAF GEAFDKLATM LGFTYPGGKT VEKLAIKGDS ERFRLPAAMI NQSGCNFSLS GIKTALKKII TSLPQITEKD KADICASFQA CVARIMVNKL EQAVKICGHS RIVLAGGVGS NRYIRETLEE FAKNHNLSLH FPEGILCTDN AAMIAWAAIE RLKAGCTELS LEPQPRLCW
|
| |