Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MARTH_orf471 |
Symbol | mspJ |
ID | 6418390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycoplasma arthritidis 158L3-1 |
Kingdom | Bacteria |
Replicon accession | NC_011025 |
Strand | + |
Start bp | 409106 |
End bp | 410119 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 4 |
GC content | 33% |
IMG OID | 642715578 |
Product | massive surface protein MspJ |
Protein accession | YP_002000016 |
Protein GI | 193216774 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02184] Mycoplasma virulence family signal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.00649637 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCATG CTAAGAAAAA GAAAATTGCT ATTATTGTGC TTGCTGGTGC CGCAGCGTTG TTAGCGGCCG GAACTATTTC AGGAGTACTT TATGCACATC AAAGCGCTAC TAAGAGATCC AAAGGCAATT ATAAAAAGCC CGAAATTCAA GATATTAGTG AGTTGGAAAA AAAGATTGAG GCTATTCGTG ATAGTCACAA GCAAAATCTT AAAACTGAAG CGTGAAAAAT TCTTGAAGCT CTTAAAATCA CGATTAGAAA TGCCAAAAAA GCTAATAACG TACGCGATTT AAGCCAGGTT ATTAGTAATT TTGAACGTTT GATTCCTCTA GGAGAGGCTT ATTTAGCGGA ACTTAAAAAG CTGCCTGAAC TTGAAGCGCT AGCTAACGAT CTAAAAAATG TGATTGATTT AGCTAAAGAA GCCTTAAAAG AAGCTAAAGA AAAACTACTA GATTTGCAGC AAAAAGAAAA AATTCTAAAA GATAAGTTGC AAACTTTACT TAATAAAATT ACACAAGCAA TTGCAAAAGA GCCTAATGCC AATGATGTGG CAACAATCGA AGCTTTAATA GCCGAACTTA AGACGCTCCA AATTGAAAGT GACGATTTAG CACAATCACT TAAAGCAGCA AAATTGCTTG ACGAACTCAA ACTATTAAAC GATGCTAATT TAAAAATCAA AGAAACAATT AGCATTTTAC AAAAACGTTT AATTGCTATC AATCCCGAAA AGCAAAAAGA AATTACTAAA CAAGTAAATC AAAAAATTCT TGATCTAGAA AAATCACAAA AAGATGTTGA AAATGCTAAT GATATTTCAA CATTACCTAA CGCCATTAAA AAACTTGAAA AAGATCTAGA AGATGCTAAA TCTTTAGAAA AAGATGCGAA AGATAATGGC CTAAATGACG TTGCTAAGAA ACTACAAGAT GCCATTAACA AGGGCGAAGA AGCATTGAAA AAAGGTAAAG AAAAAAGAAC AAAAAATAAT CGAAGCCAAC AATGCTTTAA TTAA
|
Protein sequence | MSHAKKKKIA IIVLAGAAAL LAAGTISGVL YAHQSATKRS KGNYKKPEIQ DISELEKKIE AIRDSHKQNL KTEAWKILEA LKITIRNAKK ANNVRDLSQV ISNFERLIPL GEAYLAELKK LPELEALAND LKNVIDLAKE ALKEAKEKLL DLQQKEKILK DKLQTLLNKI TQAIAKEPNA NDVATIEALI AELKTLQIES DDLAQSLKAA KLLDELKLLN DANLKIKETI SILQKRLIAI NPEKQKEITK QVNQKILDLE KSQKDVENAN DISTLPNAIK KLEKDLEDAK SLEKDAKDNG LNDVAKKLQD AINKGEEALK KGKEKRTKNN RSQQCFN
|
| |