Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09121 |
Symbol | |
ID | 4717619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 783033 |
End bp | 784040 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640078625 |
Product | RNA methyltransferase TrmH, group 3 |
Protein accession | YP_001009303 |
Protein GI | 123968445 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0566] rRNA methylases |
TIGRFAM ID | [TIGR00186] rRNA methylase, putative, group 3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT CCTCTAAAAA AAAATTCCCA GGAAAAAATA ATAAAGAATA CAAAAAAAAC TCAGATTTTG GTTATTACCC AAAAAATAAA AATCGTTCTG AAAAAATTGA TAGATTATCG AACAATTCTG ATAAATATAA GAATGTTGAA AATTTAAATA AAAATGAAAA GGATAGTACT TTTTCATCTT TAAAAAGAAA AAAGCCAACA TTTAAATCTA ATATAGGTCT TCATAATAAA AATCCTGATA TTAATCAAGA GTTTAATAAC AAGAAAAATT TTGATGATTG GATATGGGGC AAACATTCGG TTTATGAGGC TCTTAGTAGT GAAAGAGCAA TTAATAGGAT TTGGTGTACA TCGGAAATCT TTTCTTCAGA TAAATTCTAT ATTTTGCTTA AGGATCTTAA ATCAAAAGGA GTGCTTATTG AAGAAGTTTC TTGGAACAGG CTTTCGCAAT TAACTTATGG TGCTTCACAT CAAGGCATCG CATTACAGTT GGCATGTTCT AGAACAATAT CCCTAGAACA ATTAATCGAT TTTTCTAGAC ACAACTGCGC AAATCCCATA ATACTTGCAT TGGATGGTAT TACTGATCCG CATAATGTTG GTGCGATCAT TAGATCAGCG GAAGCATTTG ATTGCAAGGG CATCATCATT CCTCAGAGAA GATCCGCTGG ATTGACAGGA ACAGTAGCTA AAGTGGCTGC AGGAGCCTTA GAACACGTGC AAGTAAGTAG AGTTGTAAAC CTAAATAGAG CACTTGAGGA ACTTAAGAAA AATGGTTTTA TTGTTGTTGG CCTATCTGGA GATGGCCAAT TATCTATCTC AAATTTTCTT GAAAAAGCCC CTTTGGTAGT TATAGTCGGT TCAGAAGAAA AAGGTATTTC TTTACTTACT CAAAAAAAAT GCGATTTTCT ATTAAGTATT CCCCTTAGAG GTAAGACTTC AAGTTTAAAT GCATCTGTAG CGGCCGCTAT ATCACTATTT CACTTGACAA GTATATAA
|
Protein sequence | MKNSSKKKFP GKNNKEYKKN SDFGYYPKNK NRSEKIDRLS NNSDKYKNVE NLNKNEKDST FSSLKRKKPT FKSNIGLHNK NPDINQEFNN KKNFDDWIWG KHSVYEALSS ERAINRIWCT SEIFSSDKFY ILLKDLKSKG VLIEEVSWNR LSQLTYGASH QGIALQLACS RTISLEQLID FSRHNCANPI ILALDGITDP HNVGAIIRSA EAFDCKGIII PQRRSAGLTG TVAKVAAGAL EHVQVSRVVN LNRALEELKK NGFIVVGLSG DGQLSISNFL EKAPLVVIVG SEEKGISLLT QKKCDFLLSI PLRGKTSSLN ASVAAAISLF HLTSI
|
| |