Gene A9601_09121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_09121 
Symbol 
ID4717619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp783033 
End bp784040 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content33% 
IMG OID640078625 
ProductRNA methyltransferase TrmH, group 3 
Protein accessionYP_001009303 
Protein GI123968445 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0566] rRNA methylases 
TIGRFAM ID[TIGR00186] rRNA methylase, putative, group 3 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACT CCTCTAAAAA AAAATTCCCA GGAAAAAATA ATAAAGAATA CAAAAAAAAC 
TCAGATTTTG GTTATTACCC AAAAAATAAA AATCGTTCTG AAAAAATTGA TAGATTATCG
AACAATTCTG ATAAATATAA GAATGTTGAA AATTTAAATA AAAATGAAAA GGATAGTACT
TTTTCATCTT TAAAAAGAAA AAAGCCAACA TTTAAATCTA ATATAGGTCT TCATAATAAA
AATCCTGATA TTAATCAAGA GTTTAATAAC AAGAAAAATT TTGATGATTG GATATGGGGC
AAACATTCGG TTTATGAGGC TCTTAGTAGT GAAAGAGCAA TTAATAGGAT TTGGTGTACA
TCGGAAATCT TTTCTTCAGA TAAATTCTAT ATTTTGCTTA AGGATCTTAA ATCAAAAGGA
GTGCTTATTG AAGAAGTTTC TTGGAACAGG CTTTCGCAAT TAACTTATGG TGCTTCACAT
CAAGGCATCG CATTACAGTT GGCATGTTCT AGAACAATAT CCCTAGAACA ATTAATCGAT
TTTTCTAGAC ACAACTGCGC AAATCCCATA ATACTTGCAT TGGATGGTAT TACTGATCCG
CATAATGTTG GTGCGATCAT TAGATCAGCG GAAGCATTTG ATTGCAAGGG CATCATCATT
CCTCAGAGAA GATCCGCTGG ATTGACAGGA ACAGTAGCTA AAGTGGCTGC AGGAGCCTTA
GAACACGTGC AAGTAAGTAG AGTTGTAAAC CTAAATAGAG CACTTGAGGA ACTTAAGAAA
AATGGTTTTA TTGTTGTTGG CCTATCTGGA GATGGCCAAT TATCTATCTC AAATTTTCTT
GAAAAAGCCC CTTTGGTAGT TATAGTCGGT TCAGAAGAAA AAGGTATTTC TTTACTTACT
CAAAAAAAAT GCGATTTTCT ATTAAGTATT CCCCTTAGAG GTAAGACTTC AAGTTTAAAT
GCATCTGTAG CGGCCGCTAT ATCACTATTT CACTTGACAA GTATATAA
 
Protein sequence
MKNSSKKKFP GKNNKEYKKN SDFGYYPKNK NRSEKIDRLS NNSDKYKNVE NLNKNEKDST 
FSSLKRKKPT FKSNIGLHNK NPDINQEFNN KKNFDDWIWG KHSVYEALSS ERAINRIWCT
SEIFSSDKFY ILLKDLKSKG VLIEEVSWNR LSQLTYGASH QGIALQLACS RTISLEQLID
FSRHNCANPI ILALDGITDP HNVGAIIRSA EAFDCKGIII PQRRSAGLTG TVAKVAAGAL
EHVQVSRVVN LNRALEELKK NGFIVVGLSG DGQLSISNFL EKAPLVVIVG SEEKGISLLT
QKKCDFLLSI PLRGKTSSLN ASVAAAISLF HLTSI