Gene NATL1_09311 details

Gene Information

Locus tag: NATL1_09311
Symbol:
ID: 4779223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 858043
End bp: 859371
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 38%
IMG OID: 640084208
Product: RNA methyltransferase TrmH, group 3
Protein accession: YP_001014754
Protein GI: 124025638
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0566] rRNA methylases
TIGRFAM ID: [TIGR00186] rRNA methylase, putative, group 3
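
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(858043, 859371)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both results agree with the Gene Length and Protein Length values listed in the record.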


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0566162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTATC GCTTCAATAA AGACACAAGG AATTCAAAAA ACTCTTCATC TAATAAACGA 
AGTAGTAATT ATTCTCGTCA AGATAATGAC TCCAAAAAGT CAAACTTTAA TGATAGAAGA
AGAACCAATA ACAAAATAAA TCGTCATTCT TCTGATCAAG TAAATCAATA TTCATTAGGT
AAAGAATTTT CTGATCGATC AAGAAGTAGT AACGAGGCTA ATTCCAACTA CAGAGGATCT
AATCGATTCG AGAGGAAATC GACCAACTCT TCGTATAGAA ATCAAGACTC TCAGGAAACA
AACAATTACA GAGGATCTAA TCGATTCGAG AGGAAATCGA CCAACTCTTC GTATAGAAAT
CAAGACTCTC AGGAAACAAA CAATTACAGA GGATCTAATC GATTCGAGAG GAAATCGACC
AACCCTGCGT ACAGAAATAA AAATTCTCAG AGAGCTAGCA GTTATAAAAG AGAAGAAAAT
AATGAACCTC TTTCATATTC TGAAAGCTTT AGCAAAACAT TAAGTGATGA TCTGATTTGG
GGCCGCCATT CAACTGAGGC AGCCCTTGTT GGCGGCAGGG CAATTCACAG GATTTGGTGT
ACCTCCGAAT TACGCAGTAC ACCAAAGTTT TTTCAACTTC TCAAAGATCA AAAAGCTTCT
GGGGTCTTAG TTGAAGAAGT TTCATGGTCA AGGCTTGGCC AGCTCACAAA TGGTGCAGTC
CATCAAGGAA TAGTTTTACA AATTGCCGCA TCAAAAACAC TTGACTTGAA GAATTTAATA
GATGCTTGCA AAGCTTTTGG TGATTCATCA TTGCTCTTAG CTTTAGATGG CTTAACTGAT
CCTCAGAATC TTGGGGCAAT TATTCGATCT GCCGAAGCCC TCGGTGCTCA AGGATTAATC
CTTCCACAAA GACGTAGTGC AGGATTAACA GGATCCGTAG CAAAAGTTGC CGCTGGAGCT
CTGGAACATT TGCCTGTAGC AAGAGTTGTT AATTTAAATA GGTCTTTGGA GAAATTGAAA
GATGAAGGTT ATACCGTTGT TGGCCTGGCG GAGGAGGGAT CATCTACTTT ATCTGAAATC
AAATTTCAAG GTCCTTTAGT AGTAGTAGTT GGGTCTGAAG ATAAAGGAAT TTCTCTAATA
ACTAGAAGAT TATGTGATCA GTTAGTAAGA ATTCCTCTTA AGGGAGTCAC TACAAGCCTA
AATGCATCAG TTGCTACGTC TATTTTCTTA TATGAAGTTG CTAGATCCAA ATGGATGCGC
TCAATCTCTG GACAAGACCC TTCTCCTAGA TTATTGAAAC CTCAGATTTC ATCTGAAAAG
ATTAACTAA
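
The GC content reported in the record can be recomputed from a gene sequence laid out as above, in space-separated blocks of ten bases. This is a minimal sketch: the function name `gc_fraction` is hypothetical, and the code assumes the sequence contains only the letters A, C, G and T plus whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(sequence.split()).upper()  # remove block and line separators
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small worked example.
print(gc_fraction("ATGAGTTATC GCTTCAATAA"))  # 0.3
```

Passing the full gene sequence to the same function gives the value to compare against the GC content field.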
 
Protein sequence
MSYRFNKDTR NSKNSSSNKR SSNYSRQDND SKKSNFNDRR RTNNKINRHS SDQVNQYSLG 
KEFSDRSRSS NEANSNYRGS NRFERKSTNS SYRNQDSQET NNYRGSNRFE RKSTNSSYRN
QDSQETNNYR GSNRFERKST NPAYRNKNSQ RASSYKREEN NEPLSYSESF SKTLSDDLIW
GRHSTEAALV GGRAIHRIWC TSELRSTPKF FQLLKDQKAS GVLVEEVSWS RLGQLTNGAV
HQGIVLQIAA SKTLDLKNLI DACKAFGDSS LLLALDGLTD PQNLGAIIRS AEALGAQGLI
LPQRRSAGLT GSVAKVAAGA LEHLPVARVV NLNRSLEKLK DEGYTVVGLA EEGSSTLSEI
KFQGPLVVVV GSEDKGISLI TRRLCDQLVR IPLKGVTTSL NASVATSIFL YEVARSKWMR
SISGQDPSPR LLKPQISSEK IN
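
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions. It builds its own codon table, relying on the fact that NCBI translation table 11 assigns sense and stop codons the same way as the standard code and differs only in which start codons it permits. It does not handle alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block and line separators
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTTATC GCTTCAATAA AGACACAAGG"))  # MSYRFNKDTR
print(translate("ATTAACTAA"))                         # IN
```

The two calls use the first three blocks and the last line of the gene sequence. Their output matches the first ten and last two residues of the protein sequence.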