Gene NATL1_16671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16671 
Symbol 
ID4780004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1359604 
End bp1360680 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content37% 
IMG OID640084950 
Productputative tetrapyrrole methylase family protein 
Protein accessionYP_001015489 
Protein GI124026373 
COG category[R] General function prediction only 
COG ID[COG0313] Predicted methyltransferases 
TIGRFAM ID[TIGR00096] probable S-adenosylmethionine-dependent methyltransferase, YraL family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.774754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTG CAGATACTGG GCCGTCACCG CCTTCCTCTA CAGTTAATGA TTTAGGGAAG 
CCATATGGAG GTTCATCACC TCTTGCATAT GCCATCAAAA CATCAGCGGA AGCCCAGCTA
AGCTTTCTAA GCACATCCAA TAAAGTCGGT AATTCGACCT CTTTTGGCAC AACAAGATCA
ATAGACATAA TGGATTTAAT TGATGAAAAT CAACTTCAAC AAGAAAGATC AGAACCTTGT
CCAGGGACTC TTTACCTAGT GGGTACGCCA ATAGGTAACT TAGGCGATTT ATCACCAAGG
GCAAAGTCAA TACTTAAGAA TGTTTCACGA ATAGCTTGTG AGGATACACG TAGAAGCGGA
CAATTATTGA AAATAATAAA ATCTGAAGTT CCACTACTGA GCTATCACAA ACATAATTTC
AAAAGTCGTC AATCTCAATT ATTAGAAATC CTAGAATGTG GTGGGAGTCT TGCCTTAATT
AGTGATGCTG GCTTGCCAGG TATAAATGAC CCAGGAGAAG AACTTGTTCA TGCTGCAAGA
TCTAATAGTT ATGAAGTGAT TTGCGTACCT GGACCTTGTG CAGCAACAAC AGCGTTAGTA
ATAAGTGGAT TGCCCTCAGA GAGATTTTGC TTTGAAGGGT TTCTACCAAA AAAACAAAGC
CTTAGGAAGA AACGTTTGGA AGATATTTCT CAAGAACAAA GAACGACTGT AATCTATGAA
TCACCTCATC AATTAATCAA ACTTTTAGAA GATTTATCTA TTTCATGCGG AAAAGAACGC
CCTATTCAAA TCGCGAGAGA ACTTACAAAA AGATATGAAG AGTCAATAGG TAAGACAATT
GAGGAGGTAA CAAAATATTT TATTACAAAT AAGCCAAAAG GAGAATTCAC CATAGTGCTA
GGAGGAAATA ATAATAAGCA ACAGAATAAA GTAAGTGAAT CAGAAGCCTT AAATAAGCTT
AATACATTAA TAAATCAAGG AGAAAAATCT AATATTGCAG CGCAAAAAGT CGCAGAAGAA
ACAGGATATA AAAAAAAATG GCTTTACTCC AAATTACACA AGAGGCTTGA CAAATAA
 
Protein sequence
MSAADTGPSP PSSTVNDLGK PYGGSSPLAY AIKTSAEAQL SFLSTSNKVG NSTSFGTTRS 
IDIMDLIDEN QLQQERSEPC PGTLYLVGTP IGNLGDLSPR AKSILKNVSR IACEDTRRSG
QLLKIIKSEV PLLSYHKHNF KSRQSQLLEI LECGGSLALI SDAGLPGIND PGEELVHAAR
SNSYEVICVP GPCAATTALV ISGLPSERFC FEGFLPKKQS LRKKRLEDIS QEQRTTVIYE
SPHQLIKLLE DLSISCGKER PIQIARELTK RYEESIGKTI EEVTKYFITN KPKGEFTIVL
GGNNNKQQNK VSESEALNKL NTLINQGEKS NIAAQKVAEE TGYKKKWLYS KLHKRLDK