Gene NATL1_04801 details


Gene Information

Locus tag: NATL1_04801
Symbol: sun
ID: 4781299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 437072
End bp: 438418
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 38%
IMG OID: 640083757
Product: Sun protein (Fmu protein)
Protein accession: YP_001014309
Protein GI: 124025193
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
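
The Start bp, End bp, Gene Length and Protein Length fields above agree with each other, and a few lines of arithmetic can check this. The following is a minimal sketch. It assumes the coordinates are 1-based and inclusive, which the record does not state, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(437072, 438418)
print(length)                  # 1347
print(protein_length(length))  # 448
```

Both values match the Gene Length (1347 bp) and Protein Length (448 aa) reported for this gene.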


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.273951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTTTTGA GTGAAAACCT GTCATCCATA AAAGGCTTAG ATGCCAGAAA AGCTGCTTGG 
GAGGTTATCC AAGCCGTAGG TGGAGGTTCA TTCGCAGATG TTGCTTTGGA AAGGATTTTT
AATCTTTATT CCTTTAAGTC GATAGATAAA GCTTTGATAA CTGAACTTTC TTATGGTGCA
ATTCGCCAAA GATATTACTT GGATTGTTGG ATTGATCATT TAGGGAAAGT ACCCGCTAAA
AAACAACCTC CTTTATTGAG ATGGCTATTG CATCTTGGGC TTTATCAGGT TCTAAAAATG
AAGAGAATAC CTCCAGCTGC TGCAATTAAC ACAACTGTAG AGCTTGCGAA AACTCATCAT
TTAAGAAAGC TAGCCCCCGT TGTTAATGGA ATCTTGCGAT CTGCTCTTAG AAGCAAAGAG
AGAGGACTCT TGTTGCCTAA ATCGAATAAT CCAAGTTTGG AATTGGCAAA AAACGAGTCA
CTTCCTGTTT GGTTTGCAGA GGAATTGATT GCTTGGAAGG GAGTCGAACA TGCTCAGCAG
ATTGCTAAAG CATTCAACAG CGTTAGTCCT ATTGATATAA GAGTGAATAA ATTGCGTGCA
GATTTAAAAG ATGTAAAAGA ACTTTTTGAT ACCTGCGGTA TTCAAAATCA ATTAATCCCA
AACTGTCCTT CCGGATTGGA GGTACGAGCT GGTATAGGTG AACCTAGACA ATGGCCTGGT
TATGAAGAAG GTAAATGGAG TGTTCAAGAT AGATCTTCAC AGTTAATTGC CCCATCATTA
GGACCTCTAC CTGGAGAAAA GATTCTTGAT GCTTGTGCTG CACCAGGCGG AAAATCAACA
CATATTGCTG AATTAATTAA TAATGAGGGC AATCTCTGGT CTGTTGATCG ATCATCCAGA
AGATCTAAAA AAATATTAGC TAACTCAGAG AGGCTTGGGA CTAAATGCTT GCAAATATTG
GTTGCTGATT CTAATGAGCT ATTACTCAAA AAGCCAGATT GGAAAGGTTT TTTTGATCGT
ATATTAATAG ATGCTCCATG CTCAGGATTG GGTACTCTTG CTCGCCACCC TGATGCAAGA
TGGAGGATGA ATCAAGATAA TATTCAGCAG CTTGTCGCTG TTCAAAGTCA GTTGCTTAAC
TCGTTAGCGC CTTTATTGAA AAATGGAGGG AAGTTGGTTT ATTCCACTTG TACTATTCAC
CCTGAAGAAA ATTCTCATCA GATAAAAAAT TTTCTTCAAT CTAAGTCTGA GTTTTTATTG
GAATATGAAA AACAGATCTG GCCTGGAGAG GGAGATAATG GAGATGGTTT TTATATTGCT
GTTTTAAATA AATTAAAAAA TCAATAA
 
Protein sequence
MLLSENLSSI KGLDARKAAW EVIQAVGGGS FADVALERIF NLYSFKSIDK ALITELSYGA 
IRQRYYLDCW IDHLGKVPAK KQPPLLRWLL HLGLYQVLKM KRIPPAAAIN TTVELAKTHH
LRKLAPVVNG ILRSALRSKE RGLLLPKSNN PSLELAKNES LPVWFAEELI AWKGVEHAQQ
IAKAFNSVSP IDIRVNKLRA DLKDVKELFD TCGIQNQLIP NCPSGLEVRA GIGEPRQWPG
YEEGKWSVQD RSSQLIAPSL GPLPGEKILD ACAAPGGKST HIAELINNEG NLWSVDRSSR
RSKKILANSE RLGTKCLQIL VADSNELLLK KPDWKGFFDR ILIDAPCSGL GTLARHPDAR
WRMNQDNIQQ LVAVQSQLLN SLAPLLKNGG KLVYSTCTIH PEENSHQIKN FLQSKSEFLL
EYEKQIWPGE GDNGDGFYIA VLNKLKNQ
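
The gene sequence begins with TTG, while the protein sequence begins with M. This fits translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is an alternative start codon and the initiating codon is read as methionine. The sketch below shows how the protein sequence can be derived from the gene sequence under that table. It is an illustration rather than the method used to produce this record: `translate_cds` is a hypothetical helper, and it assumes the sequence is given in coding orientation with no ambiguous bases.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11 (base order TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the start codon as M
    and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGCTTTTGA GTGAAAACCT GTCATCCATA"))  # MLLSENLSSI
```

The printed residues match the first ten residues of the protein sequence listed above.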