Gene A9601_04811 details


Gene Information

Locus tag: A9601_04811
Symbol: sun
ID: 4717179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 416073
End bp: 417389
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 29%
IMG OID: 640078193
Product: Sun protein (Fmu protein)
Protein accession: YP_001008876
Protein GI: 123968018
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
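
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(416073, 417389)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```

Both results agree with the Gene Length and Protein Length fields listed in the record.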


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCATAG GATATTTACA AAGAAAGGCA GCTTGGGAAA TTTTATTAAA AGTTAGTTCG 
GGTGATTTTT CTGATCATGC TCTTGAAAAG GTTTTAAAAA ATTATCAATT TAATCCTCTT
GATATAGCTT TTATTACGGA ATTATCTTTT GGATGCATAA GGTATAGAAA ATTTCTTGAT
CTTTGGACGG ATCATACATC AAAAATTACT CATAAAAAGC AGCCTCCAAA GTTAAGATGG
CTTCTACATA TAGGTTTATA TCAACTATTG AAAATGGATA AAATTCCATT TCCTGCTGCT
ATTACTACGA CTGTAGAAGT AGCTAAAAAA ACAGATTTAA ATGGTTTAGC GGGAACTGTA
AATGCGATAT TGAGAAATGC ATCAAGAAAA TTAGAACAAA AAATATTTCC GGAATTATCA
TCTGATAGAA AAGAAAGAAT TTCATATCTT GAATCATTCC CATTATGGCT TGTGAAGGAT
CTTTATAAAT GGGTCGGTAA TAGTGAGGGT GAAAATATCA TTAGGGCATT TAATAAAAAA
CCATCAATTG ATTTGAGAAT TAACCAATTA AAAACTAATT TAGATAACTT TTTGAAAGTA
CTTCATGAAA ATAAAATTGA TGCTGAAATT ATTAATGATT TAAATAATGG AATTACTTTA
AAATCTAATC CAAGATCTAT AAAAAATTTA CCAGGATATA GTGATGGGCT TTGGACAATT
CAAGATAGAT CTTCTCAATG GATAGCACCT CTCTTAAATC CAAAAGAAGG TGAAAAGATT
TTAGATGCTT GTGCAGCTCC AGGAAGTAAG TCTACCCACC TTGCAGAATT AACAAATGAT
AGTGCTGAAA TAATTGCCGT AGATAGATCA GCAAAAAGAT TAAAAATACT GCAATCAAAT
TTAGAAAGGT TAAATTTGAA ATCTGTTAAT ACCCTTAAGG CTGATGCTAC GAGGTTGATT
GAATTAAATC CTAAGTTTAT TTCTTATTTT GATAAGATTT TATTAGATGC TCCATGTTCA
GGCATTGGAA CTCTTTCCAG GAATCCAGAT TCTAGATGGT CTTTAAGTAA AGAAAAAATA
AAATCTTTAA CTTTATTACA GGGAAAACTT TTGGAGAGTA TTTTACCTCT TTTGAAAAAA
GATGGCACTT TAGTTTATTC AACTTGTACT ATTTGTCCCG ATGAAAATAA TCTATTAATT
GAACGATTTA TTGAAAAAAA CAAAACTTTA AAATTGGTTA GCCAAAAGCA AATTTTACCT
AGCTTGGATT ATCCTGGTGA TGGATTTTAT TCTGCAATAA TTTCTTATAA ATCTTAA
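
The GC content reported for this gene can be recomputed from the sequence block above. This is a minimal sketch, assuming the sequence is pasted in as text with its space-separated 10-base groups and line breaks; `gc_percent` is a hypothetical helper name, and how the source database rounds the value is not stated.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(seq_text.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Small illustration on a made-up 8-base string, not on the gene itself.
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field in the record.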
 
Protein sequence
MSIGYLQRKA AWEILLKVSS GDFSDHALEK VLKNYQFNPL DIAFITELSF GCIRYRKFLD 
LWTDHTSKIT HKKQPPKLRW LLHIGLYQLL KMDKIPFPAA ITTTVEVAKK TDLNGLAGTV
NAILRNASRK LEQKIFPELS SDRKERISYL ESFPLWLVKD LYKWVGNSEG ENIIRAFNKK
PSIDLRINQL KTNLDNFLKV LHENKIDAEI INDLNNGITL KSNPRSIKNL PGYSDGLWTI
QDRSSQWIAP LLNPKEGEKI LDACAAPGSK STHLAELTND SAEIIAVDRS AKRLKILQSN
LERLNLKSVN TLKADATRLI ELNPKFISYF DKILLDAPCS GIGTLSRNPD SRWSLSKEKI
KSLTLLQGKL LESILPLLKK DGTLVYSTCT ICPDENNLLI ERFIEKNKTL KLVSQKQILP
SLDYPGDGFY SAIISYKS
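
The gene begins with TTG while the protein begins with M, which is consistent with translation table 11, where several alternative start codons are read as methionine at the first position. The sketch below illustrates this under stated assumptions: the codon table is built from the NCBI-style compact amino acid string, the start codon set is the one listed for table 11 in the NCBI genetic code documentation, and `translate_cds` is a hypothetical helper rather than a database or library function.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq_text: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGCATAG GATATTTACA AAGAAAGGCA"))  # MSIGYLQRKA
```

The output matches the first ten residues of the protein sequence listed above.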