Gene P9211_02111 details

Gene Information

Locus tag: P9211_02111
Symbol: rluD
ID: 5731177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 203173
End bp: 204156
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 40%
IMG OID: 641284555
Product: putative pseudouridylate synthase specific to ribosomal large subunit
Protein accession: YP_001550096
Protein GI: 159902752
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
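
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon (both are assumptions, and the function names are illustrative rather than part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(203173, 204156)
print(length, protein_length_aa(length))  # 984 327
```

Under those assumptions the computed values match the Gene Length (984 bp) and Protein Length (327 aa) fields.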


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATA GTACTAAGCA ATGCTTTGGC GAAGGAGAAG GAGAACTAGT AATCCTCCAT 
TATCCCAAGC CACTTCCAAT GAGACTTGAT CGCTGGCTAG TGAGTCAAAG GTCAGAACAA
AGTCGTGCAC GTATTCAGAA ATTTATTAAT GAAGGATTAG TCAAAGTGAA TGAAAAAATA
GGTAAGGCAA AAACACCTCT TAGGCAAGGA GACGAAATTA AACTATGGCT ACCACCACCC
GAGCCACTCC CATATCTGAA ACCAGAAAAA ATTAGCTTGG ACATATTATA TGAAGACAAA
TATTTAATCG TAATAAATAA GCAAGCAGGC ATAGCCGTCC ATCCTGCACC AGGAAATAAA
TTCGGTACGT TAGTCAATGG CTTACTCCAT CATTGCAATG ATCTCCCAGG AATTAGTGGA
AAACTTAGGC CAGGAATTGT ACATCGTCTA GATAAAGACA CAACAGGCTG CATTGTGGTT
GCAAAGACAC AAGAAGCATT AGTGAATCTA CAGAAACAAA TTCAGACACG AGTCGCATCT
AGGATTTACC TTGCTATTGT TCATGGCGTA CCCAAGGGAG ACAAGGGGAC AATCATTGGA
GCAATAGGAC GGCATCCAGT CGATCGAAAA AAATACGCCG TTGTAACCAA TGAGATGGCC
AGATATGCTT GTACTCATTG GGAGCTTAAA GAAAGACTTG GCGATTACTC ACTCCTAAGC
TTCAAATTAG ATACAGGGAG AACTCATCAA ATCCGTGTAC ATTGTGCCCA TTTTGGTCAC
CCCATCCTTG GAGACCAAAC ATATAGTCGC TGCAAAAAAT TGCCTAATGG AGTCTCCAGT
CAAGTTTTAC ATGCTGTTAA ACTTGGTCTA AAGCATCCAC ATAACAATGA AACAATGCTT
TTTGAAGCAC CATTGCCCAA TACATTTAAA AATGTTTTGA CTCGATTACA AAAGAGACTA
ATTACTAATG ACAATAATTC ATAG
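
The GC content field reported for this gene can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C characters over the full sequence length; `gc_content` is a hypothetical helper, and the record does not state how the database rounds the value:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C characters in a sequence.

    Whitespace (such as the 10-base grouping used in this record) and
    letter case are ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Example on the first line of the gene sequence only; pass the whole
# sequence to obtain a value comparable to the GC content field.
first_line = "ATGGAAGATA GTACTAAGCA ATGCTTTGGC GAAGGAGAAG GAGAACTAGT AATCCTCCAT"
print(round(gc_content(first_line), 1))
```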
 
Protein sequence
MEDSTKQCFG EGEGELVILH YPKPLPMRLD RWLVSQRSEQ SRARIQKFIN EGLVKVNEKI 
GKAKTPLRQG DEIKLWLPPP EPLPYLKPEK ISLDILYEDK YLIVINKQAG IAVHPAPGNK
FGTLVNGLLH HCNDLPGISG KLRPGIVHRL DKDTTGCIVV AKTQEALVNL QKQIQTRVAS
RIYLAIVHGV PKGDKGTIIG AIGRHPVDRK KYAVVTNEMA RYACTHWELK ERLGDYSLLS
FKLDTGRTHQ IRVHCAHFGH PILGDQTYSR CKKLPNGVSS QVLHAVKLGL KHPHNNETML
FEAPLPNTFK NVLTRLQKRL ITNDNNS
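
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. The following is a minimal sketch of that translation, assuming the sequence is in frame from its first base; the names `CODON_TABLE` and `translate` are illustrative. Table 11 assigns the same amino acids as the standard code and differs in its set of permitted start codons, which this sketch does not handle specially (the gene here starts with ATG, so that does not matter for this record):

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAAGATA GTACTAAGCA ATGCTTTGGC"))  # MEDSTKQCFG
```

The printed residues match the first block of the protein sequence above; passing the full gene sequence translates the whole coding region up to the terminal TAG stop codon.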