Gene P9211_10051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10051 
Symbolsir 
ID5730038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp896643 
End bp898439 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content38% 
IMG OID641285372 
Productsulfite reductase subunit beta 
Protein accessionYP_001550890 
Protein GI159903546 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID[TIGR02042] ferredoxin-sulfite reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.155233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.609504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTAAAG AAGCACTAAG TTTGCCTGAT ATCGACGATT CGCCTCTCTC TCCTTGTGTT 
GCCAATGGTC AAGACAGAAC TAAATTTGAA CAATTCAAAG CAGATAGTGA TTACTTAAAA
GAGCCCCTTA GAACAGAATT ACAAAACAAT AGTGATCATT TCAGTAACGA TGCTGTACAA
CTTTTAAAGT TCCATGGAAG TTATCAGCAG GACAATAGAG ATACCAGGCA AAAAGGAGTT
GAGAAAGACT GGCAAATGAT GCTGAGGCTA AGGAGCCCAG GCGGGCATAT ACCTCCGGAA
TTATTTGTGG CATTAGACCA TCTTTCAGAT CAGCTAGGCA ATGGAACTCT ACGAGCAACA
ACACGACAAG CATTCCAAAT GCATGGGATT AAGAAAGAAA GCCTAAAAAC TGTGATCGAA
ACAATTATTA GATCAATGGG TTCAACACTT GCGGCTTGCG GTGATATAAA CAGAAATGTA
ATGGCCCCAG CAGCGCCCTT TACATATGGT GACTACCCAG TTGCGCGACG TTTAGCAAAT
CAAATTGCAG ATGTACTTAG CCCCTCATAC GCTGAGAAAA CTTATCTCGA GCTTTGGGTT
GACGGAGATA TAAGCTATAA AATAAGCCCT TCTAATGAAG TTAAAAAAGT AAGAAAGAAT
CAATATAAGG AAGGTGTCTA TAGTGGAGAC ATTAAAGAAC CACTATACGG ATCAACTTAT
TTACCTAGAA AATTTAAATG TGCAGTTACA GTTCCTGGTG ATAATTCGGT TGACTTACTA
ACTCAAGACA TTGGTTTGGT TGTTTTTACT AATGCGAAAA ATAAACTTAT AGGCTGCAAT
GTTTACGTTG GCGGAGGAAT GGGGCGTACA CATAATAACG AAGAGACTTT TGCAAGATCT
GCTGAACCAA TTGGTTTCGT ATCTGCAGAA TATGTGCTCG AGTTGGTTCA ATCTATTCTT
GCTCTCCAAA GAGACTATGG TGACAGGAAG GTAAGAAGAC ATGCGAGAAT GAAGTATCTA
ATTAATGACA AAGGCATTGA GTGGTTTATT GACAAACTAA AAAATAATTA CTTCAAGTAC
CCTATAAAAA GTTTAAGGAA AGAACCAACC TCTAAATTAC TTGATTATCT TGGTTGGCAT
AAACAATCTC ATGATCTATA TTTTGTAGGC ATACCTCTTT TATCAGGTAG ACTTAGTGGA
GAGTATAAAA AAGCCATTTG TAAACTTGTT AATAAATTTA AATTAGATAT TCAGCTAACA
CCTAATCAAG ATCTACTACT CTGCAATATT GGAAGTTATC AAAGGTCATC TATTAAAAAG
GAATTAAGCA ATATTGGTAT TATTAAACCT GATTCACCTG AACCTATTCA AAGGCATGCG
TTAGCCTGCC CTGCTCTACC ACTTTGTGGT TTAGCTGTTA CAGAAGCTGA GCGAATCCTG
CCAGAAATTT TAGATCGTAT AAACACCCAG CTTTCTGACC TGAGGATTCA AAAGACGTTG
CTATTTCGTA TGACAGGTTG TCCAAATGGA TGCGCCAGGC CATACATGGC GGAACTAGCT
TTGGTAGGTA GTGGCTTAGA TCAATATCAA CTATGGCTTG GTGGTAGTCC GAACTTGCAA
AGGCTTGCGA AGCCTTTTAT TCAAAGGATG CCTTTGACAT CACTAGAAGA AACACTTAAG
CCTCTCTTTA TTAGTTGGAA GAATTCTAAA AAAGAAATTA GTTTCGGTGA CCATATGAAT
GAACTAGGAG ATCAAAATAT AATGGAGTTA CTATCACTAA AAGACAAGGA GCCATAA
 
Protein sequence
MTKEALSLPD IDDSPLSPCV ANGQDRTKFE QFKADSDYLK EPLRTELQNN SDHFSNDAVQ 
LLKFHGSYQQ DNRDTRQKGV EKDWQMMLRL RSPGGHIPPE LFVALDHLSD QLGNGTLRAT
TRQAFQMHGI KKESLKTVIE TIIRSMGSTL AACGDINRNV MAPAAPFTYG DYPVARRLAN
QIADVLSPSY AEKTYLELWV DGDISYKISP SNEVKKVRKN QYKEGVYSGD IKEPLYGSTY
LPRKFKCAVT VPGDNSVDLL TQDIGLVVFT NAKNKLIGCN VYVGGGMGRT HNNEETFARS
AEPIGFVSAE YVLELVQSIL ALQRDYGDRK VRRHARMKYL INDKGIEWFI DKLKNNYFKY
PIKSLRKEPT SKLLDYLGWH KQSHDLYFVG IPLLSGRLSG EYKKAICKLV NKFKLDIQLT
PNQDLLLCNI GSYQRSSIKK ELSNIGIIKP DSPEPIQRHA LACPALPLCG LAVTEAERIL
PEILDRINTQ LSDLRIQKTL LFRMTGCPNG CARPYMAELA LVGSGLDQYQ LWLGGSPNLQ
RLAKPFIQRM PLTSLEETLK PLFISWKNSK KEISFGDHMN ELGDQNIMEL LSLKDKEP