Gene P9515_00791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_00791 
Symbol 
ID4719956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp83993 
End bp85270 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content32% 
IMG OID640079741 
Productputative cysteine desulfurase or selenocysteine lyase 
Protein accessionYP_001010395 
Protein GI123965314 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAA TTAAAAATTT CCCGCAAATA GTCAAAAAAA ACTTTAATTT GGTAGATAAA 
AAAGATTTTC CTTTATTAGA GAATAATTTT AAAAGCAAAA ATAAAATTAT TTATCTAGAT
CACGCTGCAA CAACACAAAA GCCAAGCCAA GTTCTAGAAA AAATTGAAGA ATATTATAAA
AAATTCAATG CAAACGTTCA TAGGGGTGCT CATCAATTAA GTGCGAAAGC AACAGAAGAA
TTTGAGAATT CAAGATCTTT AGTTGCTAAA TACATTAATG CAAATTCAAC GCAAGAAATA
ATCTTTACAA GAAATGCAAC TGAAGCTATC AATCTAGTTG CAAGATCATG GGGAGAATTC
ACGTTAAAAG AGAATGATGA AATTATTTTG TCAGTAATGG AGCATCATAG TAATATTGTT
CCTTGGCAAA TGGTAGCTGC TAAAAACCAA TGCAAACTGA AATTTGTCGG AATTGATCAA
GACGGAAAAT TAGATATAAA TGATTTCAAG TCCAAATTAA CAAATAAAAC AAAACTAGTC
AGTTTATTGC ACATAAGCAA TACCTTAGGT TGCTGTAATC CAATAAAAGA AATAACCAAA
TTAGCTAAGG TTAAGGGTTC TTTAGTTCTG CTTGATGCTT GTCAAAGTTT AGCTCATCAA
AAGTTAGATA TTGATGAATT AGGTATAGAT TTTTTAGCAG GTTCAGGACA TAAATTATGT
GGACCAACAG GTATAGGTTT CTTATGGGCA AAGAAAGAGA TATTAGAGGA AATTCCTCCT
TTTTTTGGTG GAGGTGAAAT GATTCAAGAT GTTTTTGAAG ATACAAGTAC TTGGGCAGAG
CTTCCTCACA AATTTGAAGC AGGTACTCCT GCTATTGCAG AGGCTATAGG ATTAGCGGAA
GCAATTAAAT ATATCAACAA TATTGGCTTA GATCGTATCA GTGAATACGA AAAGCAAATT
ACAAAATATT TATTTGAACA ACTAAGTCAA ATTAAAGATC TTGTAATAAT AGGACCTCCA
CCAAAGATAG ACCCTAATAG AGCTTCACTA GCAACGTTTT ATATAAAGGG AATACATTCA
AACGATATCG CTGAGATTCT GGATTCAAAA GGTATTTGTA TAAGAAGCGG ACATCACTGT
TGTCAACCAT TGCATAGACA TATCGGAGTT AATTCAACAG CAAGAGTTAG CATGAACTTC
ACAACTACGA AAGACGATAT AAATGCTTTT ATCGAAAAAC TGAAAGAAAC TATTAGTTTT
TTAAGACTTA ATTCTTAA
 
Protein sequence
METIKNFPQI VKKNFNLVDK KDFPLLENNF KSKNKIIYLD HAATTQKPSQ VLEKIEEYYK 
KFNANVHRGA HQLSAKATEE FENSRSLVAK YINANSTQEI IFTRNATEAI NLVARSWGEF
TLKENDEIIL SVMEHHSNIV PWQMVAAKNQ CKLKFVGIDQ DGKLDINDFK SKLTNKTKLV
SLLHISNTLG CCNPIKEITK LAKVKGSLVL LDACQSLAHQ KLDIDELGID FLAGSGHKLC
GPTGIGFLWA KKEILEEIPP FFGGGEMIQD VFEDTSTWAE LPHKFEAGTP AIAEAIGLAE
AIKYINNIGL DRISEYEKQI TKYLFEQLSQ IKDLVIIGPP PKIDPNRASL ATFYIKGIHS
NDIAEILDSK GICIRSGHHC CQPLHRHIGV NSTARVSMNF TTTKDDINAF IEKLKETISF
LRLNS