Gene P9211_00781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00781 
Symbol 
ID5731804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp83904 
End bp85163 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content34% 
IMG OID641284421 
Productputative cysteine desulfurase or selenocysteine lyase 
Protein accessionYP_001549963 
Protein GI159902619 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.674845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACTA TTAATGAAAA AAATCTAGCT AAATTAACAA GAAAGGACTT TCCTCTTTTC 
TCTAGTGAAG ATCTCAAAAA TAACCCTTTG GTTTATTTAG ACCATGCCGC TACAAGTCAA
AAGCCAAAGA AAGTTATTGA AGCTATCAGC CATTATTATA AATATGAAAA TGCAAATGTT
CACCGGGGTG CCCATCAATT AAGTGCAAAA GCAACAGAGG CTTTTGAAAA AGCACGTACA
ATAACATCCA AATTTATTAA TGCATCTTCT GAAAGAGAAA TTGTTTTTAC AAGAAATGCC
ACTGAAGCTA TAAACTTAGT CGCTCATTCC TGGGGTGATT CAGAGCTAAA AGAGGGCGAT
GAAATCTTAA TTAGCTTAAT GGAGCATCAT AGTAATATCG TACCTTGGCA ATTACTAGCT
GAAAGAAAAA AATGCAATTT AAGATATATA GGAATCACTT CAAGTGGACA ATTAGATCTT
GAAGATGCCT ATAGTAAATT AAATGAAAAA ACAAGGATAT TGAGTTTACA ACATATTAGT
AATACATTGG GATGTTGTAA TCCTATTGCT GAAATTACTC AGAAGGCACA TAGTGCTGAT
GCTTTAATTC TTGTCGATGC TTGTCAAAGT CTTGCTCATC AACCTATCGA TGTTAAAAAA
TTAAATATTG ATTTTCTAGC AGGCTCTTCT CATAAATTAT GTGGACCAAC AGGATGTGGT
TTTTTATGGG CAAAAGAAAA TCTATTGGAA AAAATGCCTC CTTTTTTAGG AGGAGGGGAA
ATGATACAAG AAGTGTCTTT AAATAAAAGT AGCTGGGCAG ATTTGCCTCA CAAATTTGAA
GCAGGTACTC CAGCTATAGG AGAGGCAATT GGAATGGGAG CCGCCTTAAC TTATCTCGAG
TCCATAGGCC TAAATAATAT ACATGCTTAT GAAAAAAAAC TTACTAAATA TCTTTTCCAG
CAATTAGAAA CTATTGAAGG TATTAATATT ATTGGTCCAA ATCCAAAAAT ACAAAGTAAT
CGAGCTCCCC TCGCAACCTT TACAATAAAT AAATTGCATT CAAATGATAT TGCATCCCTT
CTAGATACAA GCAATATATG CATTCGAAGT GGACATCATT GTTGCCAACC ACTGCACAAA
CACTATGGAA TTAGTTCATC AGCTAGAGCA AGTCTTAGTT TTACATCAAC AATAGATGAA
ATAGATACTT TTGTTTCTCA ACTAATATCC AGCATAAATT TCCTACAAGA AAATTCTTAG
 
Protein sequence
MMTINEKNLA KLTRKDFPLF SSEDLKNNPL VYLDHAATSQ KPKKVIEAIS HYYKYENANV 
HRGAHQLSAK ATEAFEKART ITSKFINASS EREIVFTRNA TEAINLVAHS WGDSELKEGD
EILISLMEHH SNIVPWQLLA ERKKCNLRYI GITSSGQLDL EDAYSKLNEK TRILSLQHIS
NTLGCCNPIA EITQKAHSAD ALILVDACQS LAHQPIDVKK LNIDFLAGSS HKLCGPTGCG
FLWAKENLLE KMPPFLGGGE MIQEVSLNKS SWADLPHKFE AGTPAIGEAI GMGAALTYLE
SIGLNNIHAY EKKLTKYLFQ QLETIEGINI IGPNPKIQSN RAPLATFTIN KLHSNDIASL
LDTSNICIRS GHHCCQPLHK HYGISSSARA SLSFTSTIDE IDTFVSQLIS SINFLQENS