Gene P9211_04121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04121 
Symbol 
ID5731289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp387831 
End bp389006 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content40% 
IMG OID641284769 
Productputative L-cysteine/cystine lyase 
Protein accessionYP_001550297 
Protein GI159902953 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0272358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCTC TGGCCAACAA AAGTTATTTC AATTATGGTG GACAAGGGCC ATTGCCTCAA 
CCATCTCTAG AAGCAATAAT AACTAGTTGG CAAAAGATAC AAGAGTTGGG TCCTTTTACC
AATAAGGTCT GGCCATATGT CAATGATGAG ATAGAGGCTA CAAGAAATAT GCTTGCAGAA
ATTTGTGGTG TATCTAAGAG ACGTATTGGA TTTACAGAAA ATGTAACTAG TGGATGTGTT
TTGCCCCTAT GGGGATTAAC TTTTTCGGAA GGGGACAGGA TTCTAATAAG TGATTGCGAG
CATCCAGGTA TTGTTTCTGC ATGCAAAGAA CTAGCTCGTC GAAAAAGTCT CTATATAGAT
ATATTCCCAG TTCAGCACCT CCACCAAGGT GTCAATAATA GTCATGAGCT AAACGACCAG
TTGTTAAAAG GTTTGGATTT TGCTTTAAAT CCAAAGACAA GGCTAGTGGT TCTATCTCAT
CTACTCTGGA ATACAGGTGT AATAACACCA ATTCCTTCTG TAGCAGAAAA GCTTAACAAG
CATACAAACA AGCCTTTTCT TCTAGTGGAT GCAGCCCAGA GTTTTGGACA ATTGCCTATT
GCAGAAGCAG CCTCTCTGGC AGATATTTAT GCATTCACTG GTCACAAGTG GGCTTGTGGG
CCAGAGGGGC TAGGAGCAGT TGCCATTTCT CCTAGGGTTC TCGGCGCATC AAATCCAACT
CTCATTGGAT GGAGAAGTTT AAAAAGCGAA GGAAGTATTT ATGAAAATAA TCCCAACCCT
TTTCATGAAG ATGCTCGTCG TTTTGAAGTT GCTACATCAT GCATTCCATT ATTTGCGGGT
TTAAGATCAT CACTGAAACT TATGGAAAAA GAAGGAACTG TTACCCAAAG ATTGCATCAG
ATCCAAAGGA TGAGCAAAGC ACTTTGGTCA CAGCTCAAAG GGATTAATGG CGTAAATCCT
ATTCTTGAGG GGCCTCCAGC GTCAGGACTT ATTAGCTTTT CTGTAGCCTC AAAATATTCA
TCCAAGGAAA TAGTTAAAAT TCTTGGGAGA CAAAACCTTT GGATAAGGCT ACTTGAGGAT
CCTACATGGC TTCGTGCTTG TGTTCATATA ACAAGCAATA CTGATGAGAT CAATAAACTG
GTTAAATCTC TAAATGATTT AACTAAAGAG ATCTAA
 
Protein sequence
MPALANKSYF NYGGQGPLPQ PSLEAIITSW QKIQELGPFT NKVWPYVNDE IEATRNMLAE 
ICGVSKRRIG FTENVTSGCV LPLWGLTFSE GDRILISDCE HPGIVSACKE LARRKSLYID
IFPVQHLHQG VNNSHELNDQ LLKGLDFALN PKTRLVVLSH LLWNTGVITP IPSVAEKLNK
HTNKPFLLVD AAQSFGQLPI AEAASLADIY AFTGHKWACG PEGLGAVAIS PRVLGASNPT
LIGWRSLKSE GSIYENNPNP FHEDARRFEV ATSCIPLFAG LRSSLKLMEK EGTVTQRLHQ
IQRMSKALWS QLKGINGVNP ILEGPPASGL ISFSVASKYS SKEIVKILGR QNLWIRLLED
PTWLRACVHI TSNTDEINKL VKSLNDLTKE I