Gene Information |
Locus tag | P9211_04121 |
Symbol | |
ID | 5731289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 387831 |
End bp | 389006 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284769 |
Product | putative L-cysteine/cystine lyase |
Protein accession | YP_001550297 |
Protein GI | 159902953 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
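The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 387831
end_bp = 389006

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1176 0 391
```

Under these assumptions the result agrees with the listed Gene Length (1176 bp) and Protein Length (391 aa).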
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0272358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCTC TGGCCAACAA AAGTTATTTC AATTATGGTG GACAAGGGCC ATTGCCTCAA CCATCTCTAG AAGCAATAAT AACTAGTTGG CAAAAGATAC AAGAGTTGGG TCCTTTTACC AATAAGGTCT GGCCATATGT CAATGATGAG ATAGAGGCTA CAAGAAATAT GCTTGCAGAA ATTTGTGGTG TATCTAAGAG ACGTATTGGA TTTACAGAAA ATGTAACTAG TGGATGTGTT TTGCCCCTAT GGGGATTAAC TTTTTCGGAA GGGGACAGGA TTCTAATAAG TGATTGCGAG CATCCAGGTA TTGTTTCTGC ATGCAAAGAA CTAGCTCGTC GAAAAAGTCT CTATATAGAT ATATTCCCAG TTCAGCACCT CCACCAAGGT GTCAATAATA GTCATGAGCT AAACGACCAG TTGTTAAAAG GTTTGGATTT TGCTTTAAAT CCAAAGACAA GGCTAGTGGT TCTATCTCAT CTACTCTGGA ATACAGGTGT AATAACACCA ATTCCTTCTG TAGCAGAAAA GCTTAACAAG CATACAAACA AGCCTTTTCT TCTAGTGGAT GCAGCCCAGA GTTTTGGACA ATTGCCTATT GCAGAAGCAG CCTCTCTGGC AGATATTTAT GCATTCACTG GTCACAAGTG GGCTTGTGGG CCAGAGGGGC TAGGAGCAGT TGCCATTTCT CCTAGGGTTC TCGGCGCATC AAATCCAACT CTCATTGGAT GGAGAAGTTT AAAAAGCGAA GGAAGTATTT ATGAAAATAA TCCCAACCCT TTTCATGAAG ATGCTCGTCG TTTTGAAGTT GCTACATCAT GCATTCCATT ATTTGCGGGT TTAAGATCAT CACTGAAACT TATGGAAAAA GAAGGAACTG TTACCCAAAG ATTGCATCAG ATCCAAAGGA TGAGCAAAGC ACTTTGGTCA CAGCTCAAAG GGATTAATGG CGTAAATCCT ATTCTTGAGG GGCCTCCAGC GTCAGGACTT ATTAGCTTTT CTGTAGCCTC AAAATATTCA TCCAAGGAAA TAGTTAAAAT TCTTGGGAGA CAAAACCTTT GGATAAGGCT ACTTGAGGAT CCTACATGGC TTCGTGCTTG TGTTCATATA ACAAGCAATA CTGATGAGAT CAATAAACTG GTTAAATCTC TAAATGATTT AACTAAAGAG ATCTAA
Protein sequence | MPALANKSYF NYGGQGPLPQ PSLEAIITSW QKIQELGPFT NKVWPYVNDE IEATRNMLAE ICGVSKRRIG FTENVTSGCV LPLWGLTFSE GDRILISDCE HPGIVSACKE LARRKSLYID IFPVQHLHQG VNNSHELNDQ LLKGLDFALN PKTRLVVLSH LLWNTGVITP IPSVAEKLNK HTNKPFLLVD AAQSFGQLPI AEAASLADIY AFTGHKWACG PEGLGAVAIS PRVLGASNPT LIGWRSLKSE GSIYENNPNP FHEDARRFEV ATSCIPLFAG LRSSLKLMEK EGTVTQRLHQ IQRMSKALWS QLKGINGVNP ILEGPPASGL ISFSVASKYS SKEIVKILGR QNLWIRLLED PTWLRACVHI TSNTDEINKL VKSLNDLTKE I
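The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below is a minimal, assumption-laden illustration: it builds the codon table by hand (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons), and it translates only the first 30 and last 9 bases copied from the record. The names `translate`, `first_30`, and `last_9` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
# These assignments are shared by the standard code and translation table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used in the record
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the gene sequence in the record.
first_30 = "ATGCCAGCTC TGGCCAACAA AAGTTATTTC"
last_9 = "GAGATCTAA"

print(translate(first_30))  # MPALANKSYF
print(translate(last_9))    # EI*
```

The first ten residues match the start of the listed protein sequence, and the final codon translates to a stop, which is consistent with the protein ending in `...KE I`.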