Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07511 |
Symbol | met17 |
ID | 5730680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 655823 |
End bp | 657151 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285114 |
Product | putative O-Acetyl homoserine sulfhydrylase |
Protein accession | YP_001550636 |
Protein GI | 159903292 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.636654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0403498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTCTC ATCAATTCGA GACTCTTCAG TTACATGCTG GTCAAACTCC CGATTCTGTA ACTAATTCAA GGGCTGTTCC TATATACCAG ACCAGCTCTT ATGTTTTTAA TGACGCAGAG CATGGAGCTA ATTTATTTGG TCTGAAAGAG TTTGGAAATA TTTATACGCG CTTGATGAAC CCCACTACAG ATGTTTTTGA AAAACGGGTT GCCTCTCTCG AGGGTGGGGT AGCTGCTCTA GCAACCGCTT CAGGCCAATC AGCACAATTT CTAGCAATTA CGAATTGCAT GCAAGCTGGA GATAACTTTG TTTCTACTTC TTTTTTATAT GGAGGAACAT ACAATCAGTT TAAGGTTCAA TTTCCCCGAT TAGGTATCAA GGTTAAATTT GCAGAAGGAG ATGACATTAG TAGTTTTGCT TCTCAGATTG ACTCAAACAC TAAAGCTATA TATGTCGAAT CTATGGGTAA TCCAAGATTT AATATCCCAG ATTTCAGAGC TCTTTCGGAC TTAGCCAAGC AAAATGACAT TCCTTTAATC GTTGATAATA CCCTTGGTGC AGCGGGGGCT CTGATTAGAC CTTTGGAGCA TGGTGCTGAT GTTGTTGTTG AAAGTGCTAC AAAATGGATA GGTGGTCACG GCACAAGCCT TGGTGGAGTA ATTGTTGACG GAGGTACTTT TGATTGGGGC AATGGTAAGT TCCCTTTAAT GAGTCAACCA AGTGCTGCAT ATCATGGCCT AATTCATTGG GATGCATTTG GTTTTGGCAG TGATATTTGC TCCATGCTTG GAGTCCCTGA GGGAAGAAAT ATTGCATTCG CTTTGCGTGC AAGACTTGAA TGCTTAAGAG ATTGGGGTCC TGCTCTTAGC CCTTTTAACT CTTTCCTGTT ACTACAAGGC TTAGAAACTT TAAGTTTAAG AATAGAGAGA CATTGTTCTA ATTCAATGGC TCTAGCTAAT TGGCTCAATG ATCATCCGAA AGTTTCTAAT GTTAATTATC CAGGATTAGC TTCTGACCCT TATCACCAAA CTGCTAAGAA GTATCTCTCT GGAAGAGGAA TGGGATGCAT GTTGATGTTT TCTTTGAAGG GAGGTTTTGA TGATGCCGTA ACTTTTATTA ATTCATTAAA GCTTGCAAGT CACCTTGCAA ATGTAGGAGA TTCTAAAACA TTAGTTATTC ATCCTGCTTC CACAACTCAC CAACAGCTTT CACTGGAAGA GCAAGAATCA GCAGGAGTTA CTCCTACTAT GGTAAGAGTT TCAGTTGGGT TAGAGCATAT TGATGACATA ATGGCTGATT TTGATCAAGC TTTGTCGCAG ATTAATTAA
|
Protein sequence | MNSHQFETLQ LHAGQTPDSV TNSRAVPIYQ TSSYVFNDAE HGANLFGLKE FGNIYTRLMN PTTDVFEKRV ASLEGGVAAL ATASGQSAQF LAITNCMQAG DNFVSTSFLY GGTYNQFKVQ FPRLGIKVKF AEGDDISSFA SQIDSNTKAI YVESMGNPRF NIPDFRALSD LAKQNDIPLI VDNTLGAAGA LIRPLEHGAD VVVESATKWI GGHGTSLGGV IVDGGTFDWG NGKFPLMSQP SAAYHGLIHW DAFGFGSDIC SMLGVPEGRN IAFALRARLE CLRDWGPALS PFNSFLLLQG LETLSLRIER HCSNSMALAN WLNDHPKVSN VNYPGLASDP YHQTAKKYLS GRGMGCMLMF SLKGGFDDAV TFINSLKLAS HLANVGDSKT LVIHPASTTH QQLSLEEQES AGVTPTMVRV SVGLEHIDDI MADFDQALSQ IN
|
| |