Gene Information |
Locus tag | P9211_01931 |
Symbol | menC |
ID | 5730900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 184830 |
End bp | 185798 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641284537 |
Product | putative O-succinylbenzoate synthase |
Protein accession | YP_001550078 |
Protein GI | 159902734 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
|
|
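The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 184830
END_BP = 185798
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 185798 - 184830 + 1 gives 969 bp, and 969 / 3 - 1 gives 322 aa, which agrees with the listed values.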
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.925844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTAT TCCTACAAAT TAAGCCTTTT GCTTTCCAAC TATTGATGCC TCTCAGAACT TCTCAGGGGA TTCTCCGTGA CAAAAAAGGT TTTCTTATAC ATCTACAGAA TGAAGACAAA GAATCTGGCT GGGGAGAAGT AGCACCTATG AAAAACGCAG AATTAAACTT GTGCGCAGCA ATACTTAAGA GTCTTGGGAG CACTCCCTCT AGGGAAAAAC TAGAAAGAAA TTTAGCAACT TGGCCAGGGT CATTAAGCTT TGGTATAGGC GCAGCACTTG CAGAATTAGA TTCTCTTGTG GGACATAAAT CAAGCCAAGA TTGGTTGAAA ACCTCTCAAT CAGCACTTCT CTTACCTACA GATAAATCTC CTGTCCTATT TCTTGAATCA ATATTAAAAG ACTCACAGAT AAAGAATGAG AACCTAACTA TCAAGTGGAA AGTTGGAAAT TCACCTATGG AAGTTGAAAA AAAATTGCTA GGAGAAATTT TAAGGCGACT ACCGCAAAAT GCCCATCTCA GACTCGATGC GAATGGTGGC TGGGATCGCA AACAAGCAAT GGATTGGGCA AATCATTTAT CCACTGAACC AAAACTTGAG TGGATTGAAC AACCACTTCC TGCTAATGAT ATTTCAGGCC TTGAGGAATT ATCTACTAAA ATTCCAGTAG CACTTGATGA ATCTCTTCTA CTCAATCCTG TATTGAAAGA AACTTGGCAA AGTTGGCAAA TTCGGAAGCC ATTGCTTGAA GGGGACCCAA GAGTCTTATT AAAAGAGTTA ACTAACAATG TCGGCTATAG AGTTATAAGC ACCTCGTTCG AAACTGGTAT AGGACGTCGT TGGATTCATC ATCTGGCAGC ATTACAACAA AAAGGGCCAA CGCCTACAGC TCCTGGTCTG GCACCTGGAT GGTGTCCAGA CAGTGCAATG TTTAGCGCTA ATCCAGAGTC AGTATGGGAC GCCGCATGA
|
Protein sequence | MSLFLQIKPF AFQLLMPLRT SQGILRDKKG FLIHLQNEDK ESGWGEVAPM KNAELNLCAA ILKSLGSTPS REKLERNLAT WPGSLSFGIG AALAELDSLV GHKSSQDWLK TSQSALLLPT DKSPVLFLES ILKDSQIKNE NLTIKWKVGN SPMEVEKKLL GEILRRLPQN AHLRLDANGG WDRKQAMDWA NHLSTEPKLE WIEQPLPAND ISGLEELSTK IPVALDESLL LNPVLKETWQ SWQIRKPLLE GDPRVLLKEL TNNVGYRVIS TSFETGIGRR WIHHLAALQQ KGPTPTAPGL APGWCPDSAM FSANPESVWD AA
|
| |
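The sequence fields can be related to the GC content and protein sequence with a few lines of code. The following is a minimal sketch in plain Python, assuming the gene sequence is given in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and that the 10-base groups are separated by whitespace. Translation table 11 shares its codon-to-amino-acid assignments with the standard code, so a standard codon table is used; `clean`, `gc_percent`, and `translate` are hypothetical helper names.

```python
# Codon table in the usual TCAG ordering; the amino-acid assignments of
# translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in the record.
    fragment = "ATGTCCCTAT TCCTACAAAT TAAGCCTTTT"
    print(translate(fragment))  # MSLFLQIKPF
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the listed protein sequence and the reported GC content of 41%.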