Gene P9211_01931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01931 
SymbolmenC 
ID5730900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp184830 
End bp185798 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content41% 
IMG OID641284537 
Productputative O-succinylbenzoate synthase 
Protein accessionYP_001550078 
Protein GI159902734 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.925844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTAT TCCTACAAAT TAAGCCTTTT GCTTTCCAAC TATTGATGCC TCTCAGAACT 
TCTCAGGGGA TTCTCCGTGA CAAAAAAGGT TTTCTTATAC ATCTACAGAA TGAAGACAAA
GAATCTGGCT GGGGAGAAGT AGCACCTATG AAAAACGCAG AATTAAACTT GTGCGCAGCA
ATACTTAAGA GTCTTGGGAG CACTCCCTCT AGGGAAAAAC TAGAAAGAAA TTTAGCAACT
TGGCCAGGGT CATTAAGCTT TGGTATAGGC GCAGCACTTG CAGAATTAGA TTCTCTTGTG
GGACATAAAT CAAGCCAAGA TTGGTTGAAA ACCTCTCAAT CAGCACTTCT CTTACCTACA
GATAAATCTC CTGTCCTATT TCTTGAATCA ATATTAAAAG ACTCACAGAT AAAGAATGAG
AACCTAACTA TCAAGTGGAA AGTTGGAAAT TCACCTATGG AAGTTGAAAA AAAATTGCTA
GGAGAAATTT TAAGGCGACT ACCGCAAAAT GCCCATCTCA GACTCGATGC GAATGGTGGC
TGGGATCGCA AACAAGCAAT GGATTGGGCA AATCATTTAT CCACTGAACC AAAACTTGAG
TGGATTGAAC AACCACTTCC TGCTAATGAT ATTTCAGGCC TTGAGGAATT ATCTACTAAA
ATTCCAGTAG CACTTGATGA ATCTCTTCTA CTCAATCCTG TATTGAAAGA AACTTGGCAA
AGTTGGCAAA TTCGGAAGCC ATTGCTTGAA GGGGACCCAA GAGTCTTATT AAAAGAGTTA
ACTAACAATG TCGGCTATAG AGTTATAAGC ACCTCGTTCG AAACTGGTAT AGGACGTCGT
TGGATTCATC ATCTGGCAGC ATTACAACAA AAAGGGCCAA CGCCTACAGC TCCTGGTCTG
GCACCTGGAT GGTGTCCAGA CAGTGCAATG TTTAGCGCTA ATCCAGAGTC AGTATGGGAC
GCCGCATGA
 
Protein sequence
MSLFLQIKPF AFQLLMPLRT SQGILRDKKG FLIHLQNEDK ESGWGEVAPM KNAELNLCAA 
ILKSLGSTPS REKLERNLAT WPGSLSFGIG AALAELDSLV GHKSSQDWLK TSQSALLLPT
DKSPVLFLES ILKDSQIKNE NLTIKWKVGN SPMEVEKKLL GEILRRLPQN AHLRLDANGG
WDRKQAMDWA NHLSTEPKLE WIEQPLPAND ISGLEELSTK IPVALDESLL LNPVLKETWQ
SWQIRKPLLE GDPRVLLKEL TNNVGYRVIS TSFETGIGRR WIHHLAALQQ KGPTPTAPGL
APGWCPDSAM FSANPESVWD AA