Gene Syncc9902_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2118 
Symbol 
ID3742767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp2019288 
End bp2020259 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content62% 
IMG OID637772316 
Productputative O-succinylbenzoate synthase 
Protein accessionYP_378119 
Protein GI78185685 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.946546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTTC AGCTTCTGGT GCGTTCCTTC AGTTTTTCGC TCCTGCACCC GTTGCGGACC 
GCCACGGGTG TGGTGGCCAA GCGGCGCGGT TGGCTACTGC GGTTGTGCGA TGGCGATACC
GGAGCCGTGG GTTGGGGCGA GGTGGCGCCG TTGCGAACGG AGCAGTGGTT GCACTGCAAA
ATATTGATGG GAGCGCTGCC TGAAGAGGTG AGTCGTCATC AGCTTGAGGT GTTGATCCAT
CAGGGGCCAG GGGCGTTTGG CTTTGGCTTG GGTTCAGCGT TAGCAGAGCT GGATGGACTC
GTTGGGGATT TGTCCTCCCA GCCTTGGCAA GAAGCGCCTT CCCCGGCTCA CCTCCTGCCA
GCGGGCGATC AGATGCTCAT GGCGTTGGAT CAAGTTCTCA TGGATTGCCC GTCATCCCAC
TCGCTCACCT TCAAGTGGAA GGTGGCTACA GAAGCATCGG AACGTGAACA ACGTTGGTTG
GAGCAGCTTC TCGTGCGTCT TCCCCGTAAC GCCAGGCTGC GGTTGGATGC CAATGGCGGG
TGGGACCGCT CTACCGCAGG AGTTTGGCTG GAGCGGTTGC GGCAGGAGTC GCGGTTTGAT
TGGCTGGAGC AGCCATTGGC GGTTGATGAC CACGAAGGGC TCGAGCAACT GGCCAAACGC
GGGTCGGTGG CCCTCGATGA ATCACTTGAG CGTCGACCCG AGCTCCGTGA CAGCTGGATG
GGGTGGCAGG TGCGGCGCCC TGCGGTAGAT GGCGATCCAC GGCCGTTGTT GCGGCAAATC
CAAGCTGGAG TTCCCTATCG AATGGTGAGT ACGGCGTTCG AATCCGGGAT CGCTCGCCGT
TGGGTGCACC ATTTGGCGGC GTTGCAGTGG GTTGGCCCAA CGCCTGCTGC CCCTGGCCTG
GCGCCGGGAT GGTGCCCTGA AGGTGCCTTG TTCTCTCACA ATCCAGAGCT GGTCTGGGCA
GCTGCAGGGT GA
 
Protein sequence
MTLQLLVRSF SFSLLHPLRT ATGVVAKRRG WLLRLCDGDT GAVGWGEVAP LRTEQWLHCK 
ILMGALPEEV SRHQLEVLIH QGPGAFGFGL GSALAELDGL VGDLSSQPWQ EAPSPAHLLP
AGDQMLMALD QVLMDCPSSH SLTFKWKVAT EASEREQRWL EQLLVRLPRN ARLRLDANGG
WDRSTAGVWL ERLRQESRFD WLEQPLAVDD HEGLEQLAKR GSVALDESLE RRPELRDSWM
GWQVRRPAVD GDPRPLLRQI QAGVPYRMVS TAFESGIARR WVHHLAALQW VGPTPAAPGL
APGWCPEGAL FSHNPELVWA AAG