Gene EcolC_1387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1387 
Symbol 
ID6067995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1520558 
End bp1521520 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content59% 
IMG OID641600807 
ProductO-succinylbenzoate synthase 
Protein accessionYP_001724378 
Protein GI170019424 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1441] O-succinylbenzoate synthase 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCG CGCAGGTATA CCGCTGGCAG ATCCCCATGG ACGCGGGGGT GGTTCTGCGC 
GACAGGCGGT TAAAAACCCG CGACGGGCTG TATGTTTGCC TGCGTGAAGG CGAGCGCGAA
GGGTGGGGGG AGATCTCCCC ACTGCCGGGC TTCAGTCAGG AAACCTGGGA AGAGGCGCAA
AGTGTGCTGC TTGCCTGGGT AAATAACTGG CTGGCAGGCG ATTGCGAGCT ACCGCAGATG
CCTTCCGTGG CCTTTGGCGT AAGCTGTGCA TTGGCAGAAC TGACAGATAC GTTGCCGCAA
GCAGCCAACT ACCGTGCGGC ACCGCTGTGT AATGGCGATC CGGACGATCT GATCCTCAAA
CTTGCAGATA TGCCAGGCGA GAAAGTGGCG AAGGTCAAAG TGGGATTGTA CGAAGCGGTG
CGCGACGGCA TGGTGGTGAA TCTGTTGCTG GAGGCAATTC CGGATCTGCA TTTGCGTCTT
GACGCAAATC GCGCCTGGAC ACCGCTGAAA GGTCAGCAGT TTGCCAAATA CGTTAACCCG
GATTATCGCG ACCGCATCGC GTTTCTCGAA GAGCCGTGCA AAACCCGCGA TGATTCGCGA
GCGTTTGCCC GTGAAACCGG CATTGCCATT GCCTGGGATG AAAGCCTGCG CGAGCCGGAT
TTTGCCTTTG TGGCTGAAGA GGGCGTGCGC GCGGTAGTTA TCAAACCCAC GCTCACGGGC
AGTCTGGAAA AAGTACGCGA GCAGGTACAG GCGGCGCACG CGCTGGGGCT GACGGCGGTG
ATCAGTTCTT CCATTGAATC GAGCTTAGGC TTAACGCAAC TGGCGCGGAT TGCCGCCTGG
TTAACGCCGG ACACCATTCC AGGGCTGGAC ACGCTGGATC TGATGCAGGC GCAGCAGGTA
CGTCGCTGGC CGGGTAGCAC GCTGCCTGTC GTGGAAGTTG ATGCACTGGA GCGGTTGTTA
TGA
 
Protein sequence
MRSAQVYRWQ IPMDAGVVLR DRRLKTRDGL YVCLREGERE GWGEISPLPG FSQETWEEAQ 
SVLLAWVNNW LAGDCELPQM PSVAFGVSCA LAELTDTLPQ AANYRAAPLC NGDPDDLILK
LADMPGEKVA KVKVGLYEAV RDGMVVNLLL EAIPDLHLRL DANRAWTPLK GQQFAKYVNP
DYRDRIAFLE EPCKTRDDSR AFARETGIAI AWDESLREPD FAFVAEEGVR AVVIKPTLTG
SLEKVREQVQ AAHALGLTAV ISSSIESSLG LTQLARIAAW LTPDTIPGLD TLDLMQAQQV
RRWPGSTLPV VEVDALERLL