Gene ECD_02187 details

Gene Information

Locus tag: ECD_02187
Symbol: menE
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2263822
End bp: 2265177
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: O-succinylbenzoic acid-CoA ligase
Protein accession: ACT44010
Protein GI: 253978340
COG category:
COG ID:
TIGRFAM ID:
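
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2263822
end_bp = 2265177

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.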


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTTCT CTGACTGGCC GTGGCGTCAC TGGCGGCAAG TGCGGGGAGA AACCATCGCC 
TTACGTCTTA ATGACGAGCA ACTCAACTGG CGCGAGCTTT GTGCTCGCGT CGATGAATTA
GCCTCCGGAT TTGCGGTGCA GGGGGTGGTT GAGGGCAGCG GCGTGATGTT GCGGGCGTGG
AATACGCCGC AAACGCTGCT CGCCTGGCTG GCGTTACTGC AATGCGGGGC GCGGGTGTTG
CCCGTGAACC CTCAGCTGCC GCAACCGTTG CTTGAAGAAT TGCTGCCCAA TCTGACGCTG
CAATTTGCTC TGGTGCCGGA TGGGGAAAAC ACGTTTCCGG CATTAACGTC GCTGCACATT
CAGCTGGTTG AAGGCGCACA TGCCGCTACG TGGCAGCCGA CGCGTCTGTG CTCAATGACG
TTGACCTCGG GTTCTACCGG TTTGCCGAAA GCCGCTGTCC ATACTTATCA GGCCCATCTT
GCCAGTGCGC AAGGTGTGTT ATCGCTGATT CCGTTTGGCG ATCACGATGA TTGGTTGCTC
TCTTTACCGC TGTTTCACGT CTCCGGTCAG GGAATTATGT GGCGCTGGTT ATACGCTGGT
GCGCGGATGA CGGTACGTGA TAAACAGCCA TTGGAGCAAA TGCTGGCAGG CTGTACTCAC
GCTTCACTGG TGCCAACACA ACTCTGGCGT TTGCTGGTTA ACCGTAGTTC CGTTTCCCTG
AAAGCGGTGT TACTTGGCGG CGCGGCTATC CCGGTCGAGT TGACGGAACA GGCGCGCGAG
CAGGGGATTC GTTGCTTTTG CGGCTATGGT CTGACCGAGT TTGCCTCCAC GGTGTGTGCG
AAAGAAGCCG ACGGCCTGGC AGACGTTGGT TCGCCGCTGC CGGGTCGGGA AGTGAAAATC
GTTAATAATG AAGTGTGGCT GCGGGCTGCC AGTATGGCAG AAGGTTACTG GCGTAACGGG
CAACTGGTTT CACTGGTTAA TGACGAAGGC TGGTACGCTA CGCGCGATCG CGGTGAGATG
CATAATGGCA AGCTGACCAT TGTCGGACGT TTAGACAATC TATTCTTCAG TGGCGGAGAG
GGTATTCAGC CGGAAGAAGT CGAGCGCGTA ATTGCTGCAC ATCCTGCGGT TTTGCAGGTG
TTTATCGTCC CCGTTGCCGA CAAGGAGTTT GGTCATCGAC CGGTGGCGGT GATGGAGTAT
GACCACGAGA GCGTTGATCT TAGTGAATGG GTGAAAGATA AGCTGGCCCG TTTTCAACAA
CCGGTGCGCT GGCTAACTCT GCCGCCGGAG CTGAAAAACG GCGGTATTAA AATTTCACGT
CAGGCGCTAA AAGAGTGGGT GCAACGTCAG CAATAA
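
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before computing the length or GC content. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the input used here is only the last line of the listing, so its GC percentage is not expected to match the value reported for the whole gene.

```python
def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in a sequence listing."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence listing, copied as printed (36 bases).
fragment = "CAGGCGCTAA AAGAGTGGGT GCAACGTCAG CAATAA"

print(len(clean_sequence(fragment)))
print(gc_percent(fragment))
```

Passing the full 1356 bp listing to `gc_percent` instead of the fragment would give the figure to compare against the GC content field.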
 
Protein sequence
MIFSDWPWRH WRQVRGETIA LRLNDEQLNW RELCARVDEL ASGFAVQGVV EGSGVMLRAW 
NTPQTLLAWL ALLQCGARVL PVNPQLPQPL LEELLPNLTL QFALVPDGEN TFPALTSLHI
QLVEGAHAAT WQPTRLCSMT LTSGSTGLPK AAVHTYQAHL ASAQGVLSLI PFGDHDDWLL
SLPLFHVSGQ GIMWRWLYAG ARMTVRDKQP LEQMLAGCTH ASLVPTQLWR LLVNRSSVSL
KAVLLGGAAI PVELTEQARE QGIRCFCGYG LTEFASTVCA KEADGLADVG SPLPGREVKI
VNNEVWLRAA SMAEGYWRNG QLVSLVNDEG WYATRDRGEM HNGKLTIVGR LDNLFFSGGE
GIQPEEVERV IAAHPAVLQV FIVPVADKEF GHRPVAVMEY DHESVDLSEW VKDKLARFQQ
PVRWLTLPPE LKNGGIKISR QALKEWVQRQ Q
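
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first 30 and last 36 bases of the gene listing. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The sketch does not handle alternative start codons, which is acceptable here only because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Build the codon -> amino acid lookup from the two strings above.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA listing in frame 1, ignoring block spacing.

    Any trailing bases that do not form a complete codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 36 bases of the gene sequence, copied as printed.
first_bases = "ATGATCTTCT CTGACTGGCC GTGGCGTCAC"
last_bases = "CAGGCGCTAA AAGAGTGGGT GCAACGTCAG CAATAA"

print(translate(first_bases))
print(translate(last_bases))
```

The two outputs can be compared with the start and end of the protein sequence listing. The trailing "*" in the second output is the stop codon, which is not part of the 451 aa protein.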