Gene ECD_02191 details

Gene Information

Locus tag: ECD_02191
Symbol: menD
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2267763
End bp: 2269433
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase
Protein accession: ACT44014
Protein GI: 253978344
COG category:
COG ID:
TIGRFAM ID:
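
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2267763, 2269433)
print(length, protein_length(length))  # prints: 1671 556
```

Both results match the Gene Length and Protein Length fields listed in the record.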


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGTAA GCGCATTTAA CCGACGCTGG GCGGCGGTCA TTCTGGAAGC ATTAACGCGT 
CACGGCGTCA GACACATCTG TATCGCCCCA GGCTCGCGTT CTACACCGTT AACGTTAGCG
GCGGCGGAGA ATTCCGCATT CATTCACCAC ACCCATTTCG ATGAGCGTGG GTTGGGGCAT
CTGGCGCTGG GGCTGGCGAA AGTCAGCAAG CAGCCGGTGG CGGTGATTGT GACCTCCGGC
ACGGCGGTGG CAAATCTCTA TCCGGCACTG ATTGAAGCCG GGTTAACCGG AGAAAAACTG
ATTCTCTTAA CCGCCGATCG CCCGCCGGAG CTAATTGACT GCGGCGCGAA TCAGGCAATT
CGCCAGCCGG GAATGTTCGC CTCTCACCCC ACGCACAGTA TTTCATTGCC GCGCCCGACC
CAGGATATCC CCGCACGTTG GCTGGTTTCT ACCATCGACC ACGCTCTCGG TACGCTTCAT
GCGGGGGGAG TCCATATCAA CTGCCCGTTT GCTGAACCGC TGTATGGCGA AATGGACGAT
ACCGGGCTTA GCTGGCAACA GCGTCTGGGT GACTGGTGGC AGGACGACAA ACCGTGGCTG
CGTGAAGCGC CTCGTCTGGA AAGTGAAAAA CAGCGCGACT GGTTCTTCTG GCGACAAAAG
CGCGGCGTGG TGGTTGCCGG GCGCATGAGT GCGGAAGAGG GCAAAAAAGT TGCCCTGTGG
GCGCAAACTC TTGGCTGGCC GCTGATTGGC GATGTGCTGT CACAAACCGG GCAGCCGCTG
CCGTGTGCCG ATCTTTGGTT AGGCAATGCC AAAGCGACCA GCGAGCTGCA GCAGGCGCAA
ATTGTGGTGC AACTGGGAAG CAGCCTGACG GGGAAACGGC TCCTGCAATG GCAGGCAAGC
TGTGAACCAG AAGAGTACTG GATTGTTGAT GACATTGAAG GGCGACTTGA TCCGGCACAC
CATCGCGGAC GTCGCTTAAT TGCCAATATT GCCGACTGGC TGGAGCTGCA TCCGGCAGAA
AAACGCCAGC CCTGGTGCGT TGAAATCCCG CGCCTGGCGG AACAGGCAAT GCAGGCGGTT
ATTGCCCGCC GTGATGCGTT TGGCGAAGCG CAACTGGCGC ATCGCATCTG CGACTATCTG
CCTGAACAGG GGCAATTGTT TGTTGGTAAC AGCCTGGTGG TACGTCTGAT TGATGCGCTT
TCGCAACTTC CGGCAGGTTA CCCGGTGTAC AGCAACCGTG GGGCCAGCGG TATCGACGGG
CTGCTTTCGA CCGCCGCCGG CGTTCAGCGG GCAAGCGGCA AACCGACGCT GGCGATTGTG
GGCGATCTCT CCGCACTTTA CGATCTCAAC GCGCTGGCGT TATTGCGTCA GGTTTCTGCG
CCGCTGGTAT TAATTGTGGT GAACAACAAC GGCGGGCAAA TTTTCTCGCT GTTGCCAACG
CCGCAAAGCG AGCGTGAGCG TTTCTATCTG ATGCCGCAAA ACGTCCATTT TGAGCACGCC
GCCGCGATGT TCGAGCTGAA ATATCATCGT CCGCAAAACT GGCAGGAACT TGAAACGGCA
TTTGCCGACG CCTGGCGCAC GCCAACCACC ACGGTGATTG AAATGGTGGT TAACGACACC
GATGGTGCGC AAACGCTCCA GCAACTTCTG GCGCAGGTAA GCCATTTATG A
 
Protein sequence
MSVSAFNRRW AAVILEALTR HGVRHICIAP GSRSTPLTLA AAENSAFIHH THFDERGLGH 
LALGLAKVSK QPVAVIVTSG TAVANLYPAL IEAGLTGEKL ILLTADRPPE LIDCGANQAI
RQPGMFASHP THSISLPRPT QDIPARWLVS TIDHALGTLH AGGVHINCPF AEPLYGEMDD
TGLSWQQRLG DWWQDDKPWL REAPRLESEK QRDWFFWRQK RGVVVAGRMS AEEGKKVALW
AQTLGWPLIG DVLSQTGQPL PCADLWLGNA KATSELQQAQ IVVQLGSSLT GKRLLQWQAS
CEPEEYWIVD DIEGRLDPAH HRGRRLIANI ADWLELHPAE KRQPWCVEIP RLAEQAMQAV
IARRDAFGEA QLAHRICDYL PEQGQLFVGN SLVVRLIDAL SQLPAGYPVY SNRGASGIDG
LLSTAAGVQR ASGKPTLAIV GDLSALYDLN ALALLRQVSA PLVLIVVNNN GGQIFSLLPT
PQSERERFYL MPQNVHFEHA AAMFELKYHR PQNWQELETA FADAWRTPTT TVIEMVVNDT
DGAQTLQQLL AQVSHL
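
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, with hypothetical helper names, of how such blocks might be joined into one contiguous string and how a GC percentage comparable to the GC content field could be computed. It assumes the input contains only sequence letters and whitespace.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as sample input.
sample = "ATGTCAGTAA GCGCATTTAA CCGACGCTGG GCGGCGGTCA TTCTGGAAGC ATTAACGCGT"
seq = join_blocks(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence, rather than the single sample line, would give a value to compare against the GC content field.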