Gene B21_02150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02150 
SymbolmenD 
ID8114728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2267072 
End bp2268742 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content58% 
IMG OID644848357 
Producthypothetical protein 
Protein accessionYP_002999930 
Protein GI251785626 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1165] 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase 
TIGRFAM ID[TIGR00173] 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylic-acid synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGTAA GCGCATTTAA CCGACGCTGG GCGGCGGTCA TTCTGGAAGC ATTAACGCGT 
CACGGCGTCA GACACATCTG TATCGCCCCA GGCTCGCGTT CTACACCGTT AACGTTAGCG
GCGGCGGAGA ATTCCGCATT CATTCACCAC ACCCATTTCG ATGAGCGTGG GTTGGGGCAT
CTGGCGCTGG GGCTGGCGAA AGTCAGCAAG CAGCCGGTGG CGGTGATTGT GACCTCCGGC
ACGGCGGTGG CAAATCTCTA TCCGGCACTG ATTGAAGCCG GGTTAACCGG AGAAAAACTG
ATTCTCTTAA CCGCCGATCG CCCGCCGGAG CTAATTGACT GCGGCGCGAA TCAGGCAATT
CGCCAGCCGG GAATGTTCGC CTCTCACCCC ACGCACAGTA TTTCATTGCC GCGCCCGACC
CAGGATATCC CCGCACGTTG GCTGGTTTCT ACCATCGACC ACGCTCTCGG TACGCTTCAT
GCGGGGGGAG TCCATATCAA CTGCCCGTTT GCTGAACCGC TGTATGGCGA AATGGACGAT
ACCGGGCTTA GCTGGCAACA GCGTCTGGGT GACTGGTGGC AGGACGACAA ACCGTGGCTG
CGTGAAGCGC CTCGTCTGGA AAGTGAAAAA CAGCGCGACT GGTTCTTCTG GCGACAAAAG
CGCGGCGTGG TGGTTGCCGG GCGCATGAGT GCGGAAGAGG GCAAAAAAGT TGCCCTGTGG
GCGCAAACTC TTGGCTGGCC GCTGATTGGC GATGTGCTGT CACAAACCGG GCAGCCGCTG
CCGTGTGCCG ATCTTTGGTT AGGCAATGCC AAAGCGACCA GCGAGCTGCA GCAGGCGCAA
ATTGTGGTGC AACTGGGAAG CAGCCTGACG GGGAAACGGC TCCTGCAATG GCAGGCAAGC
TGTGAACCAG AAGAGTACTG GATTGTTGAT GACATTGAAG GGCGACTTGA TCCGGCACAC
CATCGCGGAC GTCGCTTAAT TGCCAATATT GCCGACTGGC TGGAGCTGCA TCCGGCAGAA
AAACGCCAGC CCTGGTGCGT TGAAATCCCG CGCCTGGCGG AACAGGCAAT GCAGGCGGTT
ATTGCCCGCC GTGATGCGTT TGGCGAAGCG CAACTGGCGC ATCGCATCTG CGACTATCTG
CCTGAACAGG GGCAATTGTT TGTTGGTAAC AGCCTGGTGG TACGTCTGAT TGATGCGCTT
TCGCAACTTC CGGCAGGTTA CCCGGTGTAC AGCAACCGTG GGGCCAGCGG TATCGACGGG
CTGCTTTCGA CCGCCGCCGG CGTTCAGCGG GCAAGCGGCA AACCGACGCT GGCGATTGTG
GGCGATCTCT CCGCACTTTA CGATCTCAAC GCGCTGGCGT TATTGCGTCA GGTTTCTGCG
CCGCTGGTAT TAATTGTGGT GAACAACAAC GGCGGGCAAA TTTTCTCGCT GTTGCCAACG
CCGCAAAGCG AGCGTGAGCG TTTCTATCTG ATGCCGCAAA ACGTCCATTT TGAGCACGCC
GCCGCGATGT TCGAGCTGAA ATATCATCGT CCGCAAAACT GGCAGGAACT TGAAACGGCA
TTTGCCGACG CCTGGCGCAC GCCAACCACC ACGGTGATTG AAATGGTGGT TAACGACACC
GATGGTGCGC AAACGCTCCA GCAACTTCTG GCGCAGGTAA GCCATTTATG A
 
Protein sequence
MSVSAFNRRW AAVILEALTR HGVRHICIAP GSRSTPLTLA AAENSAFIHH THFDERGLGH 
LALGLAKVSK QPVAVIVTSG TAVANLYPAL IEAGLTGEKL ILLTADRPPE LIDCGANQAI
RQPGMFASHP THSISLPRPT QDIPARWLVS TIDHALGTLH AGGVHINCPF AEPLYGEMDD
TGLSWQQRLG DWWQDDKPWL REAPRLESEK QRDWFFWRQK RGVVVAGRMS AEEGKKVALW
AQTLGWPLIG DVLSQTGQPL PCADLWLGNA KATSELQQAQ IVVQLGSSLT GKRLLQWQAS
CEPEEYWIVD DIEGRLDPAH HRGRRLIANI ADWLELHPAE KRQPWCVEIP RLAEQAMQAV
IARRDAFGEA QLAHRICDYL PEQGQLFVGN SLVVRLIDAL SQLPAGYPVY SNRGASGIDG
LLSTAAGVQR ASGKPTLAIV GDLSALYDLN ALALLRQVSA PLVLIVVNNN GGQIFSLLPT
PQSERERFYL MPQNVHFEHA AAMFELKYHR PQNWQELETA FADAWRTPTT TVIEMVVNDT
DGAQTLQQLL AQVSHL