Gene B21_00075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00075 
SymbolleuA 
ID8113595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp84763 
End bp86334 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content53% 
IMG OID644846369 
Producthypothetical protein 
Protein accessionYP_002997942 
Protein GI251783638 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC AAGTCATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG 
GCAAGCTTGA GTGTGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT
GACGTGATGG AAGTCGGTTT CCCCGTTTCT TCGCCGGGTG ATTTTGAATC AGTGCAAACC
ATCGCTCGCC AGGTTAAAAA CAGCCGCGTA TGCGCGTTAG CTCGCTGCGT GGAGAAAGAT
ATCGACGTGG CGGCCGAATC CCTGAAAGTC GCCGAAGCCT TCCGTATTCA TACCTTTATT
GCCACTTCGC CAATGCACAT CGCCACCAAG CTGCGCAGCA CGCTGGATGA AGTGATCGAA
CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC
GAAGATGCCG GGCGTACACC CATTGCCGAT CTGGCGCGAG TGGTCGAAGC GGCGATTAAT
GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC
GCCGGAATCA TCAGCGGCCT GTATGAACGC GTACCTAACA TCGACAAAGC CATTATTTCC
GTACATACCC ACGACGATTT AGGCCTGGCG GTCGGCAACT CACTGGCGGC GGTACATGCC
GGTGCACGCC AGGTGGAAGG CGCAATGAAC GGGATCGGCG AGCGTGCCGG AAACTGTTCC
CTGGAAGAAG TCATCATGGC GATCAAAGTT CGTAAGGATA TTCTCAACGT CCACACCGCC
ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG
ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CACACTCCTC CGGTATCCAC
CAGGATGGCG TGCTGAAAAA CCGCGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT
CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC
ATGGATGAGA TGGGGTATAA AGAAAGTGAA TATAATTTAG ACAATTTGTA CGATGCTTTC
CTGAAGCTGG CGGACAAAAA AGGTCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC
ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCT
GGCTCTAACG ATATCGCCAC CGCCGCCGTC AAACTGGCCT GTGGCGAAGA AGTCAAAGCA
GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTATC AGGCAATTAA CCGCATCACT
GAATATAACG TCGAACTGGT GAAATACAGC CTGACCGCCA AAGGCCACGG TAAAGATGCG
CTGGGTCAGG TGGATATCGT CGCTAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG
GCTACCGATA TTGTCGAGTC ATCTGCCAAA GCCATGGTGC ACGTTCTGAA CAATATCTGG
CGTGCCGCAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG
GAAACCGTGT GA
 
Protein sequence
MSQQVIIFDT TLRDGEQALQ ASLSVKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT 
IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE
RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF
AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS
LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH
QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF
LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA
EAANGNGPVD AVYQAINRIT EYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL
ATDIVESSAK AMVHVLNNIW RAAEVEKELQ RKAQHNENNK ETV