Gene B21_00314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00314 
SymbolfrmA 
ID8113144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp347698 
End bp348807 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID644846600 
Producthypothetical protein 
Protein accessionYP_002998173 
Protein GI251783869 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAC GTGCTGCCGT TGCATTTGCT CCCGGTAAAC CGCTGGAAAT CGTTGAAATT 
GACGTTGCAC CACCGAAAAA AGGTGAAGTG CTGATTAAAG TCACCCATAC CGGCGTTTGC
CATACCGACG CATTTACCCT CTCCGGGGAT GACCCGGAAG GTGTATTCCC GGTGGTTCTC
GGTCACGAAG GGGCCGGCGT TGTGGTTGAA GTCGGTGAAG GCGTAACCAG CGTCAAACCT
GGCGACCATG TGATCCCGCT TTACACCGCG GAGTGCGGCG AGTGTGAGTT CTGTCGTTCT
GGCAAAACTA ACCTCTGTGT TGCGGTTCGC GAAACCCAGG GTAAAGGCTT GATGCCAGAC
GGCACCACCC GTTTTTCTTA CAACGGGCAG CCGCTTTATC ACTACATGGG ATGCTCAACA
TTCAGTGAAT ACACCGTGGT CGCGGAAGTG TCTCTGGCCA AAATTAATCC AGAAGCAAAC
CATGAACACG TCTGCCTGCT GGGCTGTGGC GTGACCACCG GTATTGGCGC GGTGCACAAC
ACAGCTAAAG TCCAGCCAGG TGATTCTGTT GCCGTGTTTG GTCTTGGCGC GATTGGTCTG
GCAGTGGTTC AGGGCGCGCG TCAGGCGAAA GCGGGACGGA TTATCGCTAT CGATACCAAC
CCGAAGAAAT TCGATCTGGC TCGTCGCTTC GGTGCTACCG ACTGCATTAA CCCGAATGAC
TACGACAAAC CGATTAAAGA TGTCCTGCTG GATATCAACA AATGGGGTAT CGACCATACC
TTTGAATGCA TCGGTAACGT CAACGTGATG CGTGCGGCGC TGGAAAGTGC GCACCGCGGC
TGGGGTCAGT CGGTGATCAT CGGGGTAGCA GGTGCCGGTC AGGAAATCTC CACCCGACCA
TTCCAGTTGG TCACCGGTCG CGTATGGAAA GGTTCCGCGT TTGGCGGCGT GAAAGGTCGT
TCCCAGTTAC CGGGTATGGT TGAAGATGCG ATGAAAGGTG ATATCGATCT GGAACCGTTT
GTCACGCATA CCATGAGCCT TGATGAAATT AATGACGCCT TCGACCTGAT GCATGAAGGC
AAATCCATTC GAACCGTAAT TCGTTACTGA
 
Protein sequence
MKSRAAVAFA PGKPLEIVEI DVAPPKKGEV LIKVTHTGVC HTDAFTLSGD DPEGVFPVVL 
GHEGAGVVVE VGEGVTSVKP GDHVIPLYTA ECGECEFCRS GKTNLCVAVR ETQGKGLMPD
GTTRFSYNGQ PLYHYMGCST FSEYTVVAEV SLAKINPEAN HEHVCLLGCG VTTGIGAVHN
TAKVQPGDSV AVFGLGAIGL AVVQGARQAK AGRIIAIDTN PKKFDLARRF GATDCINPND
YDKPIKDVLL DINKWGIDHT FECIGNVNVM RAALESAHRG WGQSVIIGVA GAGQEISTRP
FQLVTGRVWK GSAFGGVKGR SQLPGMVEDA MKGDIDLEPF VTHTMSLDEI NDAFDLMHEG
KSIRTVIRY