Gene B21_01945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01945 
SymbolwcaI 
ID8116113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2022844 
End bp2024067 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content55% 
IMG OID644848160 
Producthypothetical protein 
Protein accessionYP_002999733 
Protein GI251785429 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC 
ACCGGCGAGA TGGTGGAATG GCTGGCGGCA CAAGGTCATG AGGTGCGGGT TATTACCGCA
CCGCCTTACT ACCCGCAGTG GCAGGTGGGC GAGAACTATT CCGCCTGGCG CTACAAACGA
GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC
CTGAAACGCT TGTTGCATCT CGGCAGTTTT GCCGTCAGCA GTTTCTTTCC ACTGATGGCG
CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTAGTGC CAACGCTGTT TTGCACGCCG
GGAATGCGCC TGCTGGCGAA GCTCTCTGGT GCGCGTACCG TGCTGCATAT TCAGGATTAC
GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA
CAGCTGGCGA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT
TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCGGATAA CATCATTTTC
TTCCCCAACT GGTCGGAAAT CGCCCGTTTT CAGCATGTTG CAGACGCCGA TGTTGATGCC
CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT
GGTGAAAAGC AGGGACTGGA AAACGTTATT GAAGCTGCCG ATCGCCTGCG CGATGAACCG
CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAG
CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA
CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA
TTGCCGTCGA AACTGACCAA TATTCTGGCA GTAGGCGGTA ACGCGGTGAT TACTGCTGAA
GCCTACACAG AACTGGGGCA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA
CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTTCTGCT ACCCAAACAC
AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTCGATA AAGAGAACGT GTTACGTCAA
TTTATAAATG ATATTCGGGG ATAA
 
Protein sequence
MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR 
EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP
GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI
SRSMMNKAIE KGVAADNIIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI
GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA
LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AYTELGQLCE TFPGIAVCVE
PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG