Gene B21_01939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01939 
SymbolwcaL 
ID8114341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2014254 
End bp2015474 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID644848154 
Producthypothetical protein 
Protein accessionYP_002999727 
Protein GI251785423 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.156433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCACTGTCGT CGGAAACCTT CGTCCTCAAT 
CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC
GACACACAAA ACACCCACGC GGCATGGACA AAATATAACC TTGCCGCCAG AACCCGCTGG
TTACAGGACG AGCCACAAGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACGTTG
CGCGGTATTC ATCGTAAAAA TACCTGGCAG GCGCTTAACC TCAAACGCTA TGGTGCTGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCATGCCGAT
GTGTTTATCG CTCATTTTGG CCCTGCTGGG GTAACCGCGG CAAAACTACG CGAACTGGGT
GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCA
ATAAGCAATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA
ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG
ATTGAAGCCT GCCGCCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGCACC CTCATCGAAC AATATCAACT GGAAGATGTG
ATAGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA TGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGCGATA TGGAAGGTAT TCCGGTGGCG
CTAATGGAAG CGATGGCGGT TGGCATTCCG GTTGTTTCTA CTCTGCACAG CGGAATACCG
GAACTGGTGG AGGCCGATAA ATCCGGTTGG CTGGTGCCTG AGAACGATGC CTGTGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AACTTACTAC GGTTGTCAAA
CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC
AGCTTGCTGC AGGCTTTATA G
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW 
LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHAD
VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISNLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDACAL
AQRLAAFSQL DTDELTTVVK RAREKVEHDF NQQVINRELA SLLQAL