Gene B21_01942 details


Gene Information

Locus tag: B21_01942
Symbol: wcaJ
ID: 8114348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2018393
End bp: 2019787
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 54%
IMG OID: 644848157
Product: hypothetical protein
Protein accession: YP_002999730
Protein GI: 251785426
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
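
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2018393, 2019787)   # 1395
length_aa = protein_length(length_bp)       # 464
```

Both values agree with the Gene Length and Protein Length fields reported for this record.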


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0026544
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG 
CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTCAGC
GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCGCTGA TTACGCTGGT GGTGTTCCAG
ATGTTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA
TTTGCCCTGC TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCCGG ACTGGTGGCG
TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT
ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT
GGCTATAACA AGCGCATGGT CGCCGTAGCG GGGGATTTAG CCGCCGGACA AATGCTGATG
GAGAGCTTCC GTAATCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTATA CCACGACCCA
AAACCGGGCG GCGTTTCTAA CGACTGGGCG GGCAATCTGC AACAACTGGT CGAGGATGCG
AAAGCGGGCA AGATTCATAA CGTCTATATC GCGATGCAGA TGTGCGACGG CGCGCGAGTG
AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC
TTTACCTTCA ACATTCTCCA TTCACGCCTC GAAGAGATGA ATGGCGTACC GGTGGTGCCG
CTGTATGACA CGCCGCTTTC CGGGGTTAAC CGCCTGCTCA AACGTGCGGA AGACATTGTG
CTGGCGACGC TTATCCTGCT GCTGATCTCC CCGGTGCTGT GCTGCATTGC GCTGGCGGTG
AAACTGAGCT CACCGGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAA
CCGATCAAAG TGTGGAAGTT CCGATCCATG AAAGTGATGG AGAACGACAA AGTGGTGACT
CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG
CTGGACGAAT TGCCGCAGTT TATCAATGTG CTGACTGGCG GGATGTCGAT TGTCGGTCCA
CGTCCGCACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG
CGCCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAATGGCTG GCGTGGCGAA
ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA
TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAGGG CTTCGTTAAC
AAAGCGGCAT ATTGA
 
Protein sequence
MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ 
MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS
IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP
KPGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV
FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLLKRAEDIV LATLILLLIS PVLCCIALAV
KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS
LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE
TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY
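
As a rough illustration of how the two sequences relate, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal, self-contained version and not the database's own procedure. It assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional NCBI ordering: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_block = "ATGACAAATC TAAAAAAGCG CGAGCGAGCG"
last_block = "AAAGCGGCAT ATTGA"

start_of_protein = translate(first_block)  # "MTNLKKRERA"
end_of_protein = translate(last_block)     # "KAAY" (the TGA stop codon is dropped)
```

Passing the full gene sequence to `translate` should give a protein that can be compared with the listed one, and `gc_fraction` applied to the same sequence gives a value to compare with the GC content field.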