Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01942 |
Symbol | wcaJ |
ID | 8114348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2018393 |
End bp | 2019787 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644848157 |
Product | hypothetical protein |
Protein accession | YP_002999730 |
Protein GI | 251785426 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0026544 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTCAGC GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCGCTGA TTACGCTGGT GGTGTTCCAG ATGTTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA TTTGCCCTGC TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCCGG ACTGGTGGCG TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT GGCTATAACA AGCGCATGGT CGCCGTAGCG GGGGATTTAG CCGCCGGACA AATGCTGATG GAGAGCTTCC GTAATCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTATA CCACGACCCA AAACCGGGCG GCGTTTCTAA CGACTGGGCG GGCAATCTGC AACAACTGGT CGAGGATGCG AAAGCGGGCA AGATTCATAA CGTCTATATC GCGATGCAGA TGTGCGACGG CGCGCGAGTG AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC TTTACCTTCA ACATTCTCCA TTCACGCCTC GAAGAGATGA ATGGCGTACC GGTGGTGCCG CTGTATGACA CGCCGCTTTC CGGGGTTAAC CGCCTGCTCA AACGTGCGGA AGACATTGTG CTGGCGACGC TTATCCTGCT GCTGATCTCC CCGGTGCTGT GCTGCATTGC GCTGGCGGTG AAACTGAGCT CACCGGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAA CCGATCAAAG TGTGGAAGTT CCGATCCATG AAAGTGATGG AGAACGACAA AGTGGTGACT CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG CTGGACGAAT TGCCGCAGTT TATCAATGTG CTGACTGGCG GGATGTCGAT TGTCGGTCCA CGTCCGCACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG CGCCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAATGGCTG GCGTGGCGAA ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAGGG CTTCGTTAAC AAAGCGGCAT ATTGA
|
Protein sequence | MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP KPGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLLKRAEDIV LATLILLLIS PVLCCIALAV KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY
|
| |