Gene Information |
Locus tag | EcolC_0079 |
Symbol | |
ID | 6068372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 83684 |
End bp | 84706 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641599483 |
Product | lipopolysaccharide core biosynthesis protein |
Protein accession | YP_001723092 |
Protein GI | 170018138 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02201] lipopolysaccharide heptosyltransferase III, putative |
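The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(83684, 84706)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

With the values from this record, the result matches the listed Gene Length (1023 bp) and Protein Length (340 aa).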
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00999711 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000104869 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTTTTC ATGGGGATAT GTTATTAACT ACTCCCGTCA TTAGTTCGCT GAAAAAAAAT TACCCTGACG CAAAAATCGA TGTGCTGCTT TATCAGGACA CCATCCCGAT CCTGTCTGAA AATCCAGAGA TTAACGCGCT CTACGGCATA AAAAATAAAA AAGCAAAAGC CTCAGAAAAA ATTGCCAACT TTTTTCATCT CATCAAGGTA TTACGTGCCA ATAAGTATGA CCTTATCGTC AATCTTACCG ATCAATGGAT GGTTGCTATA CTGGTTCGCT TATTAAATGC CCGTGTGAAA ATTTCCCAGG ATTATCATCA TCGGCAGTCT GCTTTTTGGC GTAAAAGTTT CACCCATTTG GTGCCGTTGC AGGGTGGAAA TGTGGTGGAA AGTAACTTAT CCGTGCTGAC ACCATTGGGA CTTGATTCGT TGGTGAAGCA GACAACCATG AGTTACCCGC CTGCAAGCTG GAAACGTATG CGTCGCGAAC TTGATCACGC TGGTGTTGGA CAAAATTATG TGGTTATCCA ACCTACGGCG CGGCAAATCT TCAAATGCTG GGACAACGCC AAGTTTTCCG CTGTGATTGA TGCCTTACAT GCTCGTGGTT ATGAAGTCGT TCTGACGTCC GGCCCAGATA AAGACGATCT GGCCTGCGTC AATGAAATTG CGCAGGGATG CCAGACGCCA CCAGTAACGG CGCTGGCTGG AAAGGTGACC TTCCCGGAAC TTGGTGCGTT AATCGATCAT GCGCAGCTAT TTATTGGCGT TGATTCCGCA CCGGCGCATA TTGCCGCAGC AGTTAATACG CCGCTGATAT CGCTGTTTGG TGCGACAGAC CATATTTTCT GGCGTCCCTG GTCAAATAAC ATGATTCAAT TCTGGGCGGG AGATTACCGG GAAATGCCAA CGCGCGATCA GCGTGACCGA AATGAGATGT ATCTTTCGGT TATTCCGGCG GCAGATGTCA TTGCTGCTGT CGATAAATTA CTGCCCTCCT CCACGACAGG TACGTCGTTA TGA
Protein sequence | MRFHGDMLLT TPVISSLKKN YPDAKIDVLL YQDTIPILSE NPEINALYGI KNKKAKASEK IANFFHLIKV LRANKYDLIV NLTDQWMVAI LVRLLNARVK ISQDYHHRQS AFWRKSFTHL VPLQGGNVVE SNLSVLTPLG LDSLVKQTTM SYPPASWKRM RRELDHAGVG QNYVVIQPTA RQIFKCWDNA KFSAVIDALH ARGYEVVLTS GPDKDDLACV NEIAQGCQTP PVTALAGKVT FPELGALIDH AQLFIGVDSA PAHIAAAVNT PLISLFGATD HIFWRPWSNN MIQFWAGDYR EMPTRDQRDR NEMYLSVIPA ADVIAAVDKL LPSSTTGTSL
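The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the input is assumed to contain only A, C, G and T, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in which start codons are permitted).

```python
from itertools import product

# Standard amino acid assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGCGTTTTC ATGGGGATAT GTTATTAACT"))  # MRFHGDMLLT
```

The translated prefix agrees with the first block of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.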