Gene EcolC_0079 details

Gene Information

Locus tag: EcolC_0079
Symbol:
ID: 6068372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 83684
End bp: 84706
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 47%
IMG OID: 641599483
Product: lipopolysaccharide core biosynthesis protein
Protein accession: YP_001723092
Protein GI: 170018138
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02201] lipopolysaccharide heptosyltransferase III, putative
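
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 83684
end_bp = 84706


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp):
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 1023 340
```

Both results agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields reported in the record.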


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00999711
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000104869
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTTTTC ATGGGGATAT GTTATTAACT ACTCCCGTCA TTAGTTCGCT GAAAAAAAAT 
TACCCTGACG CAAAAATCGA TGTGCTGCTT TATCAGGACA CCATCCCGAT CCTGTCTGAA
AATCCAGAGA TTAACGCGCT CTACGGCATA AAAAATAAAA AAGCAAAAGC CTCAGAAAAA
ATTGCCAACT TTTTTCATCT CATCAAGGTA TTACGTGCCA ATAAGTATGA CCTTATCGTC
AATCTTACCG ATCAATGGAT GGTTGCTATA CTGGTTCGCT TATTAAATGC CCGTGTGAAA
ATTTCCCAGG ATTATCATCA TCGGCAGTCT GCTTTTTGGC GTAAAAGTTT CACCCATTTG
GTGCCGTTGC AGGGTGGAAA TGTGGTGGAA AGTAACTTAT CCGTGCTGAC ACCATTGGGA
CTTGATTCGT TGGTGAAGCA GACAACCATG AGTTACCCGC CTGCAAGCTG GAAACGTATG
CGTCGCGAAC TTGATCACGC TGGTGTTGGA CAAAATTATG TGGTTATCCA ACCTACGGCG
CGGCAAATCT TCAAATGCTG GGACAACGCC AAGTTTTCCG CTGTGATTGA TGCCTTACAT
GCTCGTGGTT ATGAAGTCGT TCTGACGTCC GGCCCAGATA AAGACGATCT GGCCTGCGTC
AATGAAATTG CGCAGGGATG CCAGACGCCA CCAGTAACGG CGCTGGCTGG AAAGGTGACC
TTCCCGGAAC TTGGTGCGTT AATCGATCAT GCGCAGCTAT TTATTGGCGT TGATTCCGCA
CCGGCGCATA TTGCCGCAGC AGTTAATACG CCGCTGATAT CGCTGTTTGG TGCGACAGAC
CATATTTTCT GGCGTCCCTG GTCAAATAAC ATGATTCAAT TCTGGGCGGG AGATTACCGG
GAAATGCCAA CGCGCGATCA GCGTGACCGA AATGAGATGT ATCTTTCGGT TATTCCGGCG
GCAGATGTCA TTGCTGCTGT CGATAAATTA CTGCCCTCCT CCACGACAGG TACGTCGTTA
TGA
 
Protein sequence
MRFHGDMLLT TPVISSLKKN YPDAKIDVLL YQDTIPILSE NPEINALYGI KNKKAKASEK 
IANFFHLIKV LRANKYDLIV NLTDQWMVAI LVRLLNARVK ISQDYHHRQS AFWRKSFTHL
VPLQGGNVVE SNLSVLTPLG LDSLVKQTTM SYPPASWKRM RRELDHAGVG QNYVVIQPTA
RQIFKCWDNA KFSAVIDALH ARGYEVVLTS GPDKDDLACV NEIAQGCQTP PVTALAGKVT
FPELGALIDH AQLFIGVDSA PAHIAAAVNT PLISLFGATD HIFWRPWSNN MIQFWAGDYR
EMPTRDQRDR NEMYLSVIPA ADVIAAVDKL LPSSTTGTSL
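
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA with the codon assignments of translation table 11. The following is a minimal sketch in plain Python with no external library. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here. The sketch assumes the input is the coding strand in frame, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise).

```python
# Codon assignments in the NCBI layout (bases ordered T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTTTTC ATGGGGATAT GTTATTAACT"))  # prints: MRFHGDMLLT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.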