Gene EcolC_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0082 
Symbol 
ID6068385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp86633 
End bp87649 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content38% 
IMG OID641599486 
Productlipopolysaccharide 3-alpha-galactosyltransferase 
Protein accessionYP_001723095 
Protein GI170018141 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.313803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00265615 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCACTA AGACAATCAT CTTCGATGAA 
AGGCCAGCGG CGTCAGTGGC ATCATCATTC CATGTTGCTT ATGGCATTGA TAAAAACTTT
CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTTC ATAACAACGA CGTGAGTTTT
GTTTTCCACG TTTTTATTGA TGATATCCCT GAAGCCGATA TCCAGCGTTT AGCCCAATTG
GCGAAAAGCT ATCGTACCTG TATCCAGATC CATCTGGTAA ATTGTGAACG GCTTAAGGCA
TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC
TTTATTGATC AACAAGATAA GGTTCTTTAC CTGGATGCTG ATATCGCCTG TCAGGGAAAC
TTAAAGCCGC TGATAACAAT GGATCTTGCC AATAACATTG CTGCTGTTGT TACTGAACGC
GATGCTAACT GGTGGTCGTT ACGGGGTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGT
TACTTTAATT CAGGTGTCCT GTTAATTAAT ACACTAGCGT GGGCGCAGGA GTCCGTTTCT
GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCGTCGTTT CCCGTTTAAC CTATATGGAT
CAAGATATAC TTAATCTTAT CCTGTCAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT
ACGCAATTTA GTTTAAATTA TGAATTAAAA AAATCATTTG TTTGTCCAAT TAATGATGAA
ACCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CGGTTATCCA
AGTGCGCGAC CTTTTATCAA AGCCAAAGAG GCATCGCCCT GGAAAAATGA ACCGTTAATG
CGGCCAGTTA ACTCAAACTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAATAAA
CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
 
Protein sequence
MSAHYFNPQE MITKTIIFDE RPAASVASSF HVAYGIDKNF LFGCGVSITS VLFHNNDVSF 
VFHVFIDDIP EADIQRLAQL AKSYRTCIQI HLVNCERLKA LPTTKNWSIA MYFRFVIADY
FIDQQDKVLY LDADIACQGN LKPLITMDLA NNIAAVVTER DANWWSLRGQ SLQCNELEKG
YFNSGVLLIN TLAWAQESVS AKAMSMLADK AVVSRLTYMD QDILNLILSG KVKFIDAKYN
TQFSLNYELK KSFVCPINDE TVLIHYVGPT KPWHYWAGYP SARPFIKAKE ASPWKNEPLM
RPVNSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK