Gene EcolC_0083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0083 
Symbol 
ID6068355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp87666 
End bp88661 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content36% 
IMG OID641599487 
Productlipopolysaccharide glucosyltransferase I 
Protein accessionYP_001723096 
Protein GI170018142 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.110554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00261992 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGCC 
CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA
ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT
TATGTGGATG ATGATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA
ATTATTGTAT ATTTAATTGA CCCCAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG
TCGTACGCGA CATACTTCAG GGTATTGTCT TTTGAATATC TGAGTGAAAG TATTTCCACA
CTGCTGTATC TGGATGCCGA TGTTGTTTGT AAAGGAAGCC TGAAACCTCT CACAGAAATT
ATATTTAAAG ATGAGTTTGC TGCGGTCATT CCTGACAATG ATAGTACGCA GGCGGCATGT
GCAAAACGCC TCAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT
GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAA GCTTTTACGA
GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT
AATATGAATA ATATCTACCT CGGGAAGGAT TTTGATACTA TTTATACCCT GAAAAATGAA
CTTCATGATC GTAGTCATCG AAAGTTTCAG CAAACCATTA CCGATAAAAC AGTATTGATT
CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC
TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT
GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATTAAAGGC
ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
 
Protein sequence
MNEFIKERFS YLADNKKENA PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD 
YVDDDYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST
LLYLDADVVC KGSLKPLTEI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY
VNLKKWHEAN LTPYLLKLLR GETKYGSLKY LDQDALNIAF NMNNIYLGKD FDTIYTLKNE
LHDRSHRKFQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART
VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K