Gene EcolC_0087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0087 
Symbol 
ID6068621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp91528 
End bp92781 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content33% 
IMG OID641599491 
ProductO-antigen polymerase 
Protein accessionYP_001723100 
Protein GI170018146 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0556509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.17743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT 
ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA
GCTGGAATTA TATCTGCATT GTATTTTCTT GTGACACCAA AAAAAACAAT AACTAATAAT
CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT
TCACATTATA AAATTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACT
GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG
AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT
TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA
GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT
CTCATTTTAA AATCCGACTT TAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT
AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT
GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT
TGTTTTATTA CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG
AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA
GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA
TCACTGGAAA AACGTGCGGA AAAAATACAT GAACTAGAAG AAAAAGAGCC TAGATTGAGT
GGCGCTTTAC CCTATGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG
CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC
TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGA
CTAAGTGATG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTAT CACCATAATA
TTGCTTTGTG CTTATTTTAA AGCACAATCG GACCAATGTT TATTAGAGAA GTAA
 
Protein sequence
MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL VTPKKTITNN 
PTLLIFISLC LLGIVNIIWY SHYKISGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM
RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI
LILKSDFRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY
CFITIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ
SLEKRAEKIH ELEEKEPRLS GALPYVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA
LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFITII LLCAYFKAQS DQCLLEK