Gene EcE24377A_4125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4125 
Symbol 
ID5589715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4114930 
End bp4116183 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content33% 
IMG OID640927744 
ProductO-antigen polymerase 
Protein accessionYP_001465104 
Protein GI157156511 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.314741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT 
ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA
GCTGGAATTA TATCTGCATT GTATTTTCTT GCGACACCAA AAAAAACAAT AACTAATAAT
CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT
TCACATTATA AAGTTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACT
GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG
AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT
TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA
GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT
CTCATTTTAA AATCCGACTT AAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT
AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT
GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT
TGTTTTATTA CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG
AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA
GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA
TCACTGGAAA AACGTGCGGA AAAAATACAT GAACTAGAAG AAAAAGAGCC TAGATTGAGT
GGCGCTTTAC CCTATGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG
CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC
TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGA
CTAAGTGATG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTAT CACCATAATA
TTGCTTTGTG CTTATTTTAA AGCACAATCG GACCAATGTT TATTAGAGAA GTAA
 
Protein sequence
MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL ATPKKTITNN 
PTLLIFISLC LLGIVNIIWY SHYKVSGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM
RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI
LILKSDLRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY
CFITIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ
SLEKRAEKIH ELEEKEPRLS GALPYVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA
LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFITII LLCAYFKAQS DQCLLEK