Gene EcHS_A3833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3833 
Symbol 
ID5593283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3828575 
End bp3829828 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content33% 
IMG OID640922945 
ProductO-antigen polymerase 
Protein accessionYP_001460423 
Protein GI157163105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000000244631 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT 
ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA
GCTGGAATTA TATCTGCATT GTATTTTCTT GTGACACCAA AAAAAACAAT AACTAATAAT
CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT
TCACATTATA AAATTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACT
GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG
AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT
TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA
GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT
CTCATTTTAA AATCCGACTT TAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT
AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT
GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT
TGTTTTATTA CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG
AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA
GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA
TCACTGGAAA AACGTGCGGA AAAAATACAT GAACTAGAAG AAAAAGAGCC TAGATTGAGT
GGCGCTTTAC CCTATGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG
CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC
TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGA
CTAAGTGACG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTAT CACCATAATA
TTGCTTTGTG CTTATTTTAA AGCACAATCG GACCAATGTT TATTAGAGAA GTAA
 
Protein sequence
MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL VTPKKTITNN 
PTLLIFISLC LLGIVNIIWY SHYKISGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM
RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI
LILKSDFRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY
CFITIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ
SLEKRAEKIH ELEEKEPRLS GALPYVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA
LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFITII LLCAYFKAQS DQCLLEK