Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3833 |
Symbol | |
ID | 5593283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3828575 |
End bp | 3829828 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640922945 |
Product | O-antigen polymerase |
Protein accession | YP_001460423 |
Protein GI | 157163105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000000244631 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA GCTGGAATTA TATCTGCATT GTATTTTCTT GTGACACCAA AAAAAACAAT AACTAATAAT CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT TCACATTATA AAATTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACT GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT CTCATTTTAA AATCCGACTT TAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT TGTTTTATTA CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA TCACTGGAAA AACGTGCGGA AAAAATACAT GAACTAGAAG AAAAAGAGCC TAGATTGAGT GGCGCTTTAC CCTATGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGA CTAAGTGACG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTAT CACCATAATA TTGCTTTGTG CTTATTTTAA AGCACAATCG GACCAATGTT TATTAGAGAA GTAA
|
Protein sequence | MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL VTPKKTITNN PTLLIFISLC LLGIVNIIWY SHYKISGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI LILKSDFRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY CFITIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ SLEKRAEKIH ELEEKEPRLS GALPYVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFITII LLCAYFKAQS DQCLLEK
|
| |