Gene Information |
Locus tag | EcHS_A2166 |
Symbol | |
ID | 5594718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2143116 |
End bp | 2144231 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640921299 |
Product | glycosyl transferase, group 1 family protein |
Protein accession | YP_001458838 |
Protein GI | 157161520 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
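The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon; the record's own numbers are consistent with both assumptions. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq (spaces ignored)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(2143116, 2144231)
print(length, protein_length(length))  # 1116 371
```

Passing the Gene sequence string from the Sequence table to gc_percent gives a value that can be compared with the reported GC content.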
Plasmid Coverage information |
Num covering plasmid clones | 76 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAGTTC TACACGTCTA TAAGACCTAC TATCCCGATA CCTACGGCGG TATTGAGCAG GTCATTTATC AGCTAAGTCA GGGCTGCGCC CGCCGGGGAA TCGCAGCCGA TGTTTTTACT TTTAGCCCGG ACAAAGAGAC AGGTCCTGTC GCCTACGAAG ACCATCGGGT CATTTATAAT AAGCAGCTTT TTGAAATTGC CTCCACGCCG TTTTCGTTGA AAGCGTTAAA GCGTTTTAAG CAGATTAAAG ATGATTACGA CATCATCAAC TACCATTTTC CGTTTCCTTT CATGGATATG TTGCATCTCT CGGCGCGGCC TGACGCAAGA ACGGTGGTGA CCTATCACTC GGATATTGTG AAACAAAAAC GGTTAATGAA GTTGTACCAG CCGCTGCAGG AGCGATTCCT CGCCAGCGTA GACTGCATCG TCGCCTCGTC GCCCAACTAC GTGGCCTCCA GCCAGACCCT GAAAAAATAT CAGGATAAAA CCGTGGTGAT CCCGTTTGGT CTGGAGCAGC ATGACGTGCA GCACGATCCG CAGCGGGTGG CGCACTGGCG GGAAACCGTC GGCGATAACT TCTTCCTCTT CGTCGGCGCT TTCCGCTACT ACAAAGGGCT GCACATTCTG CTGGATGCCG CCGAGCGTAG CCGGCTGCCG GTGGTGATCG TCGGGGGCGG GCCGCTGGAG GCGGAAGTGC GGCGTGAGGC GCAGCAACGC GGGCTGAGCA ATGTGGTGTT TACCGGCATG CTCAACGACG AAGATAAGTA CATTCTCTTC CAGCTCTGCC GGGGCGTGGT ATTCCCCTCG CATCTGCGCT CTGAGGCGTT TGGCATTACG TTATTGGAAG GCGCACGCTT TGCAAGGCCG CTGATCTCTT GCGAGATCGG TACAGGTACC TCTTTCATTA ACCAGGACAA AGTGAGTGGT TGCGTGATTC CGCCGAATGA TAGCCAGGCG CTGGTGGAGG CGATGAATGA GCTCTGGAAT AACGAGGAAA CCTCCAACCG CTATGGCGAA AACTCGCGTC GTCGTTTTGA AGAGATGTTT ACTGCCGACC ATATGATTGA CGCCTATGTC AATCTCTACA CTACATTGCT GGAAAGCAAA TCCTGA
|
Protein sequence | MRVLHVYKTY YPDTYGGIEQ VIYQLSQGCA RRGIAADVFT FSPDKETGPV AYEDHRVIYN KQLFEIASTP FSLKALKRFK QIKDDYDIIN YHFPFPFMDM LHLSARPDAR TVVTYHSDIV KQKRLMKLYQ PLQERFLASV DCIVASSPNY VASSQTLKKY QDKTVVIPFG LEQHDVQHDP QRVAHWRETV GDNFFLFVGA FRYYKGLHIL LDAAERSRLP VVIVGGGPLE AEVRREAQQR GLSNVVFTGM LNDEDKYILF QLCRGVVFPS HLRSEAFGIT LLEGARFARP LISCEIGTGT SFINQDKVSG CVIPPNDSQA LVEAMNELWN NEETSNRYGE NSRRRFEEMF TADHMIDAYV NLYTTLLESK S
|
| |
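The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this with the standard library only. It assumes the gene sequence is given in coding orientation (it starts with TTG and ends with TGA, although the gene lies on the minus strand), and translate_cds is an illustrative helper, not part of any database tool.

```python
from itertools import product

# Codon table in T, C, A, G order; table 11 assigns the same amino acids as the
# standard code and differs from it only in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # the initiating codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record
print(translate_cds("TTGAGAGTTC TACACGTCTA TAAGACCTAC"))  # MRVLHVYKTY
```

The output matches the first ten residues of the protein sequence listed in the record.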