Gene Information |
Locus tag | EcDH1_3244 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3487020 |
End bp | 3488216 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | ACX40868 |
Protein GI | 260450446 |
COG category | |
COG ID | |
TIGRFAM ID | |
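The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = feature_length(3487020, 3488216)
print(gene_len)                           # 1197
print(expected_protein_length(gene_len))  # 398
```

Both results agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields in the record.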
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCT GGATATTTAT CTGTATGTCC ATAGCAATGT TGCTATGGTT TTTAAGTACG CTAAGACGTA AACCCAGTCA AAAGAAAGGC TGTATTGACG CCATTATACC TGCGTATAAC GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTACTGC GTAACCCTTA TTTTTGCCGG GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA CGCAAATGGG GCGACCGCTT TGTTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG CTGATGAATG GCCTCAATTA CGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATTGA GCGCGGTGCC GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGTCTGTT ACCGCACATC CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGT GGCGCACCGT TTATTATCAG CGGTGCCTGC GGGATGTTCC GTACTGATGT ATTGCGTAAG TTCGGTTTCT CGGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTGGTGGCA AACGGCTACC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC CCGCGTGAGG AGTGGCGTCG CTGGCGGCGT TGGATTGTGG GATACGCGGT CTGTATGCGC CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC GGGCCGCATG GAGTGGTGTT GGCAATGTTT CCGCTTATCT GGGTCGGCGT AGTTTGTGTT ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTATCTGA AGCTTAA
|
Protein sequence | MKTWIFICMS IAMLLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR VICVNDGSTD NTEAVMAEVK RKWGDRFVAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT GPHGVVLAMF PLIWVGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA
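The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the alternative start codons that table 11 permits, since this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: each codon position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAACCT GGATATTTAT CTGTATGTCC"))  # MKTWIFICMS
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` in the same way would be the natural next check.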