Gene EcDH1_3244 details

Gene Information

Locus tag: EcDH1_3244
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3487020
End bp: 3488216
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: ACX40868
Protein GI: 260450446
COG category:
COG ID:
TIGRFAM ID:
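
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3487020
end_bp = 3488216
gene_length_bp = 1197
protein_length_aa = 398


def span_length(start, end):
    """Length of the interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_len_bp):
    """Number of encoded amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length_aa(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3488216 - 3487020 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, which agrees with the listed Gene Length and Protein Length.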


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCT GGATATTTAT CTGTATGTCC ATAGCAATGT TGCTATGGTT TTTAAGTACG 
CTAAGACGTA AACCCAGTCA AAAGAAAGGC TGTATTGACG CCATTATACC TGCGTATAAC
GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTACTGC GTAACCCTTA TTTTTGCCGG
GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA
CGCAAATGGG GCGACCGCTT TGTTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG
CTGATGAATG GCCTCAATTA CGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC
TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATTGA GCGCGGTGCC
GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGTCTGTT ACCGCACATC
CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGT
GGCGCACCGT TTATTATCAG CGGTGCCTGC GGGATGTTCC GTACTGATGT ATTGCGTAAG
TTCGGTTTCT CGGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTGGTGGCA
AACGGCTACC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC
CCGCGTGAGG AGTGGCGTCG CTGGCGGCGT TGGATTGTGG GATACGCGGT CTGTATGCGC
CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG
GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC
GGGCCGCATG GAGTGGTGTT GGCAATGTTT CCGCTTATCT GGGTCGGCGT AGTTTGTGTT
ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT
TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT
TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG
TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTATCTGA AGCTTAA
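
The GC content field of this record could be recomputed from the gene sequence printed above. The following is a minimal sketch under the assumption that the sequence is pasted in as text with its 10-base groups separated by whitespace. The helper names are hypothetical, and the short example string is only the first two groups of the gene, not the full sequence.

```python
def clean_sequence(text):
    """Remove the whitespace between 10-base groups and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of bases that are G or C in a (possibly whitespace-grouped) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base groups of the gene sequence, used as a small example.
example = "ATGAAAACCT GGATATTTAT"
print(gc_fraction(example))
```

Passing the full gene sequence in place of `example` would give the value to compare against the reported GC content.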
 
Protein sequence
MKTWIFICMS IAMLLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR 
VICVNDGSTD NTEAVMAEVK RKWGDRFVAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT
YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG
GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS
PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT
GPHGVVLAMF PLIWVGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF
FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA