Gene EcDH1_1613 details


Gene Information

Locus tag: EcDH1_1613
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1760929
End bp: 1762149
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: ACX39278
Protein GI: 260448856
COG category:
COG ID:
TIGRFAM ID:
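
The length fields above follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1760929
END_BP = 1762149


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.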


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0151504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTC CCGCTGTCGT CAGAAACCTT CGTCCTTAAC 
CAAATTACCG CGTTTATTGA TATGGGATTT GAGGTAGAGA TTCTCGCGCT GCAAAAAGGC
GACACACAAA ACACCCACGC GGCATGGACG AAATACAACC TTGCTGCCAG AACCCGCTGG
TTACAGGACG AACCTACGGG CAAAGTGGCG AAACTGCGCC ACCGAGCCAG CCAGACCTTG
CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAACC TCAAACGCTA TGGTGCCGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCGCGCCGAT
GTGTTCATCG CTCATTTTGG TCCCGCGGGG GTAACCGCAG CAAAACTCCG CGAACTGGGT
GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAGCAA CTGTTTCGCC GTGGCGACCT GATGTTACCG
ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTAGA TATGACGCGC TTTAGCCCGC GTCCCGTGAA AGCGCCCGCA
ACGCCGCTGG AGATTATTTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG
ATCGAAGCCT GCCGTCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAAAGACG CCTGCGCACC CTCATCGAAC AATATCAACT GGAAGATGTG
GTGGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGTGATA TGGAAGGTAT TCCGGTGGCG
CTAATGGAAG CGATGGCGGT CGGTATTCCG GTGGTTTCTA CTCTGCATAG TGGAATACCG
GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AATTGGCTCC GGTCGTCAAA
CGCGCGCGCG AAAAAGTTGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC
AGCTTGCTGC AGGCTTTATA G
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEILALQKG DTQNTHAAWT KYNLAARTRW 
LQDEPTGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFRAD
VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL
AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
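
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in how internal codons are read), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; assumed valid for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above.
first_row = "ATGAAGGTCG GCTTCTTTTT ACTGAAATTC CCGCTGTCGT CAGAAACCTT CGTCCTTAAC"
print(translate(first_row))  # MKVGFFLLKFPLSSETFVLN
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.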