Gene EcDH1_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1401 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1504412 
End bp1506064 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content51% 
IMG OID 
Productglycosyl transferase family 39 
Protein accessionACX39073 
Protein GI260448651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGG TACGTTACCT TATCGGCCTC TTCGCGTTTA TTGCCTGCTA TTACCTGTTA 
CCGATCAGCA CGCGTCTGCT CTGGCAACCC GATGAAACGC GTTATGCGGA AATCAGTCGA
GAAATGCTGG CATCCGGCGA CTGGATTGTG CCCCATCTGT TAGGGCTACG TTATTTCGAA
AAACCCATTG CCGGATACTG GATTAACAGC ATTGGGCAAT GGCTATTTGG CGCGAATAAC
TTTGGTGTGC GGGCAGGCGT TATCTTTGCG ACCCTGTTAA CTGCCGCGCT GGTGACCTGG
TTTACTCTGC GCTTATGGCG CGATAAACGT CTGGCTCTAC TCGCCACAGT AATTTATCTC
TCATTGTTTA TTGTCTATGC CATCGGCACT TATGCCGTGC TCGATCCGTT TATTGCATTC
TGGCTGGTGG CGGGAATGTG CAGCTTCTGG CTGGCAATGC AGGCACAGAC GTGGAAAGGC
AAAAGCGCAG GATTTTTACT GCTGGGAATC ACCTGCGGCA TGGGGGTGAT GACCAAAGGT
TTTCTCGCCC TTGCCGTGCC GGTATTAAGC GTGCTGCCAT GGGTAGCGAC GCAAAAACGC
TGGAAAGATC TCTTTATTTA CGGCTGGCTG GCGGTTATCA GTTGCGTACT GACGGTTCTC
CCTTGGGGAC TGGCGATAGC GCAGCGGGAG CCTAACTTCT GGCACTATTT TTTCTGGGTT
GAGCATATTC AACGCTTTGC ACTGGATGAT GCCCAACATA GAGCTCCGTT CTGGTACTAC
GTGCCGGTCA TCATTGCCGG TAGCCTGCCG TGGCTGGGAT TACTCCCCGG TGCACTGTAC
ACAGGCTGGA AAAACCGCAA GCATTCCGCA ACCGTCTATT TGTTGAGCTG GACGATAATG
CCGCTGCTGT TTTTCTCCGT CGCTAAAGGT AAATTGCCCA CCTATATTCT TTCCTGCTTT
GCATCTCTGG CAATGCTGAT GGCGCATTAC GCTTTGCTGG CAGCAAAAAA TAATCCTCTG
GCGCTGCGGA TTAATGGCTG GATTAACATC GCTTTTGGCG TCACTGGCAT TATTGCCACA
TTTGTGGTCT CCCCGTGGGG ACCAATGAAC ACGCCGGTGT GGCAAACCTT CGAGAGCTAT
AAAGTCTTTT GTGCCTGGTC GATTTTTTCG CTATGGGCAT TTTTCGGCTG GTACACCTTA
ACAAACGTCG AAAAGACCTG GCCTTTTGCC GCGCTTTGCC CGCTGGGGCT GGCGTTGCTG
GTAGGATTTT CAATTCCTGA CAGAGTTATG GAAGGAAAAC ATCCGCAATT TTTTGTCGAG
ATGACACAAG AATCACTTCA GCCAAGCCGC TATATTCTTA CTGATAGCGT CGGTGTTGCC
GCAGGTCTGG CATGGAGCCT GCAACGCGAT GACATCATCA TGTATCGCCA GACAGGTGAG
TTGAAATACG GCCTTAATTA TCCGGATGCG AAAGGGAGAT TTGTCAGCGG TGATGAGTTC
GCAAACTGGC TTAATCAACA TCGTCAGGAG GGGATTATTA CTCTCGTGCT TTCGGTTGAC
CGCGATGAAG ATATCAACAG TCTCGCCATT CCGCCCGCAG ATGCCATCGA TCGTCAGGAG
CGTCTGGTGC TGATTCAGTA TCGTCCCAAA TGA
 
Protein sequence
MKSVRYLIGL FAFIACYYLL PISTRLLWQP DETRYAEISR EMLASGDWIV PHLLGLRYFE 
KPIAGYWINS IGQWLFGANN FGVRAGVIFA TLLTAALVTW FTLRLWRDKR LALLATVIYL
SLFIVYAIGT YAVLDPFIAF WLVAGMCSFW LAMQAQTWKG KSAGFLLLGI TCGMGVMTKG
FLALAVPVLS VLPWVATQKR WKDLFIYGWL AVISCVLTVL PWGLAIAQRE PNFWHYFFWV
EHIQRFALDD AQHRAPFWYY VPVIIAGSLP WLGLLPGALY TGWKNRKHSA TVYLLSWTIM
PLLFFSVAKG KLPTYILSCF ASLAMLMAHY ALLAAKNNPL ALRINGWINI AFGVTGIIAT
FVVSPWGPMN TPVWQTFESY KVFCAWSIFS LWAFFGWYTL TNVEKTWPFA ALCPLGLALL
VGFSIPDRVM EGKHPQFFVE MTQESLQPSR YILTDSVGVA AGLAWSLQRD DIIMYRQTGE
LKYGLNYPDA KGRFVSGDEF ANWLNQHRQE GIITLVLSVD RDEDINSLAI PPADAIDRQE
RLVLIQYRPK