Gene EcDH1_0297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0297 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp333694 
End bp335778 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content55% 
IMG OID 
Product4-alpha-glucanotransferase 
Protein accessionACX37987 
Protein GI260447565 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000128437 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGCA AACGTCTGGA TAATGCCGCG CTGGCGGCGG GGATTAGCCC CAATTACATC 
AATGCCCACG GTAAACCGCA GTCGATTAGC GCCGAAACCA AACGGCGTTT GCTTGACGCG
ATGCATCAAC GTACCGCCAC GAAAGTGGCG GTAACGCCAG TCCCGAATGT CATGGTTTAT
ACCAGCGGCA AAAAAATGCC GATGGTGGTG GAGGGCAGCG GCGAATATAG CTGGCTGCTG
ACCACCGAAG AAGGAACGCA GTACAAAGGC CATGTAACGG GGGGCAAAGC GTTCAATCTA
CCGACGAAGC TGCCGGAAGG TTATCACACG CTGACACTCA CCCAGGACGA CCAGCGCGCG
CATTGCCGGG TGATTGTCGC CCCGAAACGC TGTTACGAAC CGCAGGCGTT GCTGAATAAA
CAAAAGCTGT GGGGTGCCTG CGTTCAGCTT TATACGCTGC GATCGGAAAA AAACTGGGGT
ATTGGGGATT TTGGCGATCT CAAAGCGATG CTGGTGGATG TGGCAAAACG TGGCGGGTCG
TTCATTGGCC TGAACCCGAT TCATGCGCTC TATCCGGCAA ATCCGGAGAG CGCCAGCCCA
TACAGCCCGT CTTCTCGCCG TTGGCTGAAT GTGATTTATA TCGACGTTAA CGCCGTTGAA
GATTTCCATC TTAGCGAAGA GGCTCAGGCC TGGTGGCAGT TGCCGACCAC GCAACAGACG
CTGCAACAGG CGCGCGATGC CGACTGGGTC GATTACTCCA CGGTTACCGC CCTAAAAATG
ACAGCATTAC GAATGGCGTG GAAAGGTTTC GCGCAACGTG ATGATGAGCA GATGGCCGCG
TTTCGCCAGT TTGTTGCAGA GCAGGGCGAC AGCCTGTTCT GGCAGGCAGC CTTTGATGCG
CTACATGCCC AGCAAGTGAA AGAGGACGAA ATGCGCTGGG GCTGGCCTGC ATGGCCAGAG
ATGTATCAGA ACGTGGATTC ACCAGAAGTG CGTCAGTTCT GCGAAGAACA TCGTGATGAC
GTCGATTTTT ATCTCTGGTT GCAGTGGCTG GCTTACAGCC AGTTTGCCGC CTGCTGGGAG
ATAAGCCAGG GCTATGAAAT GCCGATTGGC TTGTATCGTG ATCTGGCGGT TGGCGTAGCG
GAAGGTGGGG CGGAAACCTG GTGTGACCGT GAACTATATT GCCTGAAAGC ATCGGTTGGC
GCGCCGCCGG ATATCCTCGG CCCGTTGGGG CAGAACTGGG GATTACCGCC AATGGACCCG
CATATCATCA CCGCGCGTGC CTATGAACCG TTTATCGAGC TGTTGCGTGC CAATATGCAA
AACTGCGGCG CATTACGAAT TGACCATGTG ATGTCGATGC TGCGTTTGTG GTGGATACCG
TATGGCGAGA CGGCAGATCA GGGCGCGTAT GTTCACTATC CGGTGGATGA TCTGCTCTCG
ATTCTGGCAC TCGAAAGTAA ACGTCATCGC TGTATGGTGA TTGGTGAAGA TCTCGGTACC
GTACCGGTAG AGATTGTCGG TAAGCTGCGC AGCAGCGGTG TGTACTCTTA CAAAGTGCTC
TATTTCGAAA ACGACCACGA GAAGACGTTC CGTGCACCGA AAGCGTATCC GGAGCAGTCG
ATGGCGGTTG CGGCGACACA TGACCTGCCA ACGCTGCGCG GTTACTGGGA GTGCGGGGAT
CTAACGCTGG GCAAAACCCT GGGGCTGTAT CCGGATGAAG TGGTACTGCG CGGTCTGTAT
CAGGATCGCG AACTGGCGAA GCAAGGGCTG CTGGATGCAC TGCATAAATA TGGTTGTCTG
CCGAAACGTG CCGGGCATAA GGCATCGTTG ATGTCGATGA CGCCGACGCT GAACCGTGGT
TTGCAGCGCT ACATTGCCGA CAGTAACAGT GCTCTGTTAG GACTACAGCC GGAAGACTGG
CTGGATATGG CCGAACCGGT GAATATTCCT GGCACCAGTT ACCAGTATAA AAACTGGCGA
CGCAAGCTTT CCGCAACGCT TGAGTCGATG TTTGCCGATG ATGGCGTGAA CAAGTTGCTG
AAGGATTTGG ACAGACGGCG CAGAGCTGCA GCGAAGAAGA AGTAG
 
Protein sequence
MESKRLDNAA LAAGISPNYI NAHGKPQSIS AETKRRLLDA MHQRTATKVA VTPVPNVMVY 
TSGKKMPMVV EGSGEYSWLL TTEEGTQYKG HVTGGKAFNL PTKLPEGYHT LTLTQDDQRA
HCRVIVAPKR CYEPQALLNK QKLWGACVQL YTLRSEKNWG IGDFGDLKAM LVDVAKRGGS
FIGLNPIHAL YPANPESASP YSPSSRRWLN VIYIDVNAVE DFHLSEEAQA WWQLPTTQQT
LQQARDADWV DYSTVTALKM TALRMAWKGF AQRDDEQMAA FRQFVAEQGD SLFWQAAFDA
LHAQQVKEDE MRWGWPAWPE MYQNVDSPEV RQFCEEHRDD VDFYLWLQWL AYSQFAACWE
ISQGYEMPIG LYRDLAVGVA EGGAETWCDR ELYCLKASVG APPDILGPLG QNWGLPPMDP
HIITARAYEP FIELLRANMQ NCGALRIDHV MSMLRLWWIP YGETADQGAY VHYPVDDLLS
ILALESKRHR CMVIGEDLGT VPVEIVGKLR SSGVYSYKVL YFENDHEKTF RAPKAYPEQS
MAVAATHDLP TLRGYWECGD LTLGKTLGLY PDEVVLRGLY QDRELAKQGL LDALHKYGCL
PKRAGHKASL MSMTPTLNRG LQRYIADSNS ALLGLQPEDW LDMAEPVNIP GTSYQYKNWR
RKLSATLESM FADDGVNKLL KDLDRRRRAA AKKK