Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_0297 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 333694 |
End bp | 335778 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | 4-alpha-glucanotransferase |
Protein accession | ACX37987 |
Protein GI | 260447565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000128437 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCA AACGTCTGGA TAATGCCGCG CTGGCGGCGG GGATTAGCCC CAATTACATC AATGCCCACG GTAAACCGCA GTCGATTAGC GCCGAAACCA AACGGCGTTT GCTTGACGCG ATGCATCAAC GTACCGCCAC GAAAGTGGCG GTAACGCCAG TCCCGAATGT CATGGTTTAT ACCAGCGGCA AAAAAATGCC GATGGTGGTG GAGGGCAGCG GCGAATATAG CTGGCTGCTG ACCACCGAAG AAGGAACGCA GTACAAAGGC CATGTAACGG GGGGCAAAGC GTTCAATCTA CCGACGAAGC TGCCGGAAGG TTATCACACG CTGACACTCA CCCAGGACGA CCAGCGCGCG CATTGCCGGG TGATTGTCGC CCCGAAACGC TGTTACGAAC CGCAGGCGTT GCTGAATAAA CAAAAGCTGT GGGGTGCCTG CGTTCAGCTT TATACGCTGC GATCGGAAAA AAACTGGGGT ATTGGGGATT TTGGCGATCT CAAAGCGATG CTGGTGGATG TGGCAAAACG TGGCGGGTCG TTCATTGGCC TGAACCCGAT TCATGCGCTC TATCCGGCAA ATCCGGAGAG CGCCAGCCCA TACAGCCCGT CTTCTCGCCG TTGGCTGAAT GTGATTTATA TCGACGTTAA CGCCGTTGAA GATTTCCATC TTAGCGAAGA GGCTCAGGCC TGGTGGCAGT TGCCGACCAC GCAACAGACG CTGCAACAGG CGCGCGATGC CGACTGGGTC GATTACTCCA CGGTTACCGC CCTAAAAATG ACAGCATTAC GAATGGCGTG GAAAGGTTTC GCGCAACGTG ATGATGAGCA GATGGCCGCG TTTCGCCAGT TTGTTGCAGA GCAGGGCGAC AGCCTGTTCT GGCAGGCAGC CTTTGATGCG CTACATGCCC AGCAAGTGAA AGAGGACGAA ATGCGCTGGG GCTGGCCTGC ATGGCCAGAG ATGTATCAGA ACGTGGATTC ACCAGAAGTG CGTCAGTTCT GCGAAGAACA TCGTGATGAC GTCGATTTTT ATCTCTGGTT GCAGTGGCTG GCTTACAGCC AGTTTGCCGC CTGCTGGGAG ATAAGCCAGG GCTATGAAAT GCCGATTGGC TTGTATCGTG ATCTGGCGGT TGGCGTAGCG GAAGGTGGGG CGGAAACCTG GTGTGACCGT GAACTATATT GCCTGAAAGC ATCGGTTGGC GCGCCGCCGG ATATCCTCGG CCCGTTGGGG CAGAACTGGG GATTACCGCC AATGGACCCG CATATCATCA CCGCGCGTGC CTATGAACCG TTTATCGAGC TGTTGCGTGC CAATATGCAA AACTGCGGCG CATTACGAAT TGACCATGTG ATGTCGATGC TGCGTTTGTG GTGGATACCG TATGGCGAGA CGGCAGATCA GGGCGCGTAT GTTCACTATC CGGTGGATGA TCTGCTCTCG ATTCTGGCAC TCGAAAGTAA ACGTCATCGC TGTATGGTGA TTGGTGAAGA TCTCGGTACC GTACCGGTAG AGATTGTCGG TAAGCTGCGC AGCAGCGGTG TGTACTCTTA CAAAGTGCTC TATTTCGAAA ACGACCACGA GAAGACGTTC CGTGCACCGA AAGCGTATCC GGAGCAGTCG ATGGCGGTTG CGGCGACACA TGACCTGCCA ACGCTGCGCG GTTACTGGGA GTGCGGGGAT CTAACGCTGG GCAAAACCCT GGGGCTGTAT CCGGATGAAG TGGTACTGCG CGGTCTGTAT CAGGATCGCG AACTGGCGAA GCAAGGGCTG CTGGATGCAC TGCATAAATA TGGTTGTCTG CCGAAACGTG CCGGGCATAA GGCATCGTTG ATGTCGATGA CGCCGACGCT GAACCGTGGT TTGCAGCGCT ACATTGCCGA CAGTAACAGT GCTCTGTTAG GACTACAGCC GGAAGACTGG CTGGATATGG CCGAACCGGT GAATATTCCT GGCACCAGTT ACCAGTATAA AAACTGGCGA CGCAAGCTTT CCGCAACGCT TGAGTCGATG TTTGCCGATG ATGGCGTGAA CAAGTTGCTG AAGGATTTGG ACAGACGGCG CAGAGCTGCA GCGAAGAAGA AGTAG
|
Protein sequence | MESKRLDNAA LAAGISPNYI NAHGKPQSIS AETKRRLLDA MHQRTATKVA VTPVPNVMVY TSGKKMPMVV EGSGEYSWLL TTEEGTQYKG HVTGGKAFNL PTKLPEGYHT LTLTQDDQRA HCRVIVAPKR CYEPQALLNK QKLWGACVQL YTLRSEKNWG IGDFGDLKAM LVDVAKRGGS FIGLNPIHAL YPANPESASP YSPSSRRWLN VIYIDVNAVE DFHLSEEAQA WWQLPTTQQT LQQARDADWV DYSTVTALKM TALRMAWKGF AQRDDEQMAA FRQFVAEQGD SLFWQAAFDA LHAQQVKEDE MRWGWPAWPE MYQNVDSPEV RQFCEEHRDD VDFYLWLQWL AYSQFAACWE ISQGYEMPIG LYRDLAVGVA EGGAETWCDR ELYCLKASVG APPDILGPLG QNWGLPPMDP HIITARAYEP FIELLRANMQ NCGALRIDHV MSMLRLWWIP YGETADQGAY VHYPVDDLLS ILALESKRHR CMVIGEDLGT VPVEIVGKLR SSGVYSYKVL YFENDHEKTF RAPKAYPEQS MAVAATHDLP TLRGYWECGD LTLGKTLGLY PDEVVLRGLY QDRELAKQGL LDALHKYGCL PKRAGHKASL MSMTPTLNRG LQRYIADSNS ALLGLQPEDW LDMAEPVNIP GTSYQYKNWR RKLSATLESM FADDGVNKLL KDLDRRRRAA AKKK
|
| |