Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3980 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4291983 |
End bp | 4294169 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | ShET2 enterotoxin domain protein |
Protein accession | ACX41580 |
Protein GI | 260451158 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACTC GTATTCCTCG TAGTTCTTTC TCTGCAAATA TTAATAATAC AGCCCAGACA AATGAACACC AAACCCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC AAGGAGCTGG CGACGCCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA AAATGGTCGT CATATCTTTA GCAACAAGGA TTTTGTCATT AAATTTTCCA CGTCCGTCTT ACAAGCTGAT AAGAAAGAAA TTACGATAAT TAATAAAAAC GAAAACACGA CACTTACTCA AACCATTGCC CCAATATTTG AAAAATACCT AATGGAAATT TTACCTCAAC GCTCAGACAC TCTTGATAAA CAAGAATTAA ACCTAAAATC AGATAGAAAA GAAAAAGAAT TCCCAAGAAT TAAACTTAAT GGTCAATGTT ATTTTCCGGG GCGACCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA CAATATATTA ATGATATTTA TCAGAATGTT GATTACAAAC CCCATCAAGA TGATTACTCT TCAGCTGAAA AATTTCTCAC GCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG GTTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG AAAGCATATT TTGACAAGAT GGAATCAAAT GGCATCAGTG TTATGGCAGC CATATTACTG GTGGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTACC CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATTATGAGC GAAAGCAAAG AGAATATTAA ACACTATTCT CTGATGGATT TTATGAATGT AGATTATAGC CTCCTGAAAT GGTCAAATGA TCATGTTATT AATCAATCTG TTGCAATAAT TCCAGCACTT CCGAAAGAAC AGCTATTGAT GTTAAAAGGA TCTGTGGATG AAATAACCCC TCCATTATCA CCTGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAC GCAACTGATG ATTCAACTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT ATTAACCTGC CAGGTTTATA TTTGGCTATA AATTATGGTA ATGCGGATAT CGTTGAGACT ATTTTCAATT CATTGTCAGA AACAGGATAT GAAGGATTAC TCTCGAAAAA AAATCTCATG CATATTCTGG AGGCAAAAGA TAAAAATGGT TTTTCTGGAT TATTTTTAGC GATATCACGT AAGGATAAAA ATGTTGTAAC CTCGATTCTG AACGCCTTAC CTAAACTGGC CGCAACACAT CATTTAGATA ACGAACAAGT GTATAAATTC CTGAGTGCCA AAAATAGAAC GTCCAGCCAT GTTTTATACC ATGTTATGGC GAATGGTGAT GCCGACATGC TGAAAATTGT TTTGAACGCG TTACCTTTGT TAATTCGCAC ATGTCATTTG ACTAAAGAAC AGGTACTCGA TCTCCTGAAG GCAAAGGATT TTTATGGTTG CCCAGGACTA TATCTGGCGA TGCAAAATGG ACATAGCGAT ATCGTGAAAG TTATTCTTGA AGCATTGCCC AGCCTAGCCC AGGAAATTAA CATTTCAGCT TCCGATATTG TAGATCTTCT GACCGCTAAA AGTCTTGCGC GCGACACGGG TTTGTTTATG GCCATGCAGC GCGGACACAT GAACGTTATT AATACTATTT TTAACGCATT ACCCACTCTG TTTAATACGT TTAAATTCGA TAAAAAAAAT ATGAAGCCCC TCCTCCTGGC AAATAATTCT AATGAATATC CCGGTTTGTT TTCAGCGATA CAGCATAAAC AACAAAATGT TGTAGAGACG GTTTATCTTG CTTTATCTGA CCATGCACGC CTGTTTGGAT TTACCGCTGA AGATATTATG GATTTTTGGC AACACAAAGC CCCACAAAAA TACTCTGCCT TTGAGTTGGC TTTTGAATTT GGTCACCGGG TTATTGCTGA ATTAATCCTT AATACATTAA ATAAGATGGC TGAAAGCTTT GGCTTTACGG ATAACCCTCG ATACATTGCG GAGAAAAATT ATATGGAAGC TTTACTCAAA AAAGCATCTC CCCATACCGT ACGCTAA
|
Protein sequence | MITRIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRQNGR HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA PIFEKYLMEI LPQRSDTLDK QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS SAEKFLTHFN KKCKNQTLAL VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL VDNHALTVRL RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKENIKHYS LMDFMNVDYS LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI GQNHQLTQLM IQLQKMPELH RTEMLTAYNS INLPGLYLAI NYGNADIVET IFNSLSETGY EGLLSKKNLM HILEAKDKNG FSGLFLAISR KDKNVVTSIL NALPKLAATH HLDNEQVYKF LSAKNRTSSH VLYHVMANGD ADMLKIVLNA LPLLIRTCHL TKEQVLDLLK AKDFYGCPGL YLAMQNGHSD IVKVILEALP SLAQEINISA SDIVDLLTAK SLARDTGLFM AMQRGHMNVI NTIFNALPTL FNTFKFDKKN MKPLLLANNS NEYPGLFSAI QHKQQNVVET VYLALSDHAR LFGFTAEDIM DFWQHKAPQK YSAFELAFEF GHRVIAELIL NTLNKMAESF GFTDNPRYIA EKNYMEALLK KASPHTVR
|
| |