Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4013 |
Symbol | |
ID | 6064568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4411436 |
End bp | 4413010 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641603424 |
Product | ShET2 enterotoxin domain-containing protein |
Protein accession | YP_001726939 |
Protein GI | 170021985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0223767 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACTC GCATCCCTCG TAGTTCTTTC TCTGCAAATA TTAATAATAC AGCCCAGACA AATGAACACC AAACCCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC AAGGAGCTGG CGACGCCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA AAATGGTCGT CATATCTTTA GCAACAAGGA TTTTGTCATT AAATTTTCCA CGTCCGTCTT ACAAGCTGAT AAGAAAGAAA TTACGATAAT TAATAAAAAC GAAAACACGA CACTTACTCA AACCATTGCC CCAATATTTG AAGAATACCT AATGGAAATT TTACCTCAAC GCTCAGACAC TCTTGATAAA CAAGAATTAA ACCTAAAATC AGATAGAAAA GAAAAAGAAT TCCCAAGAAT TAAACTTAAT GGTCAATGTT ATTTTCCGGG GCGACCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA CAATATATTA ATGATATTTA TCAGAATGTT GATTACAAAC CCCATCAAGA TGATTACTCT TCAGCTGAAA AATTTCTCAC GCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG GTTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG AAAGCATATT TTGACAAGAT GGAATCAAAT GGCATCAGTG TTATGGCAGC CATATTACTG GTGGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTACC CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATTATGAGC GAAAGCAAAG AGGATATTAA ACACTATTCT CTGATGGATT TTATGAATGT AGATTATAGC CTCCTGAAAT GGTCAAATGA TCATGTTATT AACCAATCTG TTGCAATAAT TCCAGCACTT CCGAAAGAAC AGCTATTGAT GTTAAAAGGA TCTGTGGATG AAATAACCCC TCCATTATCA CCAGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAC GCAACTGATG ATTCAGCTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT GGACATATGA ACGTTATTAA TACTATTTTT AACGCATTAC CCACTCTGTT TAATACGTTT AAATTCGATA AAAAAAATAT GAAGCCCCTC CTCCTGGCAA ATAATTCTAA TGAATATCCC GGTTTGTTTT CAGCGATACA GCATAAACAA CAAAATGTTG TAGAGACGGT TTATCTTGCT TTATCTAACC ATGCACGCCT GTTTGGATTT ACCGCTGAAG ATATTATGGA TTTTTGGCAA CACAAAGCCC CACAAAAATA CTCTGCCTTT GAGTTGGCTT TTGAATTGGG TCACCGGGTT ATTGCTGAAT TAATCCTTAA TACATTAAAT AAGATGGCTG AAAGCTTTGG CTTTACGGAT AACCCTCGAT ACATTGCGGA GAAAAATTAT ATGGAAGCTT TACTCAAAAA AGCATCTCCC CATACCGTAC GCTAA
|
Protein sequence | MITRIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRQNGR HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA PIFEEYLMEI LPQRSDTLDK QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS SAEKFLTHFN KKCKNQTLAL VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL VDNHALTVRL RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKEDIKHYS LMDFMNVDYS LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI GQNHQLTQLM IQLQKMPELH RTEMLTAYNS GHMNVINTIF NALPTLFNTF KFDKKNMKPL LLANNSNEYP GLFSAIQHKQ QNVVETVYLA LSNHARLFGF TAEDIMDFWQ HKAPQKYSAF ELAFELGHRV IAELILNTLN KMAESFGFTD NPRYIAEKNY MEALLKKASP HTVR
|
| |