Gene Information |
Locus tag | EcE24377A_2261 |
Symbol | |
ID | 5588891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2222950 |
End bp | 2229840 |
Gene Length | 6891 bp |
Protein Length | 2296 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640925927 |
Product | putative invasin |
Protein accession | YP_001463325 |
Protein GI | 157155108 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
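The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the Gene Length includes a single terminal stop codon; the record does not state either convention explicitly, but both are consistent with its numbers. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and does not encode a residue.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 2222950, 2229840
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length (6891 bp) and Protein Length (2296 aa) fields.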
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000213417 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAAATGC CAATACGGTG TCCTACACCC TTGGAGCGCT GGAAATCGGC CCAAAGCGTT GCCGAACGTT TCGGTATTTC GGTGGCTGAG TTACGCAAAC TCAACCAGTT TCGTACGTTT GCTCGAGGTT TTGATAATGT CCGCCAGGGT GATGAACTGG ATGTCCCGGC ACAAGTTAGT GAAAATAATT TAACCCCGCC ACCGGGTAAT AGCAGCGGCA ACCTTGAGCA ACAGATAGCC AGTACTTCAC AGCAAATCGG GTCTCTGCTC GCCGAGGATA TGAACAGCGA GCAAGCGGCA AATATGGCGC GTGGATGGGC CTCTTCTCAG GCTTCAGGCG CAATGACAGA CTGGTTAAGC CGCTTCGGTA CCGCAAGAAT CACGCTGGGC GTGGATGAAG ATTTTAGCCT GAAGAACTCC CAGTTCGATT TTCTCCATCC GTGGTATGAA ACGCCTGATA ATCTCTTTTT CAGTCAGCAT ACTCTCCATC GTACTGACGA GCGTACACAG ATTAACAACG GCTTAGGTTG GCGTCATTTC ACTCCCACAT GGATGTCGGG CATCAACTTC TTTTTCGACC ACGATCTTAG CCGTTACCAC TCCCGCGCCG GCATTGGCGC GGAGTACTGG CGCGACTATC TAAAATTAAG CAGTAACGGC TATTTGCGAC TGACCAACTG GCGCAGCGCA CCTGAACTGG ACAACGATTA TGAAGCACGC CCGGCCAATG GCTGGGATGT ACGCGCAGAA GGCTGGCTAC CCGCCTGGCC GTACCTTGGC GGTAAACTGG TCTATGAACA GTATTATGGC GATGAAGTGG CCCTGTTCGA TAAAGACGAT CGGCAAAGTA ATCCTCATGC CATAACCGCT GGACTTAACT ATACCCCCTT CCCGCTAATG ACCTTCAGCG CGGAGCAACG CCAGGGTAAA CAGGGCGAAA ATGACACCCG TTTTGCCGTC GATTTTACCT GGCAACCTGG CAGCGCAATG CAGAAACAGC TTGACCCGAA TGAAGTCGCT GCACGGCGTA GCCTTGCAGG CAGCCGTTAT GATCTGGTGG ATCGCAACAA CAACATCGTT CTGGAATATC GCAAAAAAGA ACTGGTCCGC CTGACCCTGA CAGACCCCGT GACAGGGAAG TCAGGAGAAG TGAAATCACT GGTTTCGTCG CTACAAACCA AATATGCCCT GAAAGGCTAT AACGTCGAAG CCACCGCACT GGAAGCTGCC GGTGGCAAAG TGGTCACAAC GGGTAAAGAT ATTCTGGTTA CCCTGCCGGG TTACCGGTTC ACCAGTACGC CAGAAACCGA TAACACCTGG CCGATTGAAG TCACCGCTGA AGATGTCAAA GGCAATTTGT CGAATCGTGA ACAAAGCATG GTGGTCGTTC AGGCACCTAC GCTAAGCCAG AAAGATTCCT CGGTATCGTT AAGTACCCAA ACATTGAACG CGGATTCCCA TTCAACCGCC ACACTGACTT TTATTGCGCA TGATGCAGCA GGTAATCCTG TTGTCGGGCT GGTGCTCTCG ACGCGTCACG AAGGTGTTCA GGACATCACC CTTTCTGAAT GGAAAGATAA TGGTGACGGA AGCTATACCC AGATCCTGAC CACAGGAGCG ATGTCTGGCA CGCTGACGCT GATGCCACAG CTGAATGGTG TGGATGCGGC TAAAGCCCCC GCCGTGGTGA ATATCATTTC TATTTCGTCA TCCCGAACTC ACTCGTCAAT TAAAATTGAT AAAGACCGTT ATCTCTCCGG CAATCCTATC GAGGTGACGG TAGAACTGAG AGATGAAAAT 
GACAAACCTG TTAAGGAGCA AAAACAGCAA CTGAATAACG CAGTCAGCAT CGACAACGTG AAACCTGGTG TCACTACAGA CTGGAAAGAA ACCGCAGATG GCGTCTATAA GGCAACCTAT ACCGCCTATA CCAGAGGCAG TGGGCTTACT GCGAAGCTGT TAATGCAAAA CTGGAATGAA GATTTGCATA CCGCTGGATT TATCATCGAC GCCAACCCGC AGTCAGCAAA AATTGCGACA TTATCTGCCA GCAATAATGG TGTGCTCGCC AATGAGAATG CAGCAAACAC CGTCTCGGTC AATGTCGCTG ATGAAGGAAG CAACCCAATC AATGATCATA CCGTCACGTT TGCGGTATTA AGCGGGTCGG CAACTTGCTT TAACAATCAA AACACCGCAA AAACGGATGT TAATGGTCTG GCGACTTTTG ATCTGAAAAG TAGTAAGCAG GAAGACAACA CGGTTGAAGT CACCCTTGAA AATGGCGTGA AACAAACGTT AATTGTCAGT TTTGTCGGCG ACTCGAGTAC CGCGCAGGTT GATCTGCAGA AGTCGAAAAA TGAAGTGGTC GCTGACGGCA ATGATAGCGC CACAATGACC GCGACAGTCC GGGATGCAAA AGGCAACCTG CTCAATGACG TCAAGGTCAC TTTCAATGTT AATTCAGCAG CAGCGAAACT GAGCCAAACC GAAGTGAATA GCCACGACGG GATCGCCACA GCTACGCTGA CCAGTTTGAA AAATGGTGAT TATAGGGTTA CGGCCTCTGT GAGCTCTGGT TCTCAGGCTA ATCAACAGGT GATTTTTATC GGTGATCAAA GTACTGCTGC CCTGACCCTC AGTGTGCCTT CAGGTGATAT CACCGTCACC AACACAGCTC CGCTACATAT GACTGCAACC TTGCAGGATA AAAATGGCAA CCCATTAAAA GATAAAGAAA TCACCTTCTC TGTGCCAAAC GACGTCGCAA GTCGGTTCTC GATTAGCAAC AGCGGAAAAG GCATGACGGA TAGCAACGGG ACTGCAATCG CCTCCCTGAC CGGCACGTTA GCGGGCACGC ATATGATCAC GGCTCGTCTG GCTAACAGCA ATGTCAGCGA TACACAGCCA ATGACGTTTG TGGCGGATAA AGACAGAGCG GTTGTCGTTC TGCAAACATC GAAAGCGGAA ATCATTGGGA ATGGCGTGGA TGAGACAACT CTGACAGCAA CAGTGAAAGA TCCGTCGAAT CATCCGGTGG CGGGGATAAC GGTCAACTTC ACCATGCCAC AGGACGTTGC GGCAAACTTT ACCCTTGAAA ATAACGGTAT TGCCATCACT CAGGCCAATG GGGAAGCGCA TGTCACGCTG AAAGGTAAAA AAGCGGGTAC GCATACGGTT ACCGCAACGC TGGGTAATAA CAATACCAGT GATTCGCAGC CGGTAACGTT TGTGGCGGAC AAAACCTCGG CTCAGGTTGT CCTGCAGATG TCAAAAGATG AGATCACAGG TAATGGCGTC GATAACGCAA CGCTAACTGC AACGGTTAAA GATCAGTTCG ACAATGAGGT GAATAATCTT CCGGTAACAT TCAGCTCAGC CTCTTCAGGA CTCACCCTGA CCCCGGGAGT AAGTAATACC AATGAGTCTG GCATCGCGCA GGCCACTCTC GCAGGCGTTG CCTTTGGTGA GCAGACGGTC ACTGCATCAC TGGCTAATAA TGGTGCCAGC GACAACAAAA CTGTGCATTT TATTGGCGAC ACAGCGGCGG CAAAAATTAT CGAGTTGACG CCTGTCCCAG ACAGCATAAT CGCCGGTACC CCGCAGAACA 
GCTCCGGCAG CGTCATCACC GCCACAGTCG TTGATAATAA TGGCTTTCCG GTGAAAGGTG TGACTGTGAA CTTCACCAGC AGAACAAACT CTGCCGAAAT GACGAATGGC GGCCAAGCCG TAACGAACGA ACAGGGTAAG GCTACCGTCA CTTATACCAA TACCCGCTCC TCGATAGAAT CAGGAGCGAG ACCGGATACC GTTGAGGCCA GTCTGGAAAA TGGTAGCTCC ACGCTTAGCA CATCAATTAA TGTCAACGCT GATGCGTCTA CGGCACATCT CACCTTGCTA CAGGCACTTT TTGATACAGT CTCCGCAGGC GACACTACCA ATCTGTATAT TGAGGTGAAG GATAATTACG GCAACGGTGT ACCCCAGCAG GAGGTAACCC TCAGAGTATC ACCAAGTGAA GGCGTGACCC CCAGTAATAA CGCTATATAC ACTACCAACC ACGACGGCAA TTTTTACACA AGCTTTACCG CTACAAAAGC CGGGGTTTAT CAAGTGACGG CAACCCTCGA AAATGGCGAT TCGATGCAAC AAACAGTGAC CTATGTGCCG AACGTCGCGA ATGCCGAAAT CACGCTGGCA GCCTCGAAGG ATCCGGTGAT TGCCGACAAT AACGATCTCA CGACACTAAC AGCAACAGTC GCTGATACAG AGGGCAATGC GATAGCCAAC ACGGAAGTAA CATTTACTCT GCCGGAAGAT GTGAAGGCGA ACTTCACGCT GAGCGATGGC GGTAAAGCGA TTACTGATGC TGAAGGCAAA GCGAAAGTCA CGCTGAAAGG TACAAAAGCA GGCGCTCATA CTGTTACAGC ATCGATGACT GGCGGTAAGA GTGAGCAGTT GGTGGTGAAC TTTATTGCGG ATACGCTCAC TGCGCAGGTT AATCTTAACG TTACCGAGGA CAATTTTATC GCTAATAACG TCGGGATGAC CAGGCTGCAG GCAACAGTGA CTGATGGAAA CGGCAACCCG TTAGCCAATG AGGCGGTGAC ATTCACGCTA CCGGCAGATG TGAGCGCAAG CTTTACTCTC GGACAAGGCG GTTCCGCCAT TACTGACATC AACGGCAAGG CTGAAGTTAC ACTGAGCGGT ACAAAATCCG GCACCTACCC CGTGACAGTT AGCGTGAACA ATTATGGTGT CAGTGATACG AAACAGGTGA CGTTGATTGC CGATGCTGGT ACCGCAAAAC TAGCCTCCTT AACCTCTGTA TACTCATTCG TCGTCAGCAC GACTGAGGGC GCGACCATGA CTGCAAGCGT CACTGACACT AACGGCAACC CGGTAGAAGG CATAAAAGTT AATTTCCGCG GAACCTCCGT CACGCTAAGC AGCACCAGCG TTGAAACTGA TGATCGGGGT TTCGCTGAAA TTCTTGTGAC AAGCACCGAG GTCGGACTGA AAACAGTTTC AGCCTCTCTG GCAGATAAAC CTACTGAAGT CATCTCGCGA TTACTGAATG CCAGTGCAGA TGTTAATTCT GCGACGATTA CCAGTCTGGA GATACCTGAA GGTCAGGTAA TGGTCGCACA AGACGTAGCA GTTAAAGCTC ACGTTAACGA CCAGTTTGGC AACCCGGTTG CGCATCAACC CGTGACATTC AGTGCAGAGC CATCCTCGCA AATGATCATT AGCCAGAATA CGGTTTCTAC TAATACGCAG GGTGTAGCCG AGGTCACCAT GACGCCCGAA AGAAACGGTT CGTATATGGT GAAAGCATCC CTGGCGAATG GAGCCTCACT TGAGAAACAA CTGGAGGCTA TTGATGAAAA ACTGACACTC ACGGCGTCCA GTCCGCTTAT 
CGGTGTCTAT GCCCCTACAG GTGCTACTCT GACGGCAACG CTAACCTCTG CAAATGGCAC TCCAGTGGAG GGTCAGGTCA TCAACTTTAG CGTAACGCCA GAAGGGGCGA CGTTAAGTGG CGGAAAAGTG AGAACTAACT CTTCAGGTCA GGCTCCAGTC GTTCTGACCA GCAATAAAGT CGGTACATAT ACGGTGACTG CATCGTTCCA TAACGGCGTA ACAATACAGA CACAGACAAC CGTGAAAGTC ACTGGCAACT CAAGCACCGC CCATGTTGCT AGCTTTATCG CTGATCCATC GACTATCGCC GCCACCAACA CTGATTTAAG TACCTTAAAG ACAACGGTTG AGGATGGCAG TGGTAACCTG ATCGAAGGTC TCACTGTGTA CTTCGCCTTA AAAAGCGGCT CTGCCACATT AACGTCATTA ACAGCGGTGA CCGATCAAAA CGGAATCGCG ACAACAAGCG TGAAAGGAGC GATGACAGGT AGCGTCACGG TAAGCGCAGT CACGACCGCT GGTGGAATGC AAACAGTAGA TATAACGCTG GTGGCTGGCC CGGCAGACAC CTCGCAGTCC GTCCTTAAGA GCAATCGGTC ATCACTGAAA GGGGACTATA CCGATAGTGC TGAATTACGT CTTGTTCTGC ACGATATATC AGGCAATCCG ATCAAAGTTT CTGAAGGGAT GGAATTTGTG CAATCAGGTA CTAACGTGCC CTATATAAAA ATTAGCGCAA TTGATTACAG TCTAAATATC AACGGTGATT ACAAAGCCAC TGTTACAGGC GGCGGAGAGG GTATCGCAAC GCTGATCCCT GTATTGAATG GTGTTCATCA AGCTGGTCTG AGTACCACAA TACAATTCAC TCGCGCAGAA GACAAAATAA TGAGCGGTAC AGTATCAGTC AATGGTACTG ACCTACCGAC AACTACATTC CCTTCGCAGG GGTTCACTGG GGCGTATTAT CAGTTGAATA ATGACAACTT TGCCCCAGGA AAAACGGCAG CTGATTATGA GTTCTCAAGC TCTGCCTCCT GGGTCGATGT TGATGCTACC GGTAAAGTGA CATTTAAAAA TGTCGGCAGC AATTGGGAAA GGATTACGGC GACGCCAAAA TCAGGAGGCC CTAGCTATAT ATACGAAATC CGTGTGAAGA GTTGGTGGGT GAACTCCGGC GATGCTTTCA TGATATACAG CCTTGCTGAA AATTTTTGCA GTAGCAATGG CTACACACTT CCCCGTGCAG ACCATTTAAA CCATAGTCGT TCCCGAGGCA TCGGGTCACT GTACAGTGAA TGGGGAGATA TGGGGCATTA CACGACTGAA GCTGGTTTTC AATCAAATAT GTATTGGTCA TCTAGTCCCG CAAACTCAAG CGAACAATAC GTAGTTTCCC TGGCAACAGG TGATCAAAGC GTATTTGAAA AGCTTGGGTT TGCTTATGCG ACATGTTATA AAAACCTGTG A
|
Protein sequence | MQMPIRCPTP LERWKSAQSV AERFGISVAE LRKLNQFRTF ARGFDNVRQG DELDVPAQVS ENNLTPPPGN SSGNLEQQIA STSQQIGSLL AEDMNSEQAA NMARGWASSQ ASGAMTDWLS RFGTARITLG VDEDFSLKNS QFDFLHPWYE TPDNLFFSQH TLHRTDERTQ INNGLGWRHF TPTWMSGINF FFDHDLSRYH SRAGIGAEYW RDYLKLSSNG YLRLTNWRSA PELDNDYEAR PANGWDVRAE GWLPAWPYLG GKLVYEQYYG DEVALFDKDD RQSNPHAITA GLNYTPFPLM TFSAEQRQGK QGENDTRFAV DFTWQPGSAM QKQLDPNEVA ARRSLAGSRY DLVDRNNNIV LEYRKKELVR LTLTDPVTGK SGEVKSLVSS LQTKYALKGY NVEATALEAA GGKVVTTGKD ILVTLPGYRF TSTPETDNTW PIEVTAEDVK GNLSNREQSM VVVQAPTLSQ KDSSVSLSTQ TLNADSHSTA TLTFIAHDAA GNPVVGLVLS TRHEGVQDIT LSEWKDNGDG SYTQILTTGA MSGTLTLMPQ LNGVDAAKAP AVVNIISISS SRTHSSIKID KDRYLSGNPI EVTVELRDEN DKPVKEQKQQ LNNAVSIDNV KPGVTTDWKE TADGVYKATY TAYTRGSGLT AKLLMQNWNE DLHTAGFIID ANPQSAKIAT LSASNNGVLA NENAANTVSV NVADEGSNPI NDHTVTFAVL SGSATCFNNQ NTAKTDVNGL ATFDLKSSKQ EDNTVEVTLE NGVKQTLIVS FVGDSSTAQV DLQKSKNEVV ADGNDSATMT ATVRDAKGNL LNDVKVTFNV NSAAAKLSQT EVNSHDGIAT ATLTSLKNGD YRVTASVSSG SQANQQVIFI GDQSTAALTL SVPSGDITVT NTAPLHMTAT LQDKNGNPLK DKEITFSVPN DVASRFSISN SGKGMTDSNG TAIASLTGTL AGTHMITARL ANSNVSDTQP MTFVADKDRA VVVLQTSKAE IIGNGVDETT LTATVKDPSN HPVAGITVNF TMPQDVAANF TLENNGIAIT QANGEAHVTL KGKKAGTHTV TATLGNNNTS DSQPVTFVAD KTSAQVVLQM SKDEITGNGV DNATLTATVK DQFDNEVNNL PVTFSSASSG LTLTPGVSNT NESGIAQATL AGVAFGEQTV TASLANNGAS DNKTVHFIGD TAAAKIIELT PVPDSIIAGT PQNSSGSVIT ATVVDNNGFP VKGVTVNFTS RTNSAEMTNG GQAVTNEQGK ATVTYTNTRS SIESGARPDT VEASLENGSS TLSTSINVNA DASTAHLTLL QALFDTVSAG DTTNLYIEVK DNYGNGVPQQ EVTLRVSPSE GVTPSNNAIY TTNHDGNFYT SFTATKAGVY QVTATLENGD SMQQTVTYVP NVANAEITLA ASKDPVIADN NDLTTLTATV ADTEGNAIAN TEVTFTLPED VKANFTLSDG GKAITDAEGK AKVTLKGTKA GAHTVTASMT GGKSEQLVVN FIADTLTAQV NLNVTEDNFI ANNVGMTRLQ ATVTDGNGNP LANEAVTFTL PADVSASFTL GQGGSAITDI NGKAEVTLSG TKSGTYPVTV SVNNYGVSDT KQVTLIADAG TAKLASLTSV YSFVVSTTEG ATMTASVTDT NGNPVEGIKV NFRGTSVTLS STSVETDDRG FAEILVTSTE VGLKTVSASL ADKPTEVISR LLNASADVNS ATITSLEIPE GQVMVAQDVA VKAHVNDQFG NPVAHQPVTF SAEPSSQMII SQNTVSTNTQ GVAEVTMTPE RNGSYMVKAS LANGASLEKQ LEAIDEKLTL 
TASSPLIGVY APTGATLTAT LTSANGTPVE GQVINFSVTP EGATLSGGKV RTNSSGQAPV VLTSNKVGTY TVTASFHNGV TIQTQTTVKV TGNSSTAHVA SFIADPSTIA ATNTDLSTLK TTVEDGSGNL IEGLTVYFAL KSGSATLTSL TAVTDQNGIA TTSVKGAMTG SVTVSAVTTA GGMQTVDITL VAGPADTSQS VLKSNRSSLK GDYTDSAELR LVLHDISGNP IKVSEGMEFV QSGTNVPYIK ISAIDYSLNI NGDYKATVTG GGEGIATLIP VLNGVHQAGL STTIQFTRAE DKIMSGTVSV NGTDLPTTTF PSQGFTGAYY QLNNDNFAPG KTAADYEFSS SASWVDVDAT GKVTFKNVGS NWERITATPK SGGPSYIYEI RVKSWWVNSG DAFMIYSLAE NFCSSNGYTL PRADHLNHSR SRGIGSLYSE WGDMGHYTTE AGFQSNMYWS SSPANSSEQY VVSLATGDQS VFEKLGFAYA TCYKNL
|
| |
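The gene sequence begins with TTG while the protein sequence begins with M, which is what one would expect if TTG is used as an initiation codon under translation table 11. The following is a minimal sketch of that translation step, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and a simplified set of start codons (ATG, GTG, TTG); `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment, ignoring whitespace between 10-base groups."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for idx in range(0, usable, 3):
        codon = dna[idx:idx + 3]
        if idx == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
print(translate_cds("TTGCAAATGC CAATACGGTG TCCTACACCC"))
print(translate_cds("ACATGTTATA AAAACCTGTG A"))
```

The two fragments translate to MQMPIRCPTP and TCYKNL, which agree with the first ten and last six residues of the protein sequence listed above.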