Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0396 |
Symbol | |
ID | 5589161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 424630 |
End bp | 427398 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640924120 |
Product | outer membrane autotransporter |
Protein accession | YP_001461547 |
Protein GI | 157157315 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.381376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCTGGAG ATTCAGGGGG GCAGTCTAGC AACTATGTAA ACTATAGTGG TTTTGTCTAT TACAACAACA CCAATGGTGA TTTCGATCAG TCCTTTAACG GCGATACCGT TAACGGGACA ATCTCAACCT ATTATTTGAA CCATGATTAT GCAGACAGTA CTGCTAATCA GCTTGATATC AGTAATTCAG TGATTCACGG TTCGATTACT TCTATGCTGC CTGGCGGTTA TTATGATCGT TTTGATGCAG ATGGTAATAA TCTGGGTGGA TATGATTTTT ACACTGATGC GGTTGTTGAT ACACACTGGC GTGATGGTGA TGTTTTCACT TTGAACATTG CTAACACTAC TATTGATGAT GATTATGAAG CTCTTTACTT CACTGATTCT TATAAAGATG GTGATGTAAC CAAGCACACA AATGAGACAT TTGATACAAG TGAAGGCGTT GCTGTTAATC TTGATGTAGA AAGTAACATC AATATTTCCA ATAACTCCCG CGTTGCAGGT ATTGCATTAT CTCAAGGTAA TACTTACAAC GAAACCTACA CTACCGAATC TCATACTTGG GATAACAATA TCTCTGTAAA AGATTCCACA GTGACTTCGG GTTCAAATTA TATCCTGGAT AGCAATACTT ATGGCAAAAC TGGTCACTTT GGCAATTCTG ATGAACCGAG TGATTATGCT GGCCCGGGTG ATGTTGCAAT GTCCTTTACT GCTTCAGGTT CCGACTATGC GATGAAGAAC AATGTATTCC TCAGCAATTC AACGCTGATG GGTGATGTTG CCTTTACCAG CACCTGGAAT AGTAATTTTG ATCCGAATGG TCATGATTCC AACGGTGACG GGGTGAAAGA TACCAACGGG GGTTGGACTG ATGATAGCCT CAACGTTGAT GAACTAAATC TCACTCTCGA TAACGGAAGC AAGTGGGTTG GTCAGGCAAT TTATAACGTT GCTGAAACGT CAGCAATGTA TGATGTTGCT ACAAACAGCC TTACTCCTGA TGCAACATAT GAAAACAATG ACTGGAAACG TGTTGTTGAT GACAAGGTCT TCCAGAGCGG TGTATTTAAC GTAGCGTTGA ATAACGGTTC TGAATGGGAT ACTACAGGTC GTTCCATCGT TGATACCTTG ACAGTTAATA ATGGTTCTCA GGTTAATGTT TCGGAATCTA AATTAACTTC AGATACTATC GATTTAACTA ACGGTTCTTC GCTGAACATT GGTGAAGATG GCTACGTTGA TACCGATCAT CTGACTATTA ACTCCTACAG TACTGTTGCG TTGACCGAAT CTACTGGGTG GGGGGCTGAT TACAACCTGT ACGCCAATAC TATCACCGTA ACTAACGGTG GTGTATTGGA TGTGAACGTT GATCAGTTCG ATACTGAAGC TTTCCGTACT GACAAACTGG AACTGACCAG CGGCAACATC GCTGACCATA ACGGTAACGT AGTATCTGGT GTGTTCGATA TCCATAGCAG CGATTACGTT CTGAACGCTG ATCTGGTGAA CGACCGTACG TGGGATACTT CCAAGTCTAA CTACGGTTAC GGTATTGTTG CTATGAACTC TGACGGTCAC CTGACTATCA ATGGTAACGG CGACGTAGAC AACGGTACTG AACTGGATAA CAGCTCTGTT GATAACGTTG TTGCTGCAAC CGGTAACTAC AAAGTTCGTA TCGACAACGC AACTGGCGCT GGCGCTATCG CTGATTACAA AGATAAAGAA ATTATCTACG TAAACGACGT CAACACCAAC GCGACCTTCT CTGCTGCTAA CAAAGCTGAC CTGGGTGCAT ACACCTATCA GGCTGAACAG CGCGGTAACA CCGTTGTTCT GCAACAGATG GAGCTGACCG ACTACGCTAA CATGGCGCTG AGCATCCCGT CTGCGAACAC CAATATCTGG AACCTGGAAC AAGACACCGT TGGTACTCGT CTGACCAACT CTCGTCATGG CCTGGCTGAT AACGGCGGCG CATGGGTAAG CTACTTCGGT GGTAACTTCA ACGGCGACAA CGGCACCATC AACTATGATC AGGATGTTAA CGGCATCATG GTCGGTGTTG ATACCAAAAT TGACGGTAAC AACGCTAAGT GGATCGTCGG TGCGGCTGCA GGCTTCGCTA AAGGTGACAT GAATGACCGT TCTGGTCAGG TGGATCAAGA CAGCCAGACT GCCTACATCT ACTCTTCTGC TCACTTCGCG AACAACGTCT TTGTTGATGG TAGCTTGAGT TACTCTCACT TCAACAACGA CCTGTCTGCA ACCATGAGCA ACGGTACTTA CGTTGACGGT AGCACCAACT CCGACGCTTG GGGCTTCGGT TTGAAAGCCG GTTACGACTT CAAACTGGGT GATGCTGGTT ACGTGACTCC TTACGGCAGC ATTTCTGGTC TGTTCCAGTC TGGTGATGAC TACCAGCTGA GCAACGACAT GAAAGTTGAC GGTCAGTCTT ACGACAGCAT GCGTTATGAA CTGGGTGTAG ATGCAGGTTA TACCTTCACC TACAGCGAAG ACCAGGCTCT GACTCCGTAC TTCAAACTGG CTTACGTCTA CGACGACTCT AACAACGATA ACGATGTGAA CGGTGATTCC ATCGATAACG GTACTGAAGG GTCTGCGGTA CGTGTTGGTC TGGGTACTCA GTTCAGCTTC ACCAAGAACT TCAGCGCCTA TACCGATGCT AACTACCTCG GTGGTGGTGA CGTAGATCAA GACTGGTCCG CGAACGTGGG TGTTAAATAT ACCTGGTAA
|
Protein sequence | MSGDSGGQSS NYVNYSGFVY YNNTNGDFDQ SFNGDTVNGT ISTYYLNHDY ADSTANQLDI SNSVIHGSIT SMLPGGYYDR FDADGNNLGG YDFYTDAVVD THWRDGDVFT LNIANTTIDD DYEALYFTDS YKDGDVTKHT NETFDTSEGV AVNLDVESNI NISNNSRVAG IALSQGNTYN ETYTTESHTW DNNISVKDST VTSGSNYILD SNTYGKTGHF GNSDEPSDYA GPGDVAMSFT ASGSDYAMKN NVFLSNSTLM GDVAFTSTWN SNFDPNGHDS NGDGVKDTNG GWTDDSLNVD ELNLTLDNGS KWVGQAIYNV AETSAMYDVA TNSLTPDATY ENNDWKRVVD DKVFQSGVFN VALNNGSEWD TTGRSIVDTL TVNNGSQVNV SESKLTSDTI DLTNGSSLNI GEDGYVDTDH LTINSYSTVA LTESTGWGAD YNLYANTITV TNGGVLDVNV DQFDTEAFRT DKLELTSGNI ADHNGNVVSG VFDIHSSDYV LNADLVNDRT WDTSKSNYGY GIVAMNSDGH LTINGNGDVD NGTELDNSSV DNVVAATGNY KVRIDNATGA GAIADYKDKE IIYVNDVNTN ATFSAANKAD LGAYTYQAEQ RGNTVVLQQM ELTDYANMAL SIPSANTNIW NLEQDTVGTR LTNSRHGLAD NGGAWVSYFG GNFNGDNGTI NYDQDVNGIM VGVDTKIDGN NAKWIVGAAA GFAKGDMNDR SGQVDQDSQT AYIYSSAHFA NNVFVDGSLS YSHFNNDLSA TMSNGTYVDG STNSDAWGFG LKAGYDFKLG DAGYVTPYGS ISGLFQSGDD YQLSNDMKVD GQSYDSMRYE LGVDAGYTFT YSEDQALTPY FKLAYVYDDS NNDNDVNGDS IDNGTEGSAV RVGLGTQFSF TKNFSAYTDA NYLGGGDVDQ DWSANVGVKY TW
|
| |