Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2424 |
Symbol | |
ID | 6068472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2669655 |
End bp | 2672522 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641601833 |
Product | outer membrane autotransporter |
Protein accession | YP_001725385 |
Protein GI | 170020431 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.33795 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCA AACAACACAA TGGGAATACC AAAGCAGATC GTCTCGCTGA ATTAAAAATC CGTTCGCCCT CAATTCAACT GATAAAATTT GGCGCTATTG GTTTGAATGC AATTATATTT TCCCCCCTGC TGATAGCTGC TGATACAGGA AGTCAATATG GCACCAATAT TACTATTAAT GATGGTGACA GAATTACAGG AGATACCGCC GATCCATCAG GAAACCTCTA TAGTGTAATG ACCCCAGCAG GAAACACGCC TGGCAATATC AACCTGGGTA ATGATGTCAC CGTCAATGTC AACGACGCCT CTGGATATGC AAAAGGAATC ATTATTCAGG GCAAAAACAG CTCCCTGACA GCTAACCGAC TCACAGTAGA TGTTGTTGGT CAAACCTCTG CCATCGGCAT TAACTTAATT GGTGACTATA CCCATGCTGA CTTAGGCACA GGCAGCACCA TTAAGAGTAA CGATGACGGC ATCATTATTG GGCATAGCTC AACACTAACA GCCACTCAAT TCACCATTGA AAACTCGAAC GGTATAGGCC TAACCATCAA TGACTATGGC ACCAGTGTCG ATCTTGGAAG CGGAAGTAAA ATCACGACCG ATGGAAGTAC AGGTGTTTAT ATCGGTGGTC TCAACGGCAA TAACGCCAAT GGTGCTGCGC GTTTTACGGC TACAGACCTG ACAATCGATG TTCAGGGCTA CAGCGCCATG GGGATAAACG TACAGAAAAA CTCTGTTGTC GATCTCGGAA CAAACAGTAC CATTAAAACC AATGGCGATA ATGCTCACGG CCTCTGGAGC TTTGGCCAGG TTAGCGCGAA TGCACTCACT GTTGATGTAA CTGGAGCCGC GGCCAATGGC GTCGAAGTTC GTGGTGGTAC AACCACTATC GGTGCAGATA GCCATATTTC TTCCGCGCAG GGCGGTGGCC TCGTCACCAG TGGTTCAGAC GCGATAATCA ATTTTACTGG CACGGCAGCG CAACGAAACA GCATCTTTTC CGGCGGTTCT TATGGTGCCT CGGCCCAGAC GGCAACGGCT GTTGTCAACA TGCAAAATAC CGATATTACA GTTGATCGTA ATGGCAGTCT GGCGCTGGGT TTGTGGGCTC TCAGCGGCGG TAGAATAACC GGAGACAGTT TGGCTATCAC CGGCGCGGCA GGAGCCAGGG GAATTTATGC CATGACCAAC AGCCAGATCG ACCTCACGAG CGATCTGGTC ATTGATATGA GTACACCCGA CCAGATGGCC ATCGCAACGC AACATGACGA TGGTTATGCC GCCAGCCGCA TCAACGCCTC GGGTCGTATG CTTATCAACG GTAGCGTTCT TTCCAAAGGT GGGCTAATCA ATCTGGATAT GCACCCTGGG TCGGTTTGGA CAGGTTCCTC CCTCAGCGAT AATGTCAATG GCGGAAAACT GGACGTTGCA ATGAATAACA GCGTCTGGAA CGTAACAAGT AATTCTAATC TCGACACGCT GGCGCTGAGC CATTCAACTG TCGATTTTGC CAGCCACGGG TCAACTGCCG GCACATTTGC CACATTAAAC GTAGAGAACC TGAGCGGTAA CAGTACCTTT ATTATGCGTG CTGATGTTGT TGGCGAGGGT AATGGCGTTA ATAATAAAGG GGATTTATTG AATATCAGCG GGAGTAGTGC TGGTAATCAC GTATTGGCTA TCCGCAACCA GGGCAGCGAG GCCACAACGG GAAATGAAGT TCTGACAGTG GTAAAAACCA CTGACGGCGC GGCCTCGTTC AGCGCGTCTT CTCAGGTTGA GTTGGGGGGA TATCTGTACG ATGTGCGTAA AAATGGCACT AACTGGGAGC TTTACGCTTC CGGGACAGTT CCGGAACCGA CTCCTAATCC TGAACCCACA CCAGCTCCCG CTCAGCCTCC CATAGTCAAC CCCGATCCTA CGCCTGAACC CGCTCCCACG CCTAAACCCA CCACGACCGC AGATGCTGGC GGCAATTATC TCAATGTCGG TTACTTATTG AACTATGTTG AAAACCGTAC GCTGATGCAA CGGATGGGTG ACCTGCGAAA TCAGAGTAAA GACGGTAATA TCTGGTTGCG CAGTTATGGG GGAAGCCTGG ACTCCTTTGC CAGTGGCAAA CTGAGCGGCT TTGACATGGG TTACAGCGGT ATCCAGTTTG GTGGGGATAA ACGTCTCTCT GATGTAATGC CGTTGTATGT CGGTCTGTAT ATTGGCTCAA CACATGCATC GCCGGACTAT AGCGGAGGCG ACGGTACCGC ACGTTCAGAC TACATGGGAA TGTACGCCAG TTACATGGCA CAAAACGGTT TTTACAGCGA TCTCGTTATA AAAGCATCGC GCCAGAAAAA TAGTTTCCAC GTACTGGACA GTCAGAACAA CGGCGTTAAC GCCAACGGCA CTGCGAATGG AATGAGCATC TCCCTGGAAG CCGGGCAGAG GTTCAACCTG TCCCCTACTG GTTATGGGTT CTATATAGAG CCGAAAACCC AGCTTACATA CAGCCACCAG AATGAGATGA CTATGAAGGC GAGTAATGGC CTCAATATAC ATCTGAATCA CTACGAATCG CTGCTGGGGC GTGCCAGCAT GATACTGGGG TATGACATCA CCGCAGGCAA CAGCCAGCTG AATGTCTATG TGAAGACTGG CGCTATCCGC GAGTTTTCAG GGGATACCGA ATATCTGTTG AACAACTCCC GGGAGAAGTA CAGTTTCAAA GGTAATGGCT GGAATAACGG CGTGGGAGTC AGTGCACAGT ATAACAAACA GCACACATTC TATCTCGAAG CGGATTACAC GCAGGGTAAC CTCTTTGATC AGAAGCAAGT CAACGGAGGA TATCGCTTCA GCTTTTAA
|
Protein sequence | MGIKQHNGNT KADRLAELKI RSPSIQLIKF GAIGLNAIIF SPLLIAADTG SQYGTNITIN DGDRITGDTA DPSGNLYSVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT ANRLTVDVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN GIGLTINDYG TSVDLGSGSK ITTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM GINVQKNSVV DLGTNSTIKT NGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI GADSHISSAQ GGGLVTSGSD AIINFTGTAA QRNSIFSGGS YGASAQTATA VVNMQNTDIT VDRNGSLALG LWALSGGRIT GDSLAITGAA GARGIYAMTN SQIDLTSDLV IDMSTPDQMA IATQHDDGYA ASRINASGRM LINGSVLSKG GLINLDMHPG SVWTGSSLSD NVNGGKLDVA MNNSVWNVTS NSNLDTLALS HSTVDFASHG STAGTFATLN VENLSGNSTF IMRADVVGEG NGVNNKGDLL NISGSSAGNH VLAIRNQGSE ATTGNEVLTV VKTTDGAASF SASSQVELGG YLYDVRKNGT NWELYASGTV PEPTPNPEPT PAPAQPPIVN PDPTPEPAPT PKPTTTADAG GNYLNVGYLL NYVENRTLMQ RMGDLRNQSK DGNIWLRSYG GSLDSFASGK LSGFDMGYSG IQFGGDKRLS DVMPLYVGLY IGSTHASPDY SGGDGTARSD YMGMYASYMA QNGFYSDLVI KASRQKNSFH VLDSQNNGVN ANGTANGMSI SLEAGQRFNL SPTGYGFYIE PKTQLTYSHQ NEMTMKASNG LNIHLNHYES LLGRASMILG YDITAGNSQL NVYVKTGAIR EFSGDTEYLL NNSREKYSFK GNGWNNGVGV SAQYNKQHTF YLEADYTQGN LFDQKQVNGG YRFSF
|
| |