Gene Information |
Locus tag | EcolC_1418 |
Symbol | |
ID | 6067759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1552399 |
End bp | 1556103 |
Gene Length | 3705 bp |
Protein Length | 1234 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641600837 |
Product | adhesin |
Protein accession | YP_001724408 |
Protein GI | 170019454 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat [TIGR01414] outer membrane autotransporter barrel domain [TIGR02601] autotransporter-associated beta strand repeat |
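
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is an untranslated stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 1552399, End bp 1556103.
length = gene_length_bp(1552399, 1556103)
print(length)                              # 3705
print(expected_protein_length_aa(length))  # 1234
```

Both results agree with the Gene Length (3705 bp) and Protein Length (1234 aa) fields listed in the record.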
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000765921 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGCAT CTCTTTTCTC TGCTAACGGT GTCGCGGCGG TCACTGATTC ATGCCAGGGA TATGATGTCA AAGCGAGTTG TCAGGCCAGC AGGCAAAGCC TTTCAGGCAT TACGCAGGAC TGGAGTATCG CTGATGGGCA ATGGCTGGTT TTTTCGGATA TGACCAATAA CGCCAGCGGT GGGGCCGTAT TTTTGCAACA AGGAGCGGAA TTTTCACTAT TACCAGAAAA TGAAACTGGA ATGACTCTGT TTGCCAATAA CACCGTTACA GGAGAATATA ATAACGGCGG GGCCATATTT GCTAAAGAAA ACTCAACGCT GAATCTTACT GATGTTATTT TTTCCGGTAA CGTCGCAGGC GGCTATGGTG GCGCAATCTA TTCTTCTGGT ACTAACGATA CTGGTGCCGT CGATTTACGT GTCACTAACG CCATGTTTCG CAATAACATC GCTAATGATG GCAAAGGTGG CGCAATTTAT ACCATTAATA ATGACGTTTA TTTAAGTGAT GTTATTTTTG ATAACAACCA GGCATATACA TCAACAAGTT ACAGTGATGG CGATGGCGGG GCAATCGATG TTACCGATAA TAATAGCGAC AGCAAGCATC CTTCAGGTTA TACGATAGTA AATAACACTG CCTTTACAAA TAACACTGCC GAAGGTTATG GCGGGGCGAT ATATACCAAT AGCGTGACGG CTCCCTATCT TATTGATATT TCTGTTGATG ACAGCTACAG CCAGAACGGA GGCGTGTTAG TCGATGAGAA CAATAGCGCA GCAGGCTATG GAGATGGTCC TTCCTCTGCG GCGGGTGGCT TTATGTATCT CGGCTTAAGT GAAGTTACCT TTGATATTGC CGACGGAAAA ACGCTGGTTA TTGGCAATAC AGAGAATGAC GGAGCTGTTG ACTCTATTGC TGGTACCGGG TTAATCACCA AAACAGGTTC CGGCGATCTG GTACTTTATG CAGATAACAA TGACTTTACT GGTGAGATGC AGATTGAAAA CGGTGAAGTT ACCCTGGGCC GCAGCAACTC CCTGATGAAT GTCGGCGATA CGCATTGCCA GGACGATCCG CAAGACTGCT ACGGTCTGAC GATAGGGAGT ATTGATCAGT ATCAGAATCA GGCTGAGCTA AACGTTGGCT CGACCCAACA AACTTTTGTG CACGCATTGA CGGGCTTTCA GAATGGCACT TTAAATATCG ATGCTGGTGG CAACGTTACT GTTAATCAGG GCAGTTTTGC TGGCATCATC GAAGGTGCTG GTCAGCTCAC CATTGCGCAA AACGGCAGCT ACGTGCTGGC AGGGGCGCAG TCGATGGCGC TAACCGGCGA TATAGTCGTT GATGATGGTG CGGTGCTTTC GCTGGAAGGC GACGCGGCAG ATCTTACCGC TCTCCAGGAC GATCCGCAGT CGATCGTGTT AAACGGCGGT GTGCTCGATC TCTCTGATTT CTCCACCTGG CAGAGCGGCA CATCATACAA CGATGGCCTT GAAGTCAGTG GCAGCAGCGG AACGGTTATC GGCAGTCAGG ATGTGGTAGA TCTTGCAGGT GGCGACAATT TGCATATCGG CGGCGACGGG AAAGATGGCG TCTACGTGGT GGTCGATGCG AGCGACGGGC AGGTAAGTCT GGCAAACAAT AATAGTTATT TGGGCACAAC ACAAATCGCC TCCGGTACGC TGATGGTGAG CGACAACTCG CAGCTTGGAG ATACCCACTA TAACCGCCAG GTTATCTTTA CCGATAAGCA ACAAGAAAGC GTGATGGAGA TTACCTCCGA CGTTGACACG 
CGTTCAGATG CGGCAGGCCA CGGACGTGAT ATTGAAATGC GCGCCGACGG TGAAGTGGCA GTTGATGCGG GGGTAGACAC GCAGTGGGGC GCACTGATGG CTGACAGCAG CGGGCAGCAT CAGGATGAGG GTAGCACATT GACTAAAACG GGGGCTGGTA CACTGGAGCT GACCGCCAGC GGTACAACGC AGTCGGCGGT ACGTGTCGAA GAAGGCACCC TGAAAGGTGA TGTTGCTGAT ATCCTTCCTT ATGCTTCGTC ACTGTGGGTT GGTGATGGGG CAACGTTCGT TACTGGCGCG GATCAGGATA TTCAGTCAAT TGATGCTATT TCCAGCGGCA CTATCGACAT CAGCGATGGT ACGGTTTTGC GCCTGACCGG GCAGGATACT TCCGTCGCCC TTAATGCCTC ACTATTTAAC GGCGATGGGA CGCTGGTGAA TGCCACCGAT GGTGTGACGT TGACAGGTGA GCTTAATACC AACCTTGAAA CTGACAGCCT GACTTATCTT TCCAACGTGA CGGTTAATGG CAATCTGACC AATACGTCCG GTGCGGTTAG CCTGCAAAAT GGCGTCGCTG GCGATACGCT GACGGTAAAC GGTGATTATA CCGGCGGCGG TACGCTACTG CTCGATAGCG AATTAAACGG CGATGACTCG GTAAGCGATC AATTGGTGAT GAACGGTAAT ACTGCTGGCA ACACAACTGT GGTGGTTAAC TCCATTACAG GGATTGGTGA GCCGACATCG ACAGGCATTA AAGTGGTTGA TTTCGCAGCT GATCCCACGC AGTTTCAAAA CAATGCGCAG TTCAGTCTGG CAGGCAGCGG CTACGTCAAT ATGGGAGCGT ATGACTACAC GCTGGTGGAA GATAACAACG ACTGGTATCT GCGATCGCAA GAAGTAACGC CGCCATCGCC ACCTGAGCCA GACCCGACTC CCGATCCTGA TCCCACGCCG GATCCTGACC CAACCCCCGA CCCGGCCCCT GTGCCTGTTT ACCAGCCGGT GCTGAATGCC AAAGTTGGCG GTTATCTCAA TAACCTGCGG GCGGCAAATC AGGCGTTTAT GATGGAGCGA CGCGATCACG CAGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT CATTACACAG CAGCGGGGCA ACTGGCTCAA CATGAAGACA CTTCTACGGT GCAACTTAGC GGCGATCTGT TTAGCGGGCG CTGGGGCACG GATGGCGAGT GGATGCTTGG GATTGTTGGT GGCTACAGCG ATAACCAGGG CGACAGCCGC TCGAATATGA CCGGAACTCG CGCCGATAAC CAGAACCACG GTTATGCCGT TGGGCTGACA TCAAGCTGGT TTCAGCACGG TAATCAGAAG CAAGGGGCCT GGCTGGATAG CTGGCTGCAA TACGCGTGGT TTAGCAATGA TGTTTCCGAA CAAGAAGATG GCACAGATCA TTACCACTCG TCGGGGATTA TCGCCTCGCT GGAGGCGGGG TATCAGTGGT TACCGGGGCG TGGTGTGGTG ATTGAACCGC AGGCGCAGGT GATTTATCAG GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCGTGCGC GCGTGTCACA ATCGCAGGGT GATGATATTC AGACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCTGT TCATGTCATA CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATTCGA CGGAAATTGA AGAGGATGGC AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AGTCAGGGGC AATATCAGTC 
AGCGAGTGTC CCTGCGCGGC AGCGTGGCGT GGCAGAAAGG GAGTGATGAT TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACGGTGAAAT GGTAA
|
Protein sequence | MIASLFSANG VAAVTDSCQG YDVKASCQAS RQSLSGITQD WSIADGQWLV FSDMTNNASG GAVFLQQGAE FSLLPENETG MTLFANNTVT GEYNNGGAIF AKENSTLNLT DVIFSGNVAG GYGGAIYSSG TNDTGAVDLR VTNAMFRNNI ANDGKGGAIY TINNDVYLSD VIFDNNQAYT STSYSDGDGG AIDVTDNNSD SKHPSGYTIV NNTAFTNNTA EGYGGAIYTN SVTAPYLIDI SVDDSYSQNG GVLVDENNSA AGYGDGPSSA AGGFMYLGLS EVTFDIADGK TLVIGNTEND GAVDSIAGTG LITKTGSGDL VLYADNNDFT GEMQIENGEV TLGRSNSLMN VGDTHCQDDP QDCYGLTIGS IDQYQNQAEL NVGSTQQTFV HALTGFQNGT LNIDAGGNVT VNQGSFAGII EGAGQLTIAQ NGSYVLAGAQ SMALTGDIVV DDGAVLSLEG DAADLTALQD DPQSIVLNGG VLDLSDFSTW QSGTSYNDGL EVSGSSGTVI GSQDVVDLAG GDNLHIGGDG KDGVYVVVDA SDGQVSLANN NSYLGTTQIA SGTLMVSDNS QLGDTHYNRQ VIFTDKQQES VMEITSDVDT RSDAAGHGRD IEMRADGEVA VDAGVDTQWG ALMADSSGQH QDEGSTLTKT GAGTLELTAS GTTQSAVRVE EGTLKGDVAD ILPYASSLWV GDGATFVTGA DQDIQSIDAI SSGTIDISDG TVLRLTGQDT SVALNASLFN GDGTLVNATD GVTLTGELNT NLETDSLTYL SNVTVNGNLT NTSGAVSLQN GVAGDTLTVN GDYTGGGTLL LDSELNGDDS VSDQLVMNGN TAGNTTVVVN SITGIGEPTS TGIKVVDFAA DPTQFQNNAQ FSLAGSGYVN MGAYDYTLVE DNNDWYLRSQ EVTPPSPPEP DPTPDPDPTP DPDPTPDPAP VPVYQPVLNA KVGGYLNNLR AANQAFMMER RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS GDLFSGRWGT DGEWMLGIVG GYSDNQGDSR SNMTGTRADN QNHGYAVGLT SSWFQHGNQK QGAWLDSWLQ YAWFSNDVSE QEDGTDHYHS SGIIASLEAG YQWLPGRGVV IEPQAQVIYQ GVQQDDFTAA NRARVSQSQG DDIQTRLGLH SEWRTAVHVI PTLDLNYYHD PHSTEIEEDG STISDDAVKQ RGEIKVGVRG NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW
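
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which start codons are permitted), and it translates only the first 30 and last 15 nucleotides copied from the record rather than the full sequence. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA fragment codon by codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-nt blocks and last two blocks of the gene sequence in the record.
head = translate("ATGATTGCAT CTCTTTTCTC TGCTAACGGT")
tail = translate("ACGGTGAAAT GGTAA")
print(head)  # MIASLFSANG
print(tail)  # TVKW*
```

The translated head matches the first block of the protein sequence, and the tail matches its final residues followed by the stop codon.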