Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1533 |
Symbol | |
ID | 6143700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1516732 |
End bp | 1517988 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616410 |
Product | hypothetical protein |
Protein accession | YP_001743588 |
Protein GI | 170682120 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000116235 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00188577 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAACG TGCAAATTGT TAAGACCGGC GAGGTCATTG GCGCGACGCA ACTTACTGAA GGCGAGTTAA TTGTTGAGGC TGGCGGAAGA GCCGAAAATA CCGTGGTCAC GGGGGCTGGC TGGTTGAAAG TGGCAACCGG TGGGATCGCC AAATGCACAC AGTACGGCAA CAATGGCACG CTATCGGTCA GCGACGGTGC CATTGCCACA GATATTGTTC AGTCCGAGGG AGGCGCAATT AGTCTCTCTA CGCTCGCTAC GGTTAATGGC CGCCATCCCG AAGGTGAATT CAGCGTTGAT AAAGGTTATG CCTGCGGTTT GTTGCTGGAA AATGGCGGTA ACCTGCGTGT ACTGGAAGGC CATCGCGCGG AAAAAATTAT TCTCGATCAA GAGGGTGGCC TGTTGGTCAA TGGGACAACC TCAGCGGTCG TGGTAGATGA AGGTGGTGAA TTGTTGGTGT ATCCAGGTGG GGAAGCCAGC AATTGTGAGA TTAATCAGGG CGGCGTTTTT ATGCTGGCGG GGAAAGCCAA TGATACGTTG CTTGCTGGTG GCACCATGAA TAATCTCGGT GGTGAAGACT CTGACACTAT TGTTGAGAAT GGAGCCATCT ATCGTCTGGG GACGGATGGT CTTCAGCTCT ACAGTTCCGG TAAGACGCAA AACCTGTCCG TTAATGTGGG TGGTCGGGCT GAAGTGCATG CCGGTACGCT GGAAAATGCG GTAATACAAG GTGGGACAGT GATCCTGTTG TCACCCACCA GCGCGGACGA AAATTTTGTC GTAGAGGAAG ATCGCGCACC GGTTGAACTG ACCGGTAGTG TTGCATTACT GGACGGCGCT TCAATGATTA TTGGCTATGG CGCAGATCTG CAACAATCAA CGATTACTGT ACAGCAGGGC GGTGTATTGA TTCTCGACGG CAGTACGATA AAAGGTGACA GTGTCACTTT CAGTGTTGGT AACATCAATC TCAATGGCGG AAAACTGTGG CTAATCACTG GTGCGGCAAC GCATGTGCAA CTTAAAGTGA AACGCCTGCG CGGAGAGGGA GCGATTTGCC TGCAAACCAG TGCGAAAGAA ATTTCACCTG ACTTCATCAA TGTGAAAGGG GAAGTTACTG GTGATATACA CGTTGAGATA ACAGATGCCA GTCGGCAAAC TCTGTGTAAC GCACTGAAAC TACAGCCAGA CGAAGACGGG ATTGGCGCAA CGCTCCAGCC TGCGTAA
|
Protein sequence | MGSDAKNLMS DGNVQIVKTG EVIGATQLTE GELIVEAGGR AENTVVTGAG WLKVATGGIA KCTQYGNNGT LSVSDGAIAT DIVQSEGGAI SLSTLATVNG RHPEGEFSVD KGYACGLLLE NGGNLRVLEG HRAEKIILDQ EGGLLVNGTT SAVVVDEGGE LLVYPGGEAS NCEINQGGVF MLAGKANDTL LAGGTMNNLG GEDSDTIVEN GAIYRLGTDG LQLYSSGKTQ NLSVNVGGRA EVHAGTLENA VIQGGTVILL SPTSADENFV VEEDRAPVEL TGSVALLDGA SMIIGYGADL QQSTITVQQG GVLILDGSTI KGDSVTFSVG NINLNGGKLW LITGAATHVQ LKVKRLRGEG AICLQTSAKE ISPDFINVKG EVTGDIHVEI TDASRQTLCN ALKLQPDEDG IGATLQPA
|
| |