Gene Information |
Locus tag | EcE24377A_1878 |
Symbol | |
ID | 5587846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1870689 |
End bp | 1871945 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640925554 |
Product | hypothetical protein |
Protein accession | YP_001462959 |
Protein GI | 157156334 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | |
|
|
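The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1870689
end_bp = 1871945
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1257 bp is 419 codons, which is 418 residues plus a stop codon.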
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000220122 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAATG TGCAAATTGT TAAGACCGGC GAGGTCATTG GCGCGACGCA ACTTACTGAA GGCGAGTTGA TTGTTGAAGC TGGCGGAAGA GCCGAAAATA CCGTGGTCAC GGGGGCTGGC TGGTTGAAAG TGGCAACCGG TGGGATCGCC AAATGCACAC AGTACGGTAA CAATGGCACG CTATCGGTCA GCGACGGTGC CATTGCCACA GATATTGTTC AGTCCGAGGG AGGCGCAATT AGTCTCTCTA CGCTCGCTAC GGTTAATGGC CGCCATCCCG AAGGTGAATT CAGCGTTGAT CAGGGTTATG CCTGCGGTTT GTTGCTGGAA AATGGCGGTA ACCTGCGTGT ACTGGAAGGG CATCGCGCGG AAAAAATCAT TCTCGATCAA GAGGGCGGCC TGTTGGTTAA TGGGACAACC TCAGCGGTCG TGGTAGATGA AGGTGGTGAA TTGTTGGTGT ATCCAGGTGG GGAAGCCAGC AATTGTGAGA TTAATCAGGG CGGCGTTTTT ATGCTGGCCG GGAAAGCCAG TGATACGTTG CTTGCTGGTG GCACCATGAA TAATCTCGGT GGTGAAGACT CTGACACTAT TGTTGAGAAT GGATCCATTT ATCGTCTGGG GACGGATGGC CTTCAGCTCT ACAGTTCCGG TAAGACGCAA AACCTGTCCG TGAATGTGGG TGGTCGGGCT GAAGTGCATG CCGGTACGCT GGAAAATGCG GTAATACAAG GTGGAACAGT GATCCTGTTG TCACCCACCA GCGCGGACGA AAATTTTGTC GTAGAGGAAG ATCGCGCACC GGTTGAACTG ACCGGGAGTG TTGCATTACT GGACGGCGCT TCAATGATTA TTGGCTATGG CGCAGATCTG CAACAATCAA CGATTACTGT ACAGCAGGGC GGTGTGTTGA TTCTCGACGG CAGTACGGTA AAAGGTGACG GTGTCACTTT TATTGTTGGT AACATCAATC TGAATGGCGG AAAACTGTGG CTGATCACTG GTGCGGCAAC GCATGTGCAA CTGAAAGTGA AACGCCTGCG CGGAGAGGGA GCGATTTGCC TGCAAACCAG TGCGAAAGAA ATCTCACCTG ACTTCATCAA TGTGAAAGGG GAAGTTACCG GGGATATACA CGTTGAGATA ACAGATGCCA GTCGGCAAAC TCTGTGCAAC GCTCTGAAAT TACAGCCAGA CGAAGACGGG ATTGGCGCAA CGCTCCAGCC TGCGTAA
|
Protein sequence | MGSDAKNLMS DGNVQIVKTG EVIGATQLTE GELIVEAGGR AENTVVTGAG WLKVATGGIA KCTQYGNNGT LSVSDGAIAT DIVQSEGGAI SLSTLATVNG RHPEGEFSVD QGYACGLLLE NGGNLRVLEG HRAEKIILDQ EGGLLVNGTT SAVVVDEGGE LLVYPGGEAS NCEINQGGVF MLAGKASDTL LAGGTMNNLG GEDSDTIVEN GSIYRLGTDG LQLYSSGKTQ NLSVNVGGRA EVHAGTLENA VIQGGTVILL SPTSADENFV VEEDRAPVEL TGSVALLDGA SMIIGYGADL QQSTITVQQG GVLILDGSTV KGDGVTFIVG NINLNGGKLW LITGAATHVQ LKVKRLRGEG AICLQTSAKE ISPDFINVKG EVTGDIHVEI TDASRQTLCN ALKLQPDEDG IGATLQPA
|
| |
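The gene and protein sequences above can be related to each other, and to the GC content field, with a few lines of code. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). It also assumes the gene sequence is listed in coding orientation, which is consistent with it starting with ATG and ending with TAA.

```python
from itertools import product

# Codon-to-amino-acid map in TCAG order; table 11 shares these
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon and drop any trailing stop symbol."""
    seq = clean(seq)
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)]
    return "".join(residues).rstrip("*")


# First 30 bases of the gene sequence listed above.
prefix = "ATGGGATCTG ATGCGAAAAA CTTGATGAGC"
print(translate(prefix))  # MGSDAKNLMS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein Length fields.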