Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1958 |
Symbol | |
ID | 6142645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1980286 |
End bp | 1981593 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616834 |
Product | hypothetical protein |
Protein accession | YP_001744010 |
Protein GI | 170682033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.272689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.538677 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTATC TCAGCCCGCA AAAATTCAGC TGGGGTGATG CCCCCTGGCA GATCATTGAC CTTAGTACCG CGGGCAAAGT GAACATACAG GTGGACAACA ATACCATCAT CACGTTGGGG ACTCCCCTGA ATCAGCAACA TAACGAGTTC ATGATGGTCG CCAAATGGTG CGAATGGGCT ATACAGCAGG ATGGTTTGCA AGAAAACCTG CAAAAAAAAC TGCACGAGGT CCTGGAAGAA AACCGGCAAA ACAAACAGTC AGAAATTCCC CAAGAAGACC TGAAAGAAAG GCTGAAAGAA ATCAAGGAAG ATATCCTGAA AGAAAACCAG TCAGCAAGCC AGATTGAAGA CCGGGCAGAA GCATTGCGCA GAATGAAGGA ATGTCTCATA ACAAGACAGA GTATGCTCGA TCTTAGCAAC CTTGGACTGA CTTCACTCCC TGAAAATTTG CCTCCACATC TGATTGAATT TAACTGCAGT AGAAACATGT TGACCGCGTT ACCGGAGGTA ATGCCAAAGG GGCTGAGAGT GCTTGAATGT ATGGAGAACT TTTTGATCTT GTTACCGAAG GTGCAGCCCC CGAAACTGAT GGTACTGAAG TGCTATGAAA ACTATATTAT CTGGCTGCCT GAGCTGTCGA CTAACCTGAG AGTGATTGAC TGTTCTGAAA ACTTCTTGCA ATTTTTACCG CCGTCGATGC CCCAGTACCT GTATACACTG CGCTGTGCTT TCAACAGTAT TAGCTTAATA CCTGATGAGA TGCTGGAGAA CTTGACTCGC CTGAAGGTAT TTGACTGTTC TAGTAACGAT TTGATCTCTT CACCACGGCT GCCGCCCAAA CTAATCATAT ACTACTGTGG AGAAAACCAG TTTAAAACTG TACCGGTGCC GCAGCCCCGA AGCCTGAAGG TGTTTAGCTG TAATGGTAAC CCGTGGGACA AAGACAATTT ACCGACGCTG CTCAAAGCCG TCGAGGGCCT GAAAAACCAG GAGGGTCTGG AGGAGCTTTT GGACTTTTTG CACAAGGAAG GTCTGGTTGA CCTGGAAGGA CTCGAGGAGC TGGAGGACCT GGATGACCTT ATGGATCTGG AGTTCCTGGA TGACCCGGAA CTCCTGGAGC GCGTGAAGGT ACAGGAGGAC CTGGAGCTCC TGGATCAACA GTTGGGCCTG TTGAGTCTGG AAAAACAACA GGACTCGCAG CCTGTTAATC AACAATCTGA ACATGAACCC GAATCTGCAT CAAAGGTGAA GCGTGATTTA TCTGAAGTCG ACTCCGAGTC AACAATGAAG CGTAAGCGTT TTATGTAG
|
Protein sequence | MKYLSPQKFS WGDAPWQIID LSTAGKVNIQ VDNNTIITLG TPLNQQHNEF MMVAKWCEWA IQQDGLQENL QKKLHEVLEE NRQNKQSEIP QEDLKERLKE IKEDILKENQ SASQIEDRAE ALRRMKECLI TRQSMLDLSN LGLTSLPENL PPHLIEFNCS RNMLTALPEV MPKGLRVLEC MENFLILLPK VQPPKLMVLK CYENYIIWLP ELSTNLRVID CSENFLQFLP PSMPQYLYTL RCAFNSISLI PDEMLENLTR LKVFDCSSND LISSPRLPPK LIIYYCGENQ FKTVPVPQPR SLKVFSCNGN PWDKDNLPTL LKAVEGLKNQ EGLEELLDFL HKEGLVDLEG LEELEDLDDL MDLEFLDDPE LLERVKVQED LELLDQQLGL LSLEKQQDSQ PVNQQSEHEP ESASKVKRDL SEVDSESTMK RKRFM
|
| |