Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2204 |
Symbol | |
ID | 6144226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2216636 |
End bp | 2217868 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617080 |
Product | hypothetical protein |
Protein accession | YP_001744254 |
Protein GI | 170680727 |
COG category | [S] Function unknown |
COG ID | [COG3214] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0443538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.243717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGC CGCACCTCTC TCTTGCTGAT GCGCGTAATC TTCACCTCGC CGCGCAAGGC CTGTTAAACA AACCCCGCCG TCGAGCGTCG TTGGAGGATA TTCCGGCAAC GATCTCCCGC ATGTCCTTGC TGCAAATCGA TACCATCAAT ATTGTTGCCC GTAGCCCATA TCTGGTGCTT TTCAGTCGTC TGGGAAATTA TCCTGCCCAG TGGCTGGATG AGTCTCTGGC GCGTGGCGAA TTAATGGAAT ACTGGGCGCA TGAAGCCTGC TTTATGCCAC GTAGCGATTT TCGTCTTATT CGCCACCGCA TGCTGGCACC TGAAAAAATG GGCTGGAAAT ACAAAGACGC CTGGATGCAG GAACATGAGG TGGAAATTGC ACAGTTAATT CAGCATATTC ATGATAAGGG GCCGGTACGT TCAGCCGATT TTGAGCATCC TCGTAAAGGG GCAAGCGGCT GGTGGGAGTG GAAGCCGCAT AAACGACATC TGGAAGGTTT ATTTACTGCC GGAAAGGTGA TGGTGATTGA ACGGCGCAAC TTCCAGCGCG TTTATGATTT AACCCACCGT GTCATGCCTG ACTGGGATGA TGAGCGCGAT CTCGTTTCGC AAACAGAAGC AGAAATCATC ATGCTGGATA ACAGTGCGCG TAGCCTGGGC ATATTCCGCG AACAGTGGCT GGCAGATTAC TATCGGCTGA AACGTCCGGC ACTGGCGGCT TGGCGCGAAG CGAGGGCTGA ACTGCAGCAA ATCATTGCTG TGCATGTTGA AAAATTGGGC AATCTTTGGT TGCATGCTGA TTTGCTGCCG CTACTCGAGC GTGCGCTGGC CGGAAAGCTC ACTGCAACGC ACAGCGCGGT ACTTTCGCCT TTTGATCCTG TTGTCTGGGA TCGCAAACGC GCAGAGCAGC TTTTTGATTT TAGCTACCGG CTGGAGTGCT ATACCCCTGC GCCGAAACGC CAGTATGGCT ATTTTGTTCT GCCGTTATTA CATCGTGGGC AATTAGTTGG GCGAATGGAT GCCAAAATGC ATCGCCAGAC AGGCATCCTT GAAGTTATCT CTCTGTGGTT ACAGGAAGGC ATTAAACCAA CGACAACGCT GCAAAAAGGG TTACGTCAGG CGATTACTGA TTTCGCTAAC TGGCAGCAGG CAACGCGGGT GACATTAGGA CGCTGCCCGC AAGGCCTCTT TACTGATTGC CGCACCGGCT GGGAAATAGA CCCCGTCGCA TAA
|
Protein sequence | MSLPHLSLAD ARNLHLAAQG LLNKPRRRAS LEDIPATISR MSLLQIDTIN IVARSPYLVL FSRLGNYPAQ WLDESLARGE LMEYWAHEAC FMPRSDFRLI RHRMLAPEKM GWKYKDAWMQ EHEVEIAQLI QHIHDKGPVR SADFEHPRKG ASGWWEWKPH KRHLEGLFTA GKVMVIERRN FQRVYDLTHR VMPDWDDERD LVSQTEAEII MLDNSARSLG IFREQWLADY YRLKRPALAA WREARAELQQ IIAVHVEKLG NLWLHADLLP LLERALAGKL TATHSAVLSP FDPVVWDRKR AEQLFDFSYR LECYTPAPKR QYGYFVLPLL HRGQLVGRMD AKMHRQTGIL EVISLWLQEG IKPTTTLQKG LRQAITDFAN WQQATRVTLG RCPQGLFTDC RTGWEIDPVA
|
| |