Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1865 |
Symbol | |
ID | 6146650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1887771 |
End bp | 1889666 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616741 |
Product | hypothetical protein |
Protein accession | YP_001743919 |
Protein GI | 170681982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000000596823 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGTTG GGCTTTTTTT CTCTTCGTTG AGTTATGCGA AAAACACGGA GATCCCTTCT TATGAAGAAG GGATCTCTCT CTTTGATGTT GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG CGAAATCAGC AGATTAAGCA CGGATTTTAT CGTGATTTAC CACGACTCTG GATGCAGCCT GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACCCGTGA TGGTATTCCT GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGAGTA TTGTCGTGGG CGATAAACAA CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC CTGCGTGAAG GGGATTCTGA TCTGCTAATC TGGAACGTGA CCGGTAACCA CTGGCCGTTT GAAATTTATA AGACCCTATT TTCACTCAAG TTGCCAGATA TTGCGGGTAA TCCATTTAGC GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCCTTGAG GACGGAAGAA TTGAATCCCG CGATCCGTTT TATCGTGAAG ATTTCACGGT CCTCTACCGC TGGCCTCACG CTTTACTTGG TAATGCCCCG GCACCACAAA CGACGAATAT TTTCAGCCAT CTTCTTTTAC CCTCTACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTCTT CCTGGCGTGT GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGC CGGTAGATGT GATTGAAACC GATGTCATTC CGCCAGATTA CACACCCGGC ATGTTACGTC TCGATGCGAA GCTGGTTTAC GACGATAAAG GTTTTTGTGC CGATATCGTA AATCTGATTG TTAAAGGAAA AATTCATCTG GAAGATCAGT ATGACAAGAA CCAGCAAATC CTGATTTGTG TTAATGAAGG CGCGACCAGA AATAATGCGG TATTACTGCC CGCAGAGCAG TTATTACTGG AAGCGTTATT TCGTAAAGGC GATAAGGTCG TTCTTACGGG GAGACGCAAC AGAGTCTTAC GCAGGGCATT TTTACGGATG CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TCGTTTTATC GACCAGATAC GTTTTTGCAA TGGGGCGGAA TGGCGATATT GGCGGTCATT CTCTACGGTA ACCTGAGTCC CGTAGGTTGG GCAGGAATGA GTCTGGTTGG CGATATGTTT ATTATGATCT GCTGGCTTCT TCCTTTTTTA TTTTGTTCCC TTGAGCTTTT GTTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGTA ATCATCACTT TGTTTTTACC GCTGATTTGT TCAGGCGTGG CCTTTTATTC TCTCTATATC AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTTC CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG CAGCAACGTT ATGCCCACGG TGAAGCTATC GTTAACTATC TTGCGCGTAA AGAGGCAGCA ACACACAGTG GGCGGCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT TGGGCAGTCT CGGCAAACCT TGGAAAAGAA TGGGCAGCAC GTATCACCCC ATCACTCACA GCGGCTGTTC ACGCCCCGGA AATTGCCCGT AGTGGCGTTT TGTTTTCATT ACAGATGCAC CTGAGCCTGG GGGCCAATAC CAGTTTGTTG GGGCGAAGTT ATTCCGGTGG TGGTGCTGGC GGCGGGGCGG GTGGCGGAGG CGGTGGTGGC TGGTAA
|
Protein sequence | MAGKFRCILL LIVGLFFSSL SYAKNTEIPS YEEGISLFDV EATLQPDGVL DIKENIHFQA RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMSIVVGDKQ RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTLFSLK LPDIAGNPFS EIDLFTGEEG DTYRNGRILE DGRIESRDPF YREDFTVLYR WPHALLGNAP APQTTNIFSH LLLPSTSSLL IWFPCLFLAC GWLYLWKRRP QFTPVDVIET DVIPPDYTPG MLRLDAKLVY DDKGFCADIV NLIVKGKIHL EDQYDKNQQI LICVNEGATR NNAVLLPAEQ LLLEALFRKG DKVVLTGRRN RVLRRAFLRM QKFYLPRKKS SFYRPDTFLQ WGGMAILAVI LYGNLSPVGW AGMSLVGDMF IMICWLLPFL FCSLELLFAR DDDKPCVNRV IITLFLPLIC SGVAFYSLYI NVGDVFFYWY MPAGYFSAVF LTGYLTGMGY IFLPKFTQTG QQRYAHGEAI VNYLARKEAA THSGRRRKGE TRKLDYALLG WAVSANLGKE WAARITPSLT AAVHAPEIAR SGVLFSLQMH LSLGANTSLL GRSYSGGGAG GGAGGGGGGG W
|
| |