Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0127 |
Symbol | |
ID | 6143926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 139712 |
End bp | 141469 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615028 |
Product | hypothetical protein |
Protein accession | YP_001742244 |
Protein GI | 170684092 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.146537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCGC TAATTTGCAG TGCCGGGCTT TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGCC TGTGGAATCG CAATCGACAC AACAGACTGC GCCTGACATC TCCGCAGTTA CGCTTCCTGC AACGGTTTCC GCGCCTCCCG TAACGCCTGC CGTCGTCAAA TCTACATTCA GCACTGCACA AATAGATCAA TGGGTTGCGC CTGTCGCGCT GTACCCCGAC TCTCTGCTTT CACAAGTGTT AATGGCATCA ACCTATCCGG CAAACGTTGC GCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCCTGGG ACGCCAGCGT CAAATCACTG GTGGCCTTTC CACAGTTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG GGCGATGCTT TTCTGGCGCA GCCGCAGGAC GTGATGGACT CGGTACAACG ATTGCGGCAA CTGGCGCAAC AAACCGGTTC GCTAAAGTCA TCAACTGAAC AAAAGATCAT CACGACAACG AAAAAAGTCG TACCGGTAAA TCAGCCCGCC AACGCCCCTG TTACACAATC AAATACCGTT TCTACGTCCA GCCCCGTCGT CGCAGAACCT GCACCGACCG TAATAACCAT TGAGCCAGCC AATCCGGATG TGGTTTATAT TCCGAACTAC AACCCAAACG TGGTTTACGG CAGTTGGGCC AATACCGCTT ATCCGCCAGT TTATCTGCCG CCGCCAGCCG GAGAACCGTT TGTTGACAGC TTTGTGCGCG GATTCGGCTA TAGCATGGGC GTTGCTACCA CATACGCACT ATTCAGCAGC ATCGACTGGG ACGACGACGA TCATGACCAT CATCATCATG ACGATGATGA TTATCATCAC CACGATGGCG GTCATCGTGA CGGTAATGAC TGGCAACACA ACGGCGACAA CATCAATATC GACGTCAACA ATTTCAACCG TATCACTGGT GAGCATCTTA CTGATAAGAA TATGGCATGG CGGCACAATC CAAACTACCG TGATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG TTTCACCAAA CCGATGTCAA CGGCGGCATG AGCGCCACGC AGTTACCTGC TCTATCGCGC GACAGTCAGC GTCAGGCAGC AGCAAGTCAG TTTCAGCAAC GAACACACAC TGCGCCAGTC ATTACGCGAG ATACTCAACG TCAGGCTGCG GCACAGCGAT TTAATGAAGC GGAACATTAT GGGAGCTATG ACGACTTCCG CGAGTTCAGC CGTCGCCAAC CACTGACCCA GCAACAAAAG GACGCTGCTC GTCAGCGTTA TCAGTCGGCC TCGCCTGAGC AGCGCCAGGC AGTCCGCGAG AAAATGCAGA CTAACCCGCA GAACCAGCAG CGAAGAGAGG TAGCGCGTGA GCGTATTCAG TCTGCAACTC CCGAACAGCG CCAGGTGTTT AAGGAAAAAG TACAGCAGCG CCCACTGAAC CAACAGCAAC GTGATAACGC CCGCCAGCGT ATACAATCGG CATCACCTGA ACAACGTCAG GTTTTTCGGG AGAAAGTTCA GGAGAGCCGC CCACAACGTC TAAACGACAG TAACCATACT GTCAGGCTGA ATAACGAGCA ACGGTCAGCA GTACGCGAAC GTCTCTCTGA GCGCGGAGCA AGGCGACTGG AAAGGTAA
|
Protein sequence | MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAPVES QSTQQTAPDI SAVTLPATVS APPVTPAVVK STFSTAQIDQ WVAPVALYPD SLLSQVLMAS TYPANVAQAV QWSHDNPLKQ GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ LAQQTGSLKS STEQKIITTT KKVVPVNQPA NAPVTQSNTV STSSPVVAEP APTVITIEPA NPDVVYIPNY NPNVVYGSWA NTAYPPVYLP PPAGEPFVDS FVRGFGYSMG VATTYALFSS IDWDDDDHDH HHHDDDDYHH HDGGHRDGND WQHNGDNINI DVNNFNRITG EHLTDKNMAW RHNPNYRDGV PYHDQDMAKR FHQTDVNGGM SATQLPALSR DSQRQAAASQ FQQRTHTAPV ITRDTQRQAA AQRFNEAEHY GSYDDFREFS RRQPLTQQQK DAARQRYQSA SPEQRQAVRE KMQTNPQNQQ RREVARERIQ SATPEQRQVF KEKVQQRPLN QQQRDNARQR IQSASPEQRQ VFREKVQESR PQRLNDSNHT VRLNNEQRSA VRERLSERGA RRLER
|
| |