Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4402 |
Symbol | |
ID | 6146350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4494650 |
End bp | 4496383 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619223 |
Product | hypothetical protein |
Protein accession | YP_001746347 |
Protein GI | 170680859 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.256321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.491316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCCA CAGAAGTCCA GGCTAAACCC CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT AGTGGCACTA ACGGCATTCG CGACTCGCTG TTATTCAGTT CGCTGTGGTT GATCCCGGTA TTCCTCTTTC CGAAGCGGAT CAAAATTATT GCCGCAGTGA TCGGCGTGGT GCTATGGGCG GCCTCTCTGG CGGCGCTGTG CTACTACGTC ATCTACGGTC AGGAGTTCTC GCAGAGCGTT CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCTAGCG AGTATTTAAG CCAGTATTTC AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT TATGGCTTGA TTCTGCATCC GATCGCCATG AACACCATCA TCAAAGGCAA ACCGATTGAA AAAACGCTGG ATAGTCTGGC ATCGCGAATG GAACCCGCAG CACCATGGCA ATTTATTTCC GGCTACTACC AGTACCGCCA GCAACTTAAC TCGCTGACCA AATTACTCAA CGAAAACAAT GCGCTGCCGC CGCTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACCTTAGTG CTGGTGATTG GCGAGTCGAC CCAGCGTGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC GTGGTGACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT GAAAAGAACC CGGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACACAGAGT GCGCGTGAAT ACGACACTAA CGTGCTGAAG CCGTTCCAGG AAGTGCTGAA GGACCCTGCG CCGAAGAAAC TGATCATCGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TCCCGCCGGG GTTAAACGCG GAAGAGCTGG AGTCATATAA CGATTATGAC AACGCTAACC TGTATAACGA TCATGTGGTT GCCAGCCTGA TTAAAGACTT TAAAGCGGCA GACCCGAACG GATTCCTGGT TTACTTCTCT GACCACGGTG AAGAGGTTTA CGACACGCCG CCGCATAAAA CCCAGGGGCG TAATGAGGAC AACCCGACGC GCCACATGTA CACCATTCCG TTCCTGCTGT GGACGTCGGA AAAATGGCAA GCGACTCATC CGCGTGATTT CTCACAGGAT GTTGATCGTA AATACAGCCT GGCGGAATTG ATCCACACCT GGTCAGATTT AGCGGGCTTA TCTTACGACG GTTACGACCC AACCCGTTCA GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA CAAGAAAAAC GCGCTGATCG ATTACGACAC GCTGCCGTAT GGCGACCAGG TAGGTAATCA GTAA
|
Protein sequence | MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTIIKGKPIE KTLDSLASRM EPAAPWQFIS GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS AREYDTNVLK PFQEVLKDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLNA EELESYNDYD NANLYNDHVV ASLIKDFKAA DPNGFLVYFS DHGEEVYDTP PHKTQGRNED NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ
|
| |