Gene Information |
Locus tag | EcSMS35_1703 |
Symbol | |
ID | 6143842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1707032 |
End bp | 1708291 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641616579 |
Product | hypothetical protein |
Protein accession | YP_001743757 |
Protein GI | 170680907 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
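The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that agrees with the listed Gene Length) and that the final codon of the unspliced CDS is a stop codon. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon,
    which is an assumption about this record, not a documented guarantee.
    """
    # Inclusive coordinates: both endpoints count toward the length.
    gene_length_bp = end_bp - start_bp + 1

    # A complete CDS should be a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")

    # The stop codon does not contribute an amino acid.
    return gene_length_bp, codons - 1


# Values copied from the record: Start bp 1707032, End bp 1708291.
gene_bp, protein_aa = cds_lengths(1707032, 1708291)
print(gene_bp)     # 1260
print(protein_aa)  # 419
```

Both results match the Gene Length (1260 bp) and Protein Length (419 aa) fields listed in the record.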
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0566348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATA TTAATACAGC TTGTGTAAAA AATAATGCCA GTTATCAATT AAATAACGCA TTACCTAACA AAGAAACCAT CTCCAGCAAT TTTTGTGAAC GACTGGCACA ATGGGGTAAT AAGTCGCTTA ATAATGGTGA AGAAAGAGCA ATTGCCGTGG AGCGAATTAA AGAAGCTTAC AATTCGAACA TGGCATCTCT TGATTTGTCA TATCTTGATT TGAGTGAGTT ACCCCCTATC CCCTCGACAG TTAACACGTT AAATCTGGAA AACAATTGTC TCACTTGTCT TGACTTTACT GACAATGCCA GCCTCGTCAA TATCAACCTC AGCTTTAATA AAATTAAAAC GATAACCTTT CCAAATCAAT CAAAACTGGA AAATATTTAT ATTGATCACA ATAATCTGGA AAATTTGGAT TTAAAAAATC AGCTTTCATT GGTTAACCTG GAAGCGCAAA ATAACAACCT GACAAAAATT AATATTTCTG ATAGTTATAA ACTGAAATTT CTTAATCTTG ATTATAACAA ACTAGCATCG CTGGATCTCT CCCGGCAAGA ATCTCTGATT GAGTTAAGTG CCCATCACAA CATGATCAAT GACCTTATAT TACACAATCA CCCCATAGTG GAGAAAATCA CTTTAAACGA CAACCATATT GCACATTTAA ACGCGAAAAC CACTACAAAA CTGGAATATT TAAACTTAAG CAATAACAAT TTATTGCCAA CAGATGACAT TGATCAATTA ATATCATCAA AACATCTTTG GCATGTATTA GTTAACGGCA TCAACAATGA TCCACTTGCC CAAATGCAGT ACTGGACTGC AGTAAGAAAT ATAATTGATG ACACTAATGA AGTGACCATT GATTTATCTT ATAACCTGGC AATCACAAAT ATCGATACCA GCGATGAACA TCTTGTAGAA GTAAGCGAGA ATTCCGAAGG AAATCATATA AAAGAAAATG ACTCAATGTC TATTCGTTAT AGATCAAAAT ATTATTCCAG AGAGTACGCC TTAATAGAAG AAGAAACAAT ATTTTCTGAC GCAGAACTAA AAGCTATTCT GCCTATGCGT CGCATGTACG GGGTTGGTGA CTATAAGTCA AATTCCTCTT CTCTACCCTC ACACTCGGGG CTAAAGGACC CAACGGGCAC ACCCGTCTGT TATTATATTC ATAATGAGGA TAAACCTTCC TTAGGTTTTG GTCCAACATC CAATAATTGG TTAAGCCAAT CCTTTACAAC AGAGTTATAA
|
Protein sequence | MTNINTACVK NNASYQLNNA LPNKETISSN FCERLAQWGN KSLNNGEERA IAVERIKEAY NSNMASLDLS YLDLSELPPI PSTVNTLNLE NNCLTCLDFT DNASLVNINL SFNKIKTITF PNQSKLENIY IDHNNLENLD LKNQLSLVNL EAQNNNLTKI NISDSYKLKF LNLDYNKLAS LDLSRQESLI ELSAHHNMIN DLILHNHPIV EKITLNDNHI AHLNAKTTTK LEYLNLSNNN LLPTDDIDQL ISSKHLWHVL VNGINNDPLA QMQYWTAVRN IIDDTNEVTI DLSYNLAITN IDTSDEHLVE VSENSEGNHI KENDSMSIRY RSKYYSREYA LIEEETIFSD AELKAILPMR RMYGVGDYKS NSSSLPSHSG LKDPTGTPVC YYIHNEDKPS LGFGPTSNNW LSQSFTTEL
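The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three 10-base blocks of the gene sequence are used as input.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError for codons with ambiguous bases such as N.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGACCAATA TTAATACAGC TTGTGTAAAA"
print(translate(first_blocks))  # MTNINTACVK
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content field, though the database may round or compute that value differently.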