Gene Information |
Locus tag | EcSMS35_0623 |
Symbol | |
ID | 6143010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 636203 |
End bp | 637423 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641615515 |
Product | phosphoadenosine phosphosulfate reductase family protein |
Protein accession | YP_001742721 |
Protein GI | 170680017 |
COG category | [R] General function prediction only |
COG ID | [COG3969] Predicted phosphoadenosine phosphosulfate sulfotransferase |
TIGRFAM ID | |
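The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state these conventions, and the function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(636203, 637423)
print(gene_len, protein_len)  # 1221 406
```

Under these assumptions the result agrees with the Gene Length (1221 bp) and Protein Length (406 aa) fields.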
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATTT ATAAAATCCC GCTTCCACTG AATATTCTGG AAGCTGCAAA AGAACGTATT ACCTGGACAC TTAATACCCT ACCCCGTATC TGTGTTTCTT TTTCGGGTGG TAAAGATTCC GGATTAATGC TGCATCTGAC CGCCGAAATT GCCCGACAAA TGGGCAAAAA AATCTGCGTT TTGTTTATCG ACTGGGAAGC GCAGTTCTCA TGCACTATTA ACTATGTTCA GTCATTACGT GAGTTTTACG CCGATGTCAT CGAAGAGTTT TACTGGGTTG CGCTCCCGCT TACGACGCAA AATTCCCTTT CTCAATACCA ACCCGAATGG CAGTGCTGGG AACCTGATGT CGAATGGGTA CGTCAGCCCC CGCAAGATGC GATAACCGAT CCTGACTTTT TCTCCTTTTA CCAGCCTGGC ATGACCTTCG AACAATTTGT GCGTGAGTTT GCCGAATGGT TTTCGCAAAA ACGTCCGGCG GCGATGATGA TCGGCATCCG TGCGGATGAG TCCTACAACC GTTTTGTCGC CATCGCCAGT TTAAATAAAC AACGTTTTGC CGATGATAAG CCCTGGACTA CTGCCGCACC AGGCGGTCAT AGCTGGTACA TTTACCCCAT CTACGACTGG AAAGTCGCTG ATATCTGGAC CTGGTATGCT AACCATCAAC AGCTCTGTAA CCCACTGTAT AATATAATGT ATCAGGCAGG CGTTCCTCTG CGTCATATGC GAATTTGCGA ACCTTTTGGC CCGGAGCAAC GGCAAGGATT ATGGCTCTAT CACGTTATCG AACCGGATCG CTGGGCTGCT ATGTGCGCAC GAGTCAGCGG GGTAAAAAGT GGCGGCATTT ACGCCGGACA TGATAATCAT TTTTATGGTC ATCGGAAAAT CCTCAAGCCA GAACATTTAG ACTGGCAAGA ATATGCATTA TTGCTGCTCA ATAGCATGCC GGAAAAAACA GCTGAGCATT ACCGCAATAA AATCGCCATT TATTTGCACT GGTATCAGAA AAAAGGCATC GAAGTTCCAC AAACCCAGCA AGGGGACATT GGCGCGAAAG ATATCCCCTC CTGGCGGCGA ATATGCAAAG TTTTACTCAA TAACGATTAC TGGTGCCGGG CATTATCATT TAGTCCTACG AAAGCGAAGA ACTATCAGCG TTATAACGAA CGGATAAAAG GAAAACGTCA GGAATGGGGG ATACTATGCA ACAACGATTA A
|
Protein sequence | MSIYKIPLPL NILEAAKERI TWTLNTLPRI CVSFSGGKDS GLMLHLTAEI ARQMGKKICV LFIDWEAQFS CTINYVQSLR EFYADVIEEF YWVALPLTTQ NSLSQYQPEW QCWEPDVEWV RQPPQDAITD PDFFSFYQPG MTFEQFVREF AEWFSQKRPA AMMIGIRADE SYNRFVAIAS LNKQRFADDK PWTTAAPGGH SWYIYPIYDW KVADIWTWYA NHQQLCNPLY NIMYQAGVPL RHMRICEPFG PEQRQGLWLY HVIEPDRWAA MCARVSGVKS GGIYAGHDNH FYGHRKILKP EHLDWQEYAL LLLNSMPEKT AEHYRNKIAI YLHWYQKKGI EVPQTQQGDI GAKDIPSWRR ICKVLLNNDY WCRALSFSPT KAKNYQRYNE RIKGKRQEWG ILCNND
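The relationship between the gene and protein sequences can be sketched in a few lines of Python. This is an illustration only, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the Strand field is "-"), and it uses the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 30 bases of the gene sequence in this record.
head = "ATGTCTATTT ATAAAATCCC GCTTCCACTG"
tail = "GAATGGGGG ATACTATGCA ACAACGATTA A"
print(translate(head))  # MSIYKIPLPL
print(translate(tail))  # EWGILCNND*
```

The two fragments translate to the first ten and last nine residues of the listed protein, followed by the stop codon. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field (47%).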