Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3952 |
Symbol | |
ID | 6143539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4029318 |
End bp | 4030277 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618777 |
Product | polysaccharide deacetylase family protein |
Protein accession | YP_001745916 |
Protein GI | 170683133 |
COG category | [S] Function unknown |
COG ID | [COG2861] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.652766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTCCAT TTCGTCGTAA CGTTCTTGCA TTTGCCGCTC TGTTGGCGCT CTCCTCCCCC GTAATTGCTG GCAAACTTGC CATCGTCATT GATGATTTTG GGTATCGCCC GCACAACGAA AACCAGGTGC TGGCGATGCC TTCCGCTATC TCCGTCGCTG TATTACCCGA TTCACCGCAC GCCAGAGAGA TGGCGACCAA AGCGCATAAC AGCGGTCACG AAGTGTTGAT TCATCTACCG ATGGCACCAC TGAGTAAACA GCCGCTGGAG AAAAATACGC TACGCCCGGA GATGAGCAGC GACGAAATTG AGCGCATTAT TCGTAGTGCG GTCAATAACG TGCCCTATGC CGTGGGGATC AACAACCACA TGGGCAGCAA GATGACCTCT AACCTGTTTG GTATGCAGAA AGTGATGCAG GCGCTGGAGC GTTACAATCT TTACTTCCTC GACAGCGTAA CCATCGGTAA TACCCAGGCG ATGCGCGCCG CGCAAGGCAC TGGCGTGAAG GTGATCAAAC GGAAGGTGTT CCTCGACGAT TCGCAAAATG AAGCGGACAT CCGTGTGCAA TTCAATCGCG CAATTGACCT GGCGCGTCGC AACGGTTCAA CCATTGCCAT TGGACATCCT CACCCTTCAA CGGTACGCGT GCTACAACAG ATGGTTTATA ACCTGCCGCC AGACATTACG CTGGTGAAAG CCAGCAGCTT GTTGAATGAA CCGCAGGTTG ATACGTCTAC ACCGCCGAAA AACACTGTGC CCGACACACC GCGTAATCCA TTCCGTGGCG TGAAGCTGTG CAAACCGAAG AAGCCGCTGG AACCTGTTTA TGCTAATCGC TTCTTTGAAG TATTAAGCGA AAGCATCAGC CAGAGCACGC TGATCGTTTA CTTCCAGCAT CAGTGGCAAG GCTGGGGCAA ACAGCCTCAA GCGGCGAAGC TTAACGCTAG CGCAAATTAA
|
Protein sequence | MFPFRRNVLA FAALLALSSP VIAGKLAIVI DDFGYRPHNE NQVLAMPSAI SVAVLPDSPH AREMATKAHN SGHEVLIHLP MAPLSKQPLE KNTLRPEMSS DEIERIIRSA VNNVPYAVGI NNHMGSKMTS NLFGMQKVMQ ALERYNLYFL DSVTIGNTQA MRAAQGTGVK VIKRKVFLDD SQNEADIRVQ FNRAIDLARR NGSTIAIGHP HPSTVRVLQQ MVYNLPPDIT LVKASSLLNE PQVDTSTPPK NTVPDTPRNP FRGVKLCKPK KPLEPVYANR FFEVLSESIS QSTLIVYFQH QWQGWGKQPQ AAKLNASAN
|
| |