Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3400 |
Symbol | |
ID | 6147101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3481264 |
End bp | 3482250 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618229 |
Product | hypothetical protein |
Protein accession | YP_001745378 |
Protein GI | 170682085 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0435] Predicted glutathione S-transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAAC TGATTGACGG CGTCTGGCAT GACACCTGGT ACGATACCAA ATCTACCGGC GGTAAATTTC AACGTTCAGC TTCCGCATTT CGTAACTGGC TCACTGCCGA TGGCGCGCCA GGCCCCACTG GCAAAGGCGG TTTTATCGCA GAGAAAGATC GTTATCATCT CTATGTTTCA CTCGCCTGCC CGTGGGCGCA CCGCACGCTG ATCATGCGTA AACTCAAAGG ACTGGAACCG TTTATTTCCG TTTCCGTAGT GAACCCGCTG ATGCTGGAAA ACGGCTGGAC CTTTGATGAC AGTTTTCCGG GAGCAACCGG CGACACGCTC TATCAACATG AATTTCTGTA TCAGCTTTAT CTCCACGCCG ATCCACACTA CAGCGGACGA GTGACTGTTC CCGTGCTGTG GGACAAAAAG AACCACACCA TCGTCAGCAA CGAATCAGCA GAAATCATAC GCATGTTCAA TACCGCGTTT GATGCGCTGG GCGCGAAAGC GGGTGATTAC TACCCACCAG CCCTGCAAAC GAAAATTGAC GAACTTAACG GCTGGATTTA TGACACTGTT AACAACGGCG TGTATAAAGC CGGTTTTGCC ACCAGCCAGC AAGCTTACGA CGAGGCGGTG GCGAAAGTGT TTGAATCGCT GGCGCGACTG GAACAGATTT TAGGTCAGCA CCGTTACCTG ACCGGCAACC AGCTAACCGA AGCCGATATT CGCCTGTGGA CCACGCTGGT GCGTTTTGAT CCAGTGTATG TGACCCACTT CAAGTGTGAT AAGCACCGCA TCAGCGATTA CCTGAATCTG TATGGCTTCC TGCGCGATAT CTACCAGATG CCGGGAATTG CCGAAACAGT CAATTTCGAT CACATCCGTA ATCATTACTT CCGCAGCCAT AAGACCATCA ACCCTACGGG GATTATTTCA ATTGGTCCGT GGCAGGATCT CGATGAACCG CATGGACGAG ATGTTCGCTT CGGTTAA
|
Protein sequence | MGQLIDGVWH DTWYDTKSTG GKFQRSASAF RNWLTADGAP GPTGKGGFIA EKDRYHLYVS LACPWAHRTL IMRKLKGLEP FISVSVVNPL MLENGWTFDD SFPGATGDTL YQHEFLYQLY LHADPHYSGR VTVPVLWDKK NHTIVSNESA EIIRMFNTAF DALGAKAGDY YPPALQTKID ELNGWIYDTV NNGVYKAGFA TSQQAYDEAV AKVFESLARL EQILGQHRYL TGNQLTEADI RLWTTLVRFD PVYVTHFKCD KHRISDYLNL YGFLRDIYQM PGIAETVNFD HIRNHYFRSH KTINPTGIIS IGPWQDLDEP HGRDVRFG
|
| |