Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1676 |
Symbol | |
ID | 6143933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1671347 |
End bp | 1672504 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641616552 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001743730 |
Protein GI | 170682242 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGTTA CAGCCAAGCC CTCCAGTTTT CAATGTAATC TCAAATGTGA TTACTGTTTT TACCTTGAAA AAGAGTCGCA GTTTACTCAT GAAAAATGGA TGGATGACAG CACTCTGAAA GAGTTCATCA AACAATATAT CGCAGCGTCT GGCAATCAGG TCTATTTTAC CTGGCAAGGC GGTGAACCCA CTCTGGCTGG CCTGGATTTT TTCCGTAAAG TTATTCACTA TCAACAACGC TATGCAGGCC AAAAACGTAT TTTTAATGCA TTACAAACGA ATGGCATTTT ATTGAATAAT GAATGGTGTG CCTTTCTCAA AGAACATGAG TTTCTGGTTG GTATCTCGAT CGATGGACCC CAGGAGCTAC ATGACCGTTA CAGACGCAGT AATTCAGGTA ACGGTACTTT TGCAAAAGTG ATAGCAGCCA TCGAGCGTCT GAAATCATAT CAAGTAGAGT TTAATACGTT AACCGTCATT AATAACGTTA ATGTCCATTA CCCTCTTGAG GTTTATCATT TTTTAAAATC TATCGGCAGT AAACATATGC AATTTATCGA ATTGCTCGAA ACCGGGACGC CGAATATTGA TTTCAGTGGT CATAGTGAGA ACACATTCCG TATCATTGAT TTTTCTGTGC CTCCTACGGC TTATGGCAAG TTTATGTCAA CCATTTTTAT GCAATGGGTT AAAAACGATG TGGGTGAAAT TTTCATCCGT CAGTTTGAAA GCTTTGTCAG CCGTTTTTTG GGGAATGGGC ATACCAGTTG TATTTTCCAG GCGTCCTGCA AGGATAATCT GGTTGTTGAA AGTAATGGAG ACATTTACGA ATGCGACCAT TTTGTCTATC CACAGTACAA AATTGGAAAC ATTAATAAAT CTGAACTCAA AACGATGAAC AGTGTTCAAC TGACAGCGCA AAAAAAACGG ATTTCAGCGA AATGTCAGCA ATGTGCATAT AAACCTATCT GCAATGGTGG TTGCCCTAAG CATCGTATTA CTAAAGTAAA CAATGAGACT GTTTCCTATT TTTGCGAAGG TTATAAAATC CTTTTTTCAA CCATGGTACC TTATATGAAC GCCATGGTGG AGTTAGCTAA GAACAGAGTA CCGCTTTACC ACATTATGGA TGTTGCAAAA CAAATGGAGA ATAATTAA
|
Protein sequence | MHVTAKPSSF QCNLKCDYCF YLEKESQFTH EKWMDDSTLK EFIKQYIAAS GNQVYFTWQG GEPTLAGLDF FRKVIHYQQR YAGQKRIFNA LQTNGILLNN EWCAFLKEHE FLVGISIDGP QELHDRYRRS NSGNGTFAKV IAAIERLKSY QVEFNTLTVI NNVNVHYPLE VYHFLKSIGS KHMQFIELLE TGTPNIDFSG HSENTFRIID FSVPPTAYGK FMSTIFMQWV KNDVGEIFIR QFESFVSRFL GNGHTSCIFQ ASCKDNLVVE SNGDIYECDH FVYPQYKIGN INKSELKTMN SVQLTAQKKR ISAKCQQCAY KPICNGGCPK HRITKVNNET VSYFCEGYKI LFSTMVPYMN AMVELAKNRV PLYHIMDVAK QMENN
|
| |