Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1313 |
Symbol | torY |
ID | 6145393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1302228 |
End bp | 1303328 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616191 |
Product | trimethylamine N-oxide reductase III, c-type cytochrome subunit TorY |
Protein accession | YP_001743371 |
Protein GI | 170682004 |
COG category | [C] Energy production and conversion |
COG ID | [COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.563249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGC GGCGGGTTAT TGCTGGCGCA AAAAGCCTTA CATAAAACGT CGGATACCGC ATTTTGTCTT TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGCCCA CTTTTCGAAC CAGAAAGGTA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT TTATTTGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA GACAGTGACG ATAAGTTCGA AGCTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA TTAAAAGCAA CTGACTCTGC AACCTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT GCTTCGCAAA GTGAATCTGC GCAGAAAATG CATAATAAAG CGCAAAAGGA CGGCGAAACT TGTATCGATT GCCATAAAGG CATTGCCCAT TTCCCGCCAG AAATAAAAAT GGATGACAAC GCGGCGCATG AGCTGGAAAG TCAGGCCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT CCTTTCAAAA CTTCTCGCAT AGGCGAACTG GCTACCGTGA CTCCTGGTAC CGATCTCACC GTCGTTGATG CCAGTGGCAA ACAGCCAATT GTTCGGTTGC AGGGTTATCA AATGCAGGGC AGTGAAAACA CGCTCTACCG GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA GAAGGTATCA AGGCGCTAGC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACCG CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGCTGCCA TGCCCCTATT GCCGCCGACC ATTACACCGT CAATACATGG CCGTCCATTG CCAAAGGAAT GGGTGCGCGA ACCAGCATGA GCGAAAACGA ACTGGACATT TTGACGCGGT ACTTCCAGTA CAACGCCAAA GATATTACCG AGAAACAGTG A
|
Protein sequence | MRGKKRIGLL FLLIAVVVGG GGLLLAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTAHFSN QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDKFEAHR QEMAETVWKE LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKDGET CIDCHKGIAH FPPEIKMDDN AAHELESQAA TSVTNGAHIY PFKTSRIGEL ATVTPGTDLT VVDASGKQPI VRLQGYQMQG SENTLYRAAG QRLALATLSE EGIKALAVNG EWQADEYGNQ WRQASLQGAL TDPALADRKP LWQYAEKLDD TYCAGCHAPI AADHYTVNTW PSIAKGMGAR TSMSENELDI LTRYFQYNAK DITEKQ
|
| |