Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2128 |
Symbol | torC |
ID | 6143592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2137374 |
End bp | 2138546 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617004 |
Product | cytochrome c-type protein torC |
Protein accession | YP_001744179 |
Protein GI | 170680538 |
COG category | [C] Energy production and conversion |
COG ID | [COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit |
TIGRFAM ID | [TIGR02162] trimethylamine-N-oxide reductase c-type cytochrome TorC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAAC TCTGGAACGC GTTACGCCGA CCCAGTGCTC GTTGGTCGGT ACTGGCGCTG GTCGCTATTG GGATTGTGAT TGGCATTGCG CTGATTGTAT TGCCACACGT TGGGATCAAA GTCACCAGCA CAACCGAATT TTGTGTCAGT TGCCACAGTA TGCAACCGGT GTATGAAGAA TATAAACAGT CGGTGCATTT CCAGAACGCC TCCGGCGTGC GAGCTGAATG CCATGACTGT CATATCCCGC CGGATATTCC AGGCATGGTG AAGCGCAAAC TGGAAGCGAG CAACGACATT TACCAGACCT TTATTGCCCA CTCCATTGAT ACACCAGAGA AATTTGAAGC CAAACGCGCG GAACTTGCCG AGCGTGAATG GGCGCGAATG AAAGAAAACA ACTCGGCAAC CTGCCGCTCC TGCCATAACT ACGACGCGAT GGATCATGCG AAGCAGCATC CTGAAGCGGC ACGTCAGATG AAGGTGGCAG CGAAAGATAA TCAATCCTGT ATCGACTGTC ATAAAGGTAT TGCCCACCAG TTACCGGATA TGAGTAGTGG CTTCCGTAAG CAGTTCGATG AGCTGCGCGC CAGTGCTAAT GATAGTGGTG ACACGCTGTA CTCCATTGAT ATTAAGCCGA TTTATGCGGC GAAAGGCGAT AAAGAAGCCT CTGGTTCTCT GCTGCCTGCT TCGGCAGTGA AAGTCATTAA ACGTGACGGC GACTGGCTGC AAATTGAAAT CACTGGCTGG ACGGAAAGTG CAGGACGTCA GCGTGTACTC ACCCAATTCC CAGGTAAACG CATCTTTGTT GCCTCGATTC GTGGTGATGT GCAGCAGCAG GTGAAAACGC TGGAGAAAAC CACCGTTGCC GACACCAATA CCGAGTGGAG CAAGTTGCAA GCCACTGCGT GGATGAAGAA AGGCGACATG GTAAACGATA TCAAACCGAT CTGGGCTTAT GCGGATTCGT TGTACAACGG CACCTGTAAC CAGTGCCACG GCGCACCGGA AATCTCTCAC TTTGACGCTA ACGGTTGGAT CGGCACGCTC AACGGCATGA TTGGCTTTAC CAGCCTCGAT AAACGTGAAG AACGCACCTT GTTGAAATAT CTGCAAATGA ATGCGTCTGA TACCGCAGGT AAGGCTCACG GCGATAAGAA GGAAGAAAAA TAA
|
Protein sequence | MRKLWNALRR PSARWSVLAL VAIGIVIGIA LIVLPHVGIK VTSTTEFCVS CHSMQPVYEE YKQSVHFQNA SGVRAECHDC HIPPDIPGMV KRKLEASNDI YQTFIAHSID TPEKFEAKRA ELAEREWARM KENNSATCRS CHNYDAMDHA KQHPEAARQM KVAAKDNQSC IDCHKGIAHQ LPDMSSGFRK QFDELRASAN DSGDTLYSID IKPIYAAKGD KEASGSLLPA SAVKVIKRDG DWLQIEITGW TESAGRQRVL TQFPGKRIFV ASIRGDVQQQ VKTLEKTTVA DTNTEWSKLQ ATAWMKKGDM VNDIKPIWAY ADSLYNGTCN QCHGAPEISH FDANGWIGTL NGMIGFTSLD KREERTLLKY LQMNASDTAG KAHGDKKEEK
|
| |