Gene RSP_3046 details

Gene Information

Locus tag: RSP_3046
Symbol: dorC
ID: 3721632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007494
Strand:
Start bp: 81663
End bp: 82841
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 64%
IMG OID: 640072722
Product: DMSO/TMAO pentaheme cytochrome c subunit
Protein accession: YP_354563
Protein GI: 77465060
COG category: [C] Energy production and conversion
COG ID: [COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit
TIGRFAM ID: [TIGR02162] trimethylamine-N-oxide reductase c-type cytochrome TorC
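
The length fields in this record agree with its coordinates, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(81663, 82841)    # 1179, matches "Gene Length"
length_aa = protein_length(length_bp)    # 392, matches "Protein Length"
print(length_bp, length_aa)
```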


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCAGCA GGATTTGGAA GGCTTTCTGG CGACCGAGCA CGAAATGGGG GCTCGGCGTC 
CTGCTCGTGA CCGGCGGCAT CGCCGGTGCG GTCGGATGGA ACGGGTTCCA CTATGTGGTG
GAAAAGACCA CCACGACGGA ATTCTGCATC AGCTGCCACT CGATGCGGGA CAACAACTAC
GAGGAATACA AGACCACCAT CCACTACCAG AACACCTCGG GCGTGCGGGC GGAATGCGCC
GACTGTCACG TCCCGAAATC CGGCTGGAAG CTCTACCGCG CGAAGCTCCT CGCCGCGAAG
GACCTCTGGG GCGAAATTCA GGGCACCATC GACACGCGTG AGAAGTTCGA GGCGCACCGG
CTCGAGATGG CCGAGACCGT CTGGGCCGAC ATGAAGGCCA ACGACTCGGC CACCTGCCGG
ACCTGCCACT CGTTCAACGC GATGGACTTC GCCCACCAGA AGCCCGAGGC CTCGAAGCAG
ATGCAGCAGG CGATGAACGA GGGCGGAACC TGCATCGACT GCCACAAGGG CATCGCCCAC
AAGCTGCCCG ACATGGCCAG CGGCTACCGC GCGCTGTTCT CGAAGCTCGA GAAGGCCTCG
CAGTCGCTCA AGCCCAGCAA GGGCGAGACG CTCTATCCGC TCCAGACCAT CGAGGCCTAT
CTCGAGCGGC CCTCGGGCGA CAAGGCGAAG GCCGACGGCC GGCTTCTGGC CGCGACGCCG
ATGCAGGTGG TCGACGTGAA GGGTGAGTGG GTGCAGGTCG CGGTGAAGGG CTGGCAGCAG
GAAGGCGCCG AGCGGGTCAT CTACGAGAAG CAGGGCAAGC GGATCTTCAA CGCCGCACTG
GCGCCGACGG CCACGGGCTC GATCGTGGCG GGCGCGTCCA TGGTCGATCC GGACACCGAA
CAGACCTGGA CGGATGTCTC GCTGACGGCC TGGGTGCGCA ACCGCGACCT GACCGACGAC
CAGGAAGCGC TCTGGCAGTA TGGCAAGCAG ATGTTCAACG GTGCCTGCGG CATGTGTCAC
GTCCTGCCCC ACACCGAGCA TTTCCTCGCC AACCAGTGGA TCGGCACGCT CAACGCCATG
AAGAGCCGGG CGCCGCTCGA TGACGAACAG TTCCGCCTCG TGCAGCGCTA CGTCCAGATG
CATGCGAAGG ACGTGGAACC GGAAGGAGCT GCGGAATGA
 
Protein sequence
MISRIWKAFW RPSTKWGLGV LLVTGGIAGA VGWNGFHYVV EKTTTTEFCI SCHSMRDNNY 
EEYKTTIHYQ NTSGVRAECA DCHVPKSGWK LYRAKLLAAK DLWGEIQGTI DTREKFEAHR
LEMAETVWAD MKANDSATCR TCHSFNAMDF AHQKPEASKQ MQQAMNEGGT CIDCHKGIAH
KLPDMASGYR ALFSKLEKAS QSLKPSKGET LYPLQTIEAY LERPSGDKAK ADGRLLAATP
MQVVDVKGEW VQVAVKGWQQ EGAERVIYEK QGKRIFNAAL APTATGSIVA GASMVDPDTE
QTWTDVSLTA WVRNRDLTDD QEALWQYGKQ MFNGACGMCH VLPHTEHFLA NQWIGTLNAM
KSRAPLDDEQ FRLVQRYVQM HAKDVEPEGA AE
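
The gene begins with GTG although the protein begins with M. That is expected under translation table 11, where GTG is one of the accepted start codons and an initiating codon is read as methionine. The sketch below checks this against the first 60 nt of the gene sequence. It is a minimal illustration and not the pipeline that produced this record: `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the start-codon set is the one commonly listed for table 11.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order. Table 11 uses the same
# assignments as the standard code and differs only in its start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Start codons commonly listed for translation table 11 (assumption).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_60_nt = (
    "GTGATCAGCA GGATTTGGAA GGCTTTCTGG CGACCGAGCA CGAAATGGGG GCTCGGCGTC"
)
print(translate_cds(first_60_nt))  # MISRIWKAFWRPSTKWGLGV
```

The output matches the first 20 residues of the protein sequence listed above.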