Gene Rsph17029_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3775 
Symbol 
ID4898899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp898107 
End bp900575 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content67% 
IMG OID640114379 
Productmolybdopterin guanine dinucleotide-containing S/N-oxide reductases 
Protein accessionYP_001045627 
Protein GI126464514 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0227198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.692951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAGT TGTCAGGTCA GGAGCTGCAT GCCGAACTCT CGCGGCGCGC CTTCCTGAGC 
TATACGGCGG CTGTGGGGGC TCTCGGTCTC TGCGGCACCT CGCTCCTCGC GCAGGGAGCC
CGCGCGGAAG GTCTCGCCAA CGGCGAGGTC ATGTCGGGCT GCCACTGGGG CGTGTTCAAG
GCCCGGGTCG AGAACGGCCG CGCCGTGGCC TTCGAGCCCT GGGACAAGGA CCCCGCGCCG
TCGCACCAGC TGCCGGGCGT GCTCGATTCG ATCTATTCGC CCACGCGGAT CAAATATCCG
ATGGTGCGCC GCGAATTCCT CGAGAAGGGC GTGAATGCCG ACCGCTCCAC CCGCGGCAAC
GGCGATTTCG TCCGCGTCAC CTGGGATGAG GCGCTCGACC TCGTGGCCAA GGAGCTGAAG
CGCGTTCAGG AAAGCTACGG GCCCACCGGC ACCTTCGGCG GCTCCTACGG CTGGAAGAGC
CCGGGCCGGC TGCACAACTG TCAGGTCCTC ATGCGCCGCG CGCTGAATCT CGCGGGCGGG
TTCGTGAACT CGTCGGGCGA CTATTCGACC GGCGCCGCGC AGATCATCAT GCCGCATGTC
ATGGGCACGC TCGAGGTCTA CGAGCAGCAG ACCGCCTGGC CCGTGGTGGT GGACAACACC
GAACTCATGG TCTTCTGGGC CGCCGATCCG GTGAAGACCA ACCAGATCGG CTGGGTGGTC
CCCGACCATG GCGCCTTCGC GGGCATGCAG GCAATGAAGG AAAAGGGCAC CAAGGTCATC
TGCATCAACC CCGTGCGCAC CGAGACGGCC GACTATTTCG GCGCCGAACT CGTGTCGCCG
CGGCCGCAGA CCGACGTGGC GCTGATGCTC GGCATGGCGC ACACGCTCTA CAGCGAGGAT
CTGCACGACA AGGACTTCAT CGAGAACTGC ACTTCGGGCT TCGACATCTT CGCGGCCTAC
CTGACCGGTG AAAGCGACGG CACGCCCAAG ACGGCCGAAT GGGCCGCCGA GATCTGCGGC
CTGCCGGCCG AGCAGATCAA GGAGCTCGCC CGCCGCTTCG TGGGCGGCCG GACGATGCTC
GCCGCGGGCT GGTCGATCCA GCGGATGCAC CATGGCGAAC AGGCGCACTG GATGCTCGTC
ACGCTCGCCT CGATGATCGG CCAGATCGGT CTTCCGGGCG GCGGCTTCGG CCTCAGCTAC
CACTACTCCA ACGGTGGCTC GCCCACGAGC GACGGCCCGG CGCTGGGCGG GATCTCGGAC
GGCGGGAAGG CGGTCGAAGG CGCGGCCTGG CTGTCGGAGA GCGGCGCGGC CTCGATCCCC
TGTGCCCGCG TGGTGGACAT GCTGCTCAAT CCGGGCGGCG AGTTCCAGTT CAACGGCGCC
ACGGCGACCT ATCCCGACGT GAAGCTGGCC TACTGGGTGG GCGGCAACCC CTTCGCGCAC
CACCAGGACC GCAACCGGAT GCTCAAGGCC TGGGAAAAGC TCGAGACCTT CATCGTGCAG
GACTTCCAGT GGACCGCCAC CGCGCGCCAC GCCGACATCG TCCTGCCGGC GACGACCTCC
TACGAACGCA ACGACATCGA GTCGGTGGGC GACTATTCGA ACCGCGCCAT CCTCGCGATG
AAGAAGGTGG TCGATCCGCT CTACGAGGCC CGGTCGGACT ACGACATCTT CGCAGCCCTG
ACCGAGCGTC TGGGCAAGGG CAAGGAATTC ACCGAAGGCC GCGACGAGAT GGGCTGGATC
AGCTCGTTCT ACGAGGCGGC GGTGAAGCAG GCCGAGTTCA AGCAGATGGA GATGCCGTCG
TTCGAGGACT TCTGGTCGGA AGGGATCGTC GAGTTCCCGA TCACCGAGGG TGCGAACTTC
GTCCGCTATG CCGACTTCCG CGAGGATCCG CTGTTCAACC CGCTCGGCAC GCCCTCGGGC
CTGATCGAGA TCTACTCGAA GAACATCGAG AAGATGGGCT ATGACGATTG CCCGGCCCAT
CCGACCTGGA TGGAACCGGC CGAGCGTCTC GGCGGGCCGG GGGCGAAATA TCCGCTCCAT
GTGGTGGCGA GCCACCCGAA CTCGCGGCTG CACTCGCAGC TGAACGGCAC CTCGCTGCGT
GACCTCTATG CGGTGGCGGG GCACGAGCCC TGTCTCATCA ACCCCGACGA TGCGGCCGCG
CGCGGCATCG CGGACGGCGA TGTGCTGCGG GTGTTCAACG ACCGCGGGCA GATCCTCGTG
GGCGCGAAGG TGAGCGACGC GGTGATGCCG GGCGCGATCC AGGTCTACGA GGGCGGCTGG
TACGACCCGC TCGACCCCTC GGAGGAAGGC ACGCTCGACA AATACGGCGA CGTGAACGTG
CTGTCGCTCG ACGTCGGCAC CTCGAAGCTG GCGCAGGGCA ACTGCGGCCA GACCATCTTG
GCGGATGTCG AGAAATATGC GGGCGCGCCG GTGACGGTGA CCGTGTTCGA CACGCCGAAG
GACGCCTGA
 
Protein sequence
MTKLSGQELH AELSRRAFLS YTAAVGALGL CGTSLLAQGA RAEGLANGEV MSGCHWGVFK 
ARVENGRAVA FEPWDKDPAP SHQLPGVLDS IYSPTRIKYP MVRREFLEKG VNADRSTRGN
GDFVRVTWDE ALDLVAKELK RVQESYGPTG TFGGSYGWKS PGRLHNCQVL MRRALNLAGG
FVNSSGDYST GAAQIIMPHV MGTLEVYEQQ TAWPVVVDNT ELMVFWAADP VKTNQIGWVV
PDHGAFAGMQ AMKEKGTKVI CINPVRTETA DYFGAELVSP RPQTDVALML GMAHTLYSED
LHDKDFIENC TSGFDIFAAY LTGESDGTPK TAEWAAEICG LPAEQIKELA RRFVGGRTML
AAGWSIQRMH HGEQAHWMLV TLASMIGQIG LPGGGFGLSY HYSNGGSPTS DGPALGGISD
GGKAVEGAAW LSESGAASIP CARVVDMLLN PGGEFQFNGA TATYPDVKLA YWVGGNPFAH
HQDRNRMLKA WEKLETFIVQ DFQWTATARH ADIVLPATTS YERNDIESVG DYSNRAILAM
KKVVDPLYEA RSDYDIFAAL TERLGKGKEF TEGRDEMGWI SSFYEAAVKQ AEFKQMEMPS
FEDFWSEGIV EFPITEGANF VRYADFREDP LFNPLGTPSG LIEIYSKNIE KMGYDDCPAH
PTWMEPAERL GGPGAKYPLH VVASHPNSRL HSQLNGTSLR DLYAVAGHEP CLINPDDAAA
RGIADGDVLR VFNDRGQILV GAKVSDAVMP GAIQVYEGGW YDPLDPSEEG TLDKYGDVNV
LSLDVGTSKL AQGNCGQTIL ADVEKYAGAP VTVTVFDTPK DA