Gene Rsph17029_3945 details


Gene Information

Locus tag: Rsph17029_3945
Symbol:
ID: 4899122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1078728
End bp: 1080947
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 71%
IMG OID: 640114548
Product: aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding
Protein accession: YP_001045795
Protein GI: 126464682
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
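
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the record; the 1-based, inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 1078728
END_BP = 1080947


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2220, matching "Gene Length"
print(protein_length(length_bp))  # 739, matching "Protein Length"
```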


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCTC ATCTCGTATT CGACGAGCCC GACAAGCGTA ACCGGCTGGA TGCGATGGCG 
CAGGAGGTGG TCGGGCGCCC CATCGACCGC TCGGAAGGGG CGCTGAAAGT GTCCGGACGG
GCCGTCTATG CGGCCGAACA GGCGTCGAGC CTTTGGTCCG AGAAGCGCGC CGACTGCGCT
TACGGCGTCT TCGTGCGGGC CACCGTGCCC TTGGGCCGGG TCGCTTCGCT GGCCTCCGAC
GCGGCGCGCG CGGTTCCCGG CGTGCTGAAG GTGCTGCAGG ATCCCCGCCT CGTGCGCCAT
CCGGCGCAGG GCATGGCGCG GAAGTCGCCG CCGCAGGATC CGTCCGAGGT GACCTATTTC
GGCCAGCCCC TCGCGCTGGT GGTGGCCGAA AGCTTCGAGG CCGCCCGCGA AGCCGCGCGG
CTGGTGCGGG TGGACTACGA GACCTCGGCC ACAGCTCAGC TCGACCCCGC CCAAGCACCC
TCGGAATTCC CCGAGAAGAA GCAGTCGGAA AAGGGGGATC TGGAGGCCGC CCTGCGCGAG
GCCGCCGTTT CGCTCGATGC GACCTACAGC ACCCCCTCGA TGGCCGCCGC CGCGATGGAA
CCGCATGCGG CCCTCGCCTG GTGGGAGGGC GACCGGGTGA CCCTCTGCGG CGCCTATCAG
ATGGTGTCGC AGAATGCCGA GGAGCTGGCC GATGCGCTGG GGATCGCGCC TGGGAAGGTG
CGGGTTCTCG CGCCCTTCAT CGGCGGCGGC TTCGGCTCGA AGCTCGGGAT CGCGCCCGAG
GCCGTGGCGG CGGCCGTGGC GGCCGAGATG CTCGGCCGGC CGGTGATGGT GACGCTCGCC
CGGCAGCAGG AGTTCGAGAT TGCCCACCGC CGCTCCGAGA CCGGGCAGCG GGTGCGCCTC
GCCCTCGATG CGGACGGACG GCTCACCGGG ATCGGCCACG AGGCGCGGGT GTCGAACCTC
GACGGGGAGA GCTTCTCCGA GCCCGTGCAG CAGGCGACCC ACTTCACCTA TCGCGGCGAG
CATCGTCAGA TCCGGCACGA GGTGGCGCGG GTGAATCTCA CCTGCGCGGG ATCGGTGCGC
GCGCCGGGCG AGGCGGTGGG GGTCAGCATC TTCGAGATGG CGCTCGACGA GCTGGCCGAG
AAGGCGGGCC TCGATCCGCT CGAGCTGCGG CTTCGCAATA TTCCCGAAGA GGATCCCGAG
ACCGGCAAGC CCTTCACCTC GCACATGCTG GCCGAGGCGC TGCGGGACGG CGCCGACCGG
TTCGGCTGGT CGGACCGACG CGCGCCGGGC GAGCGGCGGC AGGGCGAGTG GCTGATCGGG
ATGGGCATGG CCTCGGCCGT GCGGGTGAAC ATGCTGCACA AGGCGCGCGT GCGCGTCACG
CTGACGGGCG AGGGCGCCCT CGTCGAGAGC AGCATGACCG ACATCGGGAC GGGCACCTAC
ACGATCCTCA CCCAGATCGT GGCCGAGATG CTGGGCCTCG CGCCCGACCG GGTCCGCACG
GTCCTTGGCG ACACCGATCT GCCGGAGGGC TCCGGCTCGG GCGGCTCGAT CGGCGCGGCC
TCGAACGGCT CGGCCGCGTT CCTCGCCTGC GAGGAGATCC GCCGGCAGAT CGCGGCGGGG
ATGGGTTGCC CCGAGGAGGA GCTGACCCTG AAGGACGGGC GCGCCACCTG CGGCAACCGG
ACCCGCGATC TGGCCGAGAT CGTGACCGAG CCGCTGTCGG CCGAGGGCCA TTTCGAGCCG
GGCAAGGCCT TGAAGGGCGT CCGCAGCTCG ACCTTTGGCG CGCATTTCGC CGAAGTGGCG
GTCAACGAGA TCACGGGAGA GGTGCGCGTG CGCCGGATGC TGGGCGTCTT TGCCGCCGGG
CGGATCCTGA ACGAGAAGAC CGCCCGCTCG CAATGTCTGG GCGGGATGAC CTTCGGCATC
GGCATGGCGC TGATGGAAGA GATGGTGCAC GACCGCCGCT TCGGTCAGGT GGTGAGCCGG
GATCTGGCGA ACTACCATAT CCCCGCCCAT GCCGACGTGC CGGCACTGGA GGTTCATTTC
CTCGAAGAGC GCGACAGCTA CGGCGGCACG CTTCAGGCCA AGGGGATCGG GGAACTCGGG
ATCTGCGGAT CGGGGGCCGC CGTCCTCAAC GCGATCCACA ACGCCTGCGG CGTCCGGGTG
CGCGATCTGC CGGCCACGCC GGACAAGGTG CTGAACGGCC TCTGCGCCCT CGGGCGGTAG
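
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. A minimal sketch of that cleanup and of a GC-fraction calculation follows. The helper names are hypothetical, and only the first line of the sequence is used as sample input; the whole sequence would need to be pasted in to compare against the reported GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = (
    "ATGACCGCTC ATCTCGTATT CGACGAGCCC GACAAGCGTA ACCGGCTGGA TGCGATGGCG"
)
seq = clean_sequence(first_line)
print(len(seq))                       # number of bases after cleanup
print(f"{gc_fraction(seq):.2f}")      # GC fraction of this sample only
```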
 
Protein sequence
MTAHLVFDEP DKRNRLDAMA QEVVGRPIDR SEGALKVSGR AVYAAEQASS LWSEKRADCA 
YGVFVRATVP LGRVASLASD AARAVPGVLK VLQDPRLVRH PAQGMARKSP PQDPSEVTYF
GQPLALVVAE SFEAAREAAR LVRVDYETSA TAQLDPAQAP SEFPEKKQSE KGDLEAALRE
AAVSLDATYS TPSMAAAAME PHAALAWWEG DRVTLCGAYQ MVSQNAEELA DALGIAPGKV
RVLAPFIGGG FGSKLGIAPE AVAAAVAAEM LGRPVMVTLA RQQEFEIAHR RSETGQRVRL
ALDADGRLTG IGHEARVSNL DGESFSEPVQ QATHFTYRGE HRQIRHEVAR VNLTCAGSVR
APGEAVGVSI FEMALDELAE KAGLDPLELR LRNIPEEDPE TGKPFTSHML AEALRDGADR
FGWSDRRAPG ERRQGEWLIG MGMASAVRVN MLHKARVRVT LTGEGALVES SMTDIGTGTY
TILTQIVAEM LGLAPDRVRT VLGDTDLPEG SGSGGSIGAA SNGSAAFLAC EEIRRQIAAG
MGCPEEELTL KDGRATCGNR TRDLAEIVTE PLSAEGHFEP GKALKGVRSS TFGAHFAEVA
VNEITGEVRV RRMLGVFAAG RILNEKTARS QCLGGMTFGI GMALMEEMVH DRRFGQVVSR
DLANYHIPAH ADVPALEVHF LEERDSYGGT LQAKGIGELG ICGSGAAVLN AIHNACGVRV
RDLPATPDKV LNGLCALGR
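
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it allows, so the standard assignments are used here. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate blocked input
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCGCTC ATCTCGTATT CGACGAGCCC"))  # MTAHLVFDEP
```

The printed residues match the first block of the protein sequence in the record.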