Gene Rsph17025_3206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3206 
Symbol 
ID5085955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp65637 
End bp67853 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content69% 
IMG OID640484778 
Producthypothetical protein 
Protein accessionYP_001169395 
Protein GI146279237 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.799224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.954215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC ATCTCATCTT TGACGAGCCG GATGATCGCA ACCGTCTGGA CAGCATGGCG 
CAGGATGTCA TCGGGCGGCC CATCGACCGC TCCGAGGGCG CGCTGAAGGT CTCCGGCCGC
GCGGTCTATG CCGCCGAACA GGCCACGAGC CTCTGGTCCG AACAGCGCAC CGACTGCCTC
TTCGGCATCC TCGTCCGTGC GACCGTGGCC CGGGGGCGGG TCACCTCCCT CGACACGGCC
GATGCCCTGG CCGTAAGCGG CGTGCGCGAG GTGCTGCAGG ACCCGCGCCT CGTCCGCCAT
CCCGCCCAGG GGATGGCGCG CAAGGCCCCG CCGCAGGAGG TAACCGATGT CGCCTGCTTC
GGCCAGCCGA TCGCCCTCGT CGTGGCCGAC AGCTTCGAGG CCGCGCGCGA GGCGGCCCGC
CTCGTCCGTG TGGATTATGA GACGGCCGCC GCAGAGCTCG ATCCCGCATC GGCACCCTCC
GATCTGCCCG AGAAGAAGCA GTCCTCTCAG GGCGATCTCG ATGCGGCGAT GCGCGATGCC
TCGGTGTCCC TCGATGTCAC CTATCGGACC CCTTCGATGG CGGCCGCGGC CATGGAGCCT
CACGCGGCCC TGGCCTGGTG GGAGGGCGAC CGGGTGACGC TCTGCGGCGC CTATCAGATG
GTGTCGCAGA ATGCCGAGGA ACTGGCGGAT GCCCTTGGGA TCGCGCCCGA GAAGGTGCGG
GTGCTGTCGC CTTACATTGG TGGCGGTTTC GGCTCGAAGC TCGGGATTGC GCCCGAAGCG
GTTGCGGCGG CGGTGGCGGC CGAGCGGCTT GGCCGGCCGG TGATGGTGCC GCTCACCCGG
CAGCAGGAGT TCGAGATCGC CCATCGCAGG TCCGAGACCG AGCAGCGGGT GCGCCTTGCG
CTGGACGCGA ACGGGCTCTT GACGGGGATC GGGCACGAGG CGCGCGTTTC CAATCAGCCG
GGAGAGACCT TCTCCGAGCC GGTGCAGCAG GCCACCCACT ACACCTACCG CGGAGAGCAC
CGCAGCATCC GCCACGAGGT CGCGCGGGTG AACCTGACCT GTGCCGGCTC GGTCCGGGCG
CCGGGCGAGG CCGTGGGCGT CACGGTGCTG GAACTCGCCA TGGACGAACT GGCCGAAAAG
GCGGGCATCG ACCCGGTCGA GCTGAGATTG AGAAACATTC CCGAGTGCGA TCCCGAGAGC
GGGAAGCCGT TTTCATCGCA TATGCTCGCC GAGGCGCTGC GCGAAGGGGC CGATCGCTTC
GGCTGGACCT CGCGCCGTCA GCGCGGCCGG CAGCGCGCGG GGGAGTGGCT GATCGGTTGC
GGCATGGCGG CCGCCGTGCG GGTCAACTTC CTGCACAAGG CCCGGGTCCG AGTCACGCTC
AGCGCGGAGG ACCTGACGAT CGAGAGCAGC ATGACCGACA TCGGCACGGG CACCTACACG
ATCCTCACCC AGATCGCCGC CGAGATGCTT GGCATCCCGC CCGAGCGCGT GACCACCCGG
CTCGGGGATA CGGACCTGCC CGAAGGATCG GGCTCCGGCG GTTCGATCGG TGCGGCCTCG
AACGGTTCGG CCACCTACCT CGCCTGCGAA GAGATCCGCC GGCAACTGGC AAGCCGGCTG
GGCTGCGCCG AGGAGGATCT TACCCTCAGG GACGGCCGCG CAACGTGTGG AAACCGCTCG
CACGATCTGG CCGAACTCAT CCACGAGCCG CTCGCTGCGG AAGGGTTTTT CGAACCAGGA
AAGGCCATGG GGCGCGTCCG CAGCTCGACC TGGGGGGCGC ACTTCGCCGA AGTGGCGGTC
AACGAGGTGA CGGGCGAGGT GCGCGTGCGC AGCATGCTCG GCGTCTTCGC GGCGGGGCGC
ATCCTGAATG AAAAAACCGC CCGGTCGCAG TGCATCGGTG GGATGACCTT TGGTCTTGGC
ATGGCCCTGA TGGAAGAGAT GGTCCATGAC GGCCGGCACG GCCATGTGGT CAGCCGCGAT
CTGGCGAACT ACCACATCCC GACCAACGCC GATGTGCCGA GGATCGAGGT GCATTTCCTT
CACGAGCGCG ACGCCTACGG AGGCACCCTT CAGGCGAAGG GGATCGGCGA ATTGGGGATC
TGCGGTGCGG GGGCGGCCGT GCTCAACGCG ATCCATGATG CCTGCGGCGT GCGGGTCCGC
GAGCTTCCCG CCACGCCGGA CAAGATCCTC GCCGGTCTGT GCGGGATGCA GGGCTGA
 
Protein sequence
MTAHLIFDEP DDRNRLDSMA QDVIGRPIDR SEGALKVSGR AVYAAEQATS LWSEQRTDCL 
FGILVRATVA RGRVTSLDTA DALAVSGVRE VLQDPRLVRH PAQGMARKAP PQEVTDVACF
GQPIALVVAD SFEAAREAAR LVRVDYETAA AELDPASAPS DLPEKKQSSQ GDLDAAMRDA
SVSLDVTYRT PSMAAAAMEP HAALAWWEGD RVTLCGAYQM VSQNAEELAD ALGIAPEKVR
VLSPYIGGGF GSKLGIAPEA VAAAVAAERL GRPVMVPLTR QQEFEIAHRR SETEQRVRLA
LDANGLLTGI GHEARVSNQP GETFSEPVQQ ATHYTYRGEH RSIRHEVARV NLTCAGSVRA
PGEAVGVTVL ELAMDELAEK AGIDPVELRL RNIPECDPES GKPFSSHMLA EALREGADRF
GWTSRRQRGR QRAGEWLIGC GMAAAVRVNF LHKARVRVTL SAEDLTIESS MTDIGTGTYT
ILTQIAAEML GIPPERVTTR LGDTDLPEGS GSGGSIGAAS NGSATYLACE EIRRQLASRL
GCAEEDLTLR DGRATCGNRS HDLAELIHEP LAAEGFFEPG KAMGRVRSST WGAHFAEVAV
NEVTGEVRVR SMLGVFAAGR ILNEKTARSQ CIGGMTFGLG MALMEEMVHD GRHGHVVSRD
LANYHIPTNA DVPRIEVHFL HERDAYGGTL QAKGIGELGI CGAGAAVLNA IHDACGVRVR
ELPATPDKIL AGLCGMQG