Gene Rsph17025_3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3475 
Symbol 
ID5085891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp353923 
End bp355131 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID640485040 
Producthypothetical protein 
Protein accessionYP_001169656 
Protein GI146279498 
COG category[L] Replication, recombination and repair 
COG ID[COG2826] Transposase and inactivated derivatives, IS30 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.214234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCCA CACATTTGGC CCCTTCTGCA CCCCGGACGC TGGGCCGCAG CCTTTCCTTT 
GCCGAGCGTG AGGAAATCGC TCTGGAGTGC GCGCGGAAGA CCGGGGTGCG CGCCATCGCC
CGGAAGCTGG GGCGTTCGCC GAGCACGATC TCACGCGAGA TCAGGCGCAA CTCCGCGACC
CGTGGCGGGG ATTTCGATTG CCGCGCCATC ACCGCGCAGT GGCATGCGGA CCGTGCAGCC
CAACGACCTA AGACCAGCAA GCTCGCGAAC AATCCGGCCC TGCGTGACTA CGAGCAGGAC
CGGCTCGCGG GTGTGATCGC CACGCCGGAC GGCGTCGCCT TCGACGGGCC TGTCGTGGTA
TGGAAGAAGC GGCGGGCGGT TCACCGGCAA AGCCGACGAT GGTCCTTGGC ATGGAGCCCG
GAACAGATCG CGCAGCGATT GAAGATGGAC TTTCCCGAGG ACCCGACCAT GCGCATCAGC
CACGAAGCGA TCTATCAGGC GCTCTACATT CAGGGGCGTG GCGCGCTGAA GCGGGAGCTT
TCCGCCTGCC TGCGCTCGGG CCGAGCGCTA CGTCTCCCCC GGGAGCGCGC CCGAAACCGA
GGCAAGGCTT TTGTCGGGGA TGCATTGATG ATCAGCGATC GCCCGGCGGA AGTGGGCGAC
AGGGAGGTGC CGGGCCACTG GGAGGGAGAT CTCATCCTTG GCCTCGGCAG TTCGGCAATC
GGCACCCTGG TCGAACGGAC CACGCGCTTC ACGATGTTGC TGCATCTGCC GCGGATGGAG
GGCCATGGAG CGACCAGGTC CATCAAGAAT GGACCGGCTC TTGCCGGTCA TGGTGCCGAA
GCCGTTCGCG ATGCAATTGC AGACACCATC ATGGACCTGC CCGCACACCT GCGCCGGTCC
CTGACTTGGG ATCAGGGCGC CGAGATGGCG CAACACGCCC GGCTCAGGTT CGACACCGGC
CTCGATGTCT ACTTCTGCGA TCCACGAAGT CCATGGCAAC GAGGCAGCAA CGAAAACACC
AATGGTCTGC TCCGCCAGTA CTTTCCCAAG GGAACCGATC TCAGCCAGTA CGGTGTGGAC
GCATTGAATG CCGTTGCCCA CGCGCTGAAC ACGCGACCGC GCAAGACGCT CGGCTGGCAA
ACGCCGGCAG AAGCACTCGA CCGACTGCTC AAAATGGATA CGATCGAACG TGTTGCGACG
ACCGGTTGA
 
Protein sequence
MPPTHLAPSA PRTLGRSLSF AEREEIALEC ARKTGVRAIA RKLGRSPSTI SREIRRNSAT 
RGGDFDCRAI TAQWHADRAA QRPKTSKLAN NPALRDYEQD RLAGVIATPD GVAFDGPVVV
WKKRRAVHRQ SRRWSLAWSP EQIAQRLKMD FPEDPTMRIS HEAIYQALYI QGRGALKREL
SACLRSGRAL RLPRERARNR GKAFVGDALM ISDRPAEVGD REVPGHWEGD LILGLGSSAI
GTLVERTTRF TMLLHLPRME GHGATRSIKN GPALAGHGAE AVRDAIADTI MDLPAHLRRS
LTWDQGAEMA QHARLRFDTG LDVYFCDPRS PWQRGSNENT NGLLRQYFPK GTDLSQYGVD
ALNAVAHALN TRPRKTLGWQ TPAEALDRLL KMDTIERVAT TG