Gene Rsph17025_0725 details

Gene Information

Locus tag: Rsph17025_0725
Symbol:
ID: 5084084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 734889
End bp: 737888
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 56%
IMG OID: 640482283
Product: phage integrase family protein
Protein accession: YP_001166936
Protein GI: 146276777
COG category:
COG ID:
TIGRFAM ID:
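
The gene and protein lengths listed in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
start_bp = 734889
end_bp = 737888


def feature_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


gene_len = feature_length(start_bp, end_bp)
print(gene_len, protein_length(gene_len))  # prints: 3000 999
```

Both results agree with the Gene Length (3000 bp) and Protein Length (999 aa) fields.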


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.710298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCCG TTGAAACCCG AGACAACGAT TGGATCATCG ATCACGTGAC GGAAATCCAT 
GGCGTTAGGG TCAACACAGA ACCAAGCCGG CTTGCGCTGC TGTCCCATAG CACGAGCGAA
TGGCAGAGGG CCACCCGAGA AGCTTTGAAA TTTTCTCCTG TTATGGGAGA TGAGACCGCT
AAGCTCGATC TAATCGCAAC CGCCACCAAC ATGGCGACGG TCAGGCAAAT ACTGAACGAA
TGCCTTCTCG AAGTCGCTCA GCAAGGCGGC GGAAGTGAGA GCACTCGCGC CGCGCTGCAA
GTGCTCTTCA TGATCACGGT GCCGAAGCGT ATGGTCCTCT TTCCTTTTCG CCCAGCCACG
AACATAAAGT GGCTCAATAA TCATCTCTCT ATGCCGCCGA GTATGCGGGC TGAGCTTGGG
CTGCCGGCGC ATGACGCCTG CCCGAAGTAC TTCAACGCAG CGGCAGCGAT ATTGTATTGC
GCAACCGACT TGCACGGCCC CGAAGATTTC ACTGACGAGT TTCTTCGAGA AGTCTACCAG
AAGGTCATAT TCTTTCTGAA GCCCAACTAC ATGGCCGCAC TTGGTAACCT TCTGCGGAAG
ATTGCCGGCG CGAGCAAGGT TGAAAGCGCC ATGCGCGAAG CGCGTCGTCT AAGACCCAGC
CCGCGCTTCG CGCGGCAGCG CCCAGATCAG CCTTGGATCA TTGACCCCGA AACTGGCGAT
CCGCGATTCT CAGAATGGCA AACGGCGATA GCGGATTGGC TTCAAAATCT GACACACAAG
AGTTCGCGAA AATCTGAAAG CGCCGCCAAA CGCTTCGTTG CCTATATAAG GCAATGCGAC
GACCCTCCCC TTTCCCCCCA ACTTCTTCGT CGCTGCCACG TCGTTAAAAC ATCGGAAACT
CGCGAAGTTC CGTTCATTGA CTGGCTGATG ATTGAGAAGA GCATCAACGC CGCGCGTGAA
TCCTGCGGTC GTCTGCGGGA TCTGATGGCG CATTGGCATG ACACTCATCC CATTCTGGGG
AGCGATGAAG TGTGGAATAA TCCTTTCCAA ACCTCCGATC TAGAAGGTCT GCGCGGTTCC
AGCACCCACA ACAAATCCGT CAAATACGTC TTGCCTCAGC AGATTATCGA AATCGCAAAG
CAGGTCCTGC TTGAAGACGA CTACGCCTGG CCGCGCCGGC AGAAAATGTG TCGCGCCCGC
GCTCTTGGCC GTCCTGATCT GTTCTCGCCG GTCCTTCCCG TCGCTATCTT CACTCTCCTA
GTCCTGCCGC TCCGATCCAT TCAGTTGCGC CTGCTCGATT CCGGAGAGGC TGACGAACTG
CAACTCGACG AACGCTTGGA GCGCGCGGTC AACACGTCAA CCCACGCGAT CCGCGGCCGA
CAGGAGGGTT TCCTCAAACG CTTCGAACCC TTTGGAGGTC CCGGCAGCGA ATTCGTGGGC
TTTCACATCA CGACCAACAA GACCGACGCC GCGCAAAAAG GGGAGGTTGA GATCCCGTTC
GACATCCCAT GGCAGGCGGC AGAACTTATA CCGCACATAA GGCGCCTCCA GCAATGGCAG
GCCGAAGTAA AACCACTCGA CCGGTTGCGA ACTCGGAAGG AGCTGCACGA GAGGGAAACG
CGTATTGATG GAAAGATCGC AAATCTACCG AAATACGCGT TCCTGTTCCG TGATCCGGGA
GGGCAATATC CAAAGGAACC GGTGACCTAC AATAAGTTGG CGAACTTCTG GGCGTGCGTT
CTGGAAGAAG TTGAGAACCG CCTTCGGCGG GCGGGCACGC CGGTCAGACT TATCGAGGGC
CGGACCCGAC AAGGAAAATC GGCGCAACCT GTCGCGATCT ACACGTTGCA TGCTATGCGA
GTGAGCGGCA TTACCTATTT TATCGAGTCA GGAGTACCGA TACATATCGT ATCTGAATTT
CTGGCCGGTC ACGCCAACAT CATCATTACT CTCTACTACA CGAAGATCGC GCCGGCCAAA
GTCAACGAAA TTATTGAACG CGCAGCAGAT CAAAGCCGAG AACTCAGTGA AAGGGGCTAC
TTCGCGGCGC TTTCCGAGTT GTCCAAGGCC GAATTAAAAG GGCAACTCGT GGTGTCCGAT
GGGGCTGCGG CCCACGTCAG CGGCGGCGAT CCTGGAGTTT GGCATCTCGA CGTTGATGGG
TTCTGCGTCT GCGGTCGCAC GCAATGCGCA GAGGGAGGAG CAAAGCTGAA GAATGCACAG
GGAAAAGAGT ATTTTGAAGG CATAACTTTC GACAGGTTCA ACTGCGGCGC GTGTCGGTTT
CACGCGACCG GACCCGCCTT CCTTGCCGGC CAGGTTCTAG TCTTTAATAG CTTGCTCCAC
GAGATTCATC TCCTCGCGAA GAAGCGTGAC GATTTGCAAA ATCGGTTCGA CTTGCGGCAG
GATGACTGCG TCAGGGCAAA ATCCGACATG ATCCACCGCC TGGAAAGGAT CGATCAGGAA
ATGGACAGCA AGGTACGTGT CCTTGCGTCT CGCTTCACGA GGTTGCAGCA ATCCTTGGAA
CTTCATGAGC GCAAACGCGC GGGCGGGAAA CAGGCCCTCA TCACCGGACT TGACCGCCGC
GAGGTCTCCG TGGCAATCGA GAGCGCGCGG GACTCGGACT TGCTGGATTT CGCGGCCCAT
GTGGCAGAAT TCTTTCCGGA ACTCACAGAC GACTCGGCGC GGATGAGGAA GGGGCTGCTT
CTCGAACGCA TGGCGAGCCG CGATGGATTG GGCTCTCTGT TCTTACATTT ACCCGACGCC
ATCGCGCTTT CTGCGGCCAA TCGTTTTACT GAACTCGTCA AAGACTTGAT TGGCAGTAAT
GCACTTACAA ACTTGATGGA GGGCGCGGCC ACATTTCGAG AACTGGGTTT AATCGACCAA
GTACCACGAC TCAAGGATAG GGCGATAGAA CTTGCTCGAG AGGCTCATCG CAAGCTGGCC
GACTCGGAGG GTCTGGCTCA GATCGAGGCC CGTCGCGACA GCACCCAAGT TCGCGGCTGA
 
Protein sequence
MDSVETRDND WIIDHVTEIH GVRVNTEPSR LALLSHSTSE WQRATREALK FSPVMGDETA 
KLDLIATATN MATVRQILNE CLLEVAQQGG GSESTRAALQ VLFMITVPKR MVLFPFRPAT
NIKWLNNHLS MPPSMRAELG LPAHDACPKY FNAAAAILYC ATDLHGPEDF TDEFLREVYQ
KVIFFLKPNY MAALGNLLRK IAGASKVESA MREARRLRPS PRFARQRPDQ PWIIDPETGD
PRFSEWQTAI ADWLQNLTHK SSRKSESAAK RFVAYIRQCD DPPLSPQLLR RCHVVKTSET
REVPFIDWLM IEKSINAARE SCGRLRDLMA HWHDTHPILG SDEVWNNPFQ TSDLEGLRGS
STHNKSVKYV LPQQIIEIAK QVLLEDDYAW PRRQKMCRAR ALGRPDLFSP VLPVAIFTLL
VLPLRSIQLR LLDSGEADEL QLDERLERAV NTSTHAIRGR QEGFLKRFEP FGGPGSEFVG
FHITTNKTDA AQKGEVEIPF DIPWQAAELI PHIRRLQQWQ AEVKPLDRLR TRKELHERET
RIDGKIANLP KYAFLFRDPG GQYPKEPVTY NKLANFWACV LEEVENRLRR AGTPVRLIEG
RTRQGKSAQP VAIYTLHAMR VSGITYFIES GVPIHIVSEF LAGHANIIIT LYYTKIAPAK
VNEIIERAAD QSRELSERGY FAALSELSKA ELKGQLVVSD GAAAHVSGGD PGVWHLDVDG
FCVCGRTQCA EGGAKLKNAQ GKEYFEGITF DRFNCGACRF HATGPAFLAG QVLVFNSLLH
EIHLLAKKRD DLQNRFDLRQ DDCVRAKSDM IHRLERIDQE MDSKVRVLAS RFTRLQQSLE
LHERKRAGGK QALITGLDRR EVSVAIESAR DSDLLDFAAH VAEFFPELTD DSARMRKGLL
LERMASRDGL GSLFLHLPDA IALSAANRFT ELVKDLIGSN ALTNLMEGAA TFRELGLIDQ
VPRLKDRAIE LAREAHRKLA DSEGLAQIEA RRDSTQVRG
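
The sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The Python sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid assignments
# are assumed to be shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
head = clean("ATGGATTCCG TTGAAACCCG AGACAACGAT")
print(translate(head))  # prints: MDSVETRDND
```

The translated fragment matches the first ten residues of the protein sequence. Applying `gc_fraction(clean(...))` to the full gene sequence is one way to check the GC content field.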