Gene Rsph17025_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2101 
Symbol 
ID5083560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2140683 
End bp2142314 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content70% 
IMG OID640483664 
Producthypothetical protein 
Protein accessionYP_001168297 
Protein GI146278138 
COG category[S] Function unknown 
COG ID[COG4383] Mu-like prophage protein gp29 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0242078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA CACCCGTCCT TCTCGACCGC TGGGGCAAGC CCGTGAAGCG CGCGGTCCTG 
ACAGAGGAGA TCTCGGCCGC CACCCTGGGC AGCGTGCGCA GCCCGATCAC CGGCTATCCG
GCCGACGGGC TGAACCCGGT GCGGCTGGCC TCGATCCTGC GCGAGGCCGA CGCGGGCGAC
CCGGTGCGCT ATCTCGAACT GGCCGAGACG ATCGAGGAGC GCGACCTGCA CTACCTCGGG
GTCCTTGGCA CCCGGCGCCG ATCGGTCAGC CAGCTCGACA TCACGGTCGA GGCGGCCTCG
GACGATCCGC GCGACGTGGA GATCGCCGAC ATGATCCGCG ACTGGCTCAC GCGCGACGAG
CTGTCCGACG AGCTCTTCCA CATGCTCGAC TGCATCGGGA AGGGCTACAG CTTCACCGAG
ATCATCTGGG ACACCTCCGA AGGCCAGTGG CGCCCGGCGC GTCTGGAGTG GCGGGACCCG
CGCTGGTTCC GCTTCGACCG GGCCGCTCTG ACCACGCCGC TGATGCTCGG CCCGCACGGC
GAGGAGCTGG AACTGACGCC GTTCAAGTTC ATCTTCGCCG AGGTGAAGGC CAAGTCGGGG
ATCGCGTTGC GGTCGGGTCT GGCGCGGGCC GCGGCCTGGG CGTGGATGTT CAAGGCGTTC
ACCCAGCGCG ACTGGGCGAT CTTCACCCAG ACCTACGGCC AGCCGCTGCG CCTGGGCAGA
TACGGCCCCG GCGCGTCCGA AGACGACAAG GCCACGCTCT TCCGGGCGGT GGCCAACATC
GCCGGCGATT GCGCGGCGAT CATCCCCGAG TCAATGGCGA TCGACTTCGT CGAGACGAAG
TCCGTGGGCG CCACGGCCGA TCTCTACAAG CAGCGGGCCG ACTGGCTCGA CCAGCAGATC
TCGAAGGCGG TGCTGGGCCA GACCGCCACG ACCGATGCCG TGACCGGGGG GCTGGGGTCC
GGGAAGGAGC ACCGGCAGGT GCAGGAGGAC ATCGAGCGCG CCGATGCGAA GGCGCTCTCG
GGCATCCTGA ACCGCGACCT GATCCGGCCC TGGGTGGATC TGGAATACGG GCCCCAGGCG
CGCTATCCTC GGCTCAAGAT CGCGCGGCCG GAGCCCGAGG ATCTGAAGGC GATGGCCGAG
GCGCTCGCAG CCCTCGTGCC GATCGGCCTC AGGGTCAGCC AGAAGAAGAC CCGTGACCGT
TTCGGCTTCG ACGAACCCGA AAACGACGCC GATGTGATGG GAGGAACGCC CGCCGCCGCA
GCCGTCGCGG CACCCCCGGG CGCGGATCGG CCGATTAAAC GGTTTTCCGG CGTTTTTAAA
GGGGGCGAGC CCCCGGCGCG ACCCGAGACA GCCCTGCAGG CGGAAGCGGC TCCAGCGGCC
CTCCCAGCGA GTGACGATCC GGCGGCGCTG CTGGCGGATC GGCTGGCGGC CGACGCGGCG
CCGGCCATGG GCGCGATGAT CGAGCGGGTC GAGACGATGC TGGCGGCCGC GGGTTCGCTG
GCCGAGTTCC GCGAGATGCT GCTCGCGGGC TTTCCCGGGA TCGACGCGGG CGACCTGGCC
ACCCTGATGG CGCAGGCGAT GATGGCCGCT CATGCCGGGG GTCGTGCGGC GGCGGAGGAT
GCCGGTGCCT GA
 
Protein sequence
MAKTPVLLDR WGKPVKRAVL TEEISAATLG SVRSPITGYP ADGLNPVRLA SILREADAGD 
PVRYLELAET IEERDLHYLG VLGTRRRSVS QLDITVEAAS DDPRDVEIAD MIRDWLTRDE
LSDELFHMLD CIGKGYSFTE IIWDTSEGQW RPARLEWRDP RWFRFDRAAL TTPLMLGPHG
EELELTPFKF IFAEVKAKSG IALRSGLARA AAWAWMFKAF TQRDWAIFTQ TYGQPLRLGR
YGPGASEDDK ATLFRAVANI AGDCAAIIPE SMAIDFVETK SVGATADLYK QRADWLDQQI
SKAVLGQTAT TDAVTGGLGS GKEHRQVQED IERADAKALS GILNRDLIRP WVDLEYGPQA
RYPRLKIARP EPEDLKAMAE ALAALVPIGL RVSQKKTRDR FGFDEPENDA DVMGGTPAAA
AVAAPPGADR PIKRFSGVFK GGEPPARPET ALQAEAAPAA LPASDDPAAL LADRLAADAA
PAMGAMIERV ETMLAAAGSL AEFREMLLAG FPGIDAGDLA TLMAQAMMAA HAGGRAAAED
AGA