Gene Rsph17025_4058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4058 
Symbol 
ID5086231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp101907 
End bp103544 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content73% 
IMG OID640485621 
Producthypothetical protein 
Protein accessionYP_001170215 
Protein GI146280058 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00410401 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.150946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCC CCGGCCCCGA TCCGGGCGCC CCGGCGCTCC GGGGCGGCGC CATCCGCCCC 
GGAGGACGGC CGGGTTGGGC GGCGCGGCTC TCCGCCCTTG CGGGCCCCCT GCTCGCGGCG
GCGCTGGTCT TCGCCGGCCC CGCCCGCGCC ACGGACTGGC CGGTGGAGCA GTATGATCCC
GGCGCGGCCG AGCGTCCGGC CGACCTGATC CTGCCCATGC CCTGCGGCGG GGCGATGGCT
TTCCAGAAGG TGGTCGTGCC GGTCGAGGCC GCCGATCCGC TCGACGACCG CCGCCTGCGC
CTCGGCCAGT CGCAGCCCGA GACCGGCTAT TCCGACTATC TGCGCACCGA GCATCTGCGC
GGCCCCTTCG CCTCGGACGA GGCCACCTTC TACTACATCG GCCGCTATGA GGTGACGCGC
GCGCAGCAGC GGGCGCTGGC CTTCGACTGC GCCCCGCCCA GCCGGATGGA CCGGACCGCG
GCGGCGGGGC TGTCGTGGTT CGATGCGGTG GCGCTGTCGC AGCTCTACAG CGAATGGCTG
CTGGCCGAGG CCCCGGACGC GCTGCCCCCC GAGGCGGAGG GGCTGGCCTT CCTGCGCCTG
CCGACCGAGA CCGAGTGGGA ATATGCCGCC CGTGGCGGGG CCGCGACCGA CGCCACGCAG
TTCGCCTCGC GGCGCTATTT CTCCGAAGGT CAGATCGCCG ATCATGCGAT GGCGCAGGGC
TCGGCGCGGG GAGAGGTGCT GCCCGTCGGG CTGCGCAGGC CGAACCCGCT GGGGCTCCAT
GACATCTACG GCAATGCCGA GGAGCTGATG CTCGAACCCT TCCGGCTGAA TGCGGTCGGG
CGCCCGCACG GGCAGGTGGG GGGGCTCGTC ACCCGGGGCG GCTCGGTGCT CTCGGCCCCC
GAAGAGCTCT ATTCCGCGCA GCGGCGGGAA TATCCGCTCT ACCGCGCCGC CGACGGCAAG
GCGCTGGCCG GGGCCACCTT CGGGCTGCGC CTCGTGCTGA CGCGCGATGT CACCTCGTCG
GACGCCCGCC TGCGCGCGAT CCGCAGCCGC TGGCTCGACC TGGCCGAGGC GCCGGCGGCG
GAGGCCTCGG ATCCGCTGGT CACGCTCTCG GCGCTGATCG AGGAAGAGGC CGACCCGCGC
CGGCAGTCGG CTCTGACCGA CCTCCAGCTC GAATTCCGGC TGGCGCGCGA TGCGGCGGCG
GCGGCCTTCC GGGAATCGGC GAAATCCACG CTGCTGAGCG GCGCGGTCTT CATCGCGGCC
CTGGCCGACG GCGCGCGCGA GATCGACCGC CAGACCGGCA ATGTCCGCGC CATGGTGGAC
CAGATCCGGG TGAGCGACGG GGCGCAGCGC GAGGCGCTCA TCGCGGGGGC CGAGCGGGTG
AACCGGCAGC TGAGGATGCT GCGCGACCTG CAGCACACCT ATCTTCTGTC CTACCGCAGC
GCGCTCGAGA CCCTCTCGTC GGAGATCGAG GGCGAGGTGG TGGAGACGGC CTTCGGCCTG
CTCCAGCAGG AGCTTGCGGC CTCGGGCCAG ACCGGGATCC TGTCGGGGCT GGAGGCGCTG
AACGAGGATC TCGCCCGCTT TGCCGCCCGG CCGGACATGG TCGAGGCCGA GCTGCTGGCC
CTGGCGCTGG AACGCTAG
 
Protein sequence
MRIPGPDPGA PALRGGAIRP GGRPGWAARL SALAGPLLAA ALVFAGPARA TDWPVEQYDP 
GAAERPADLI LPMPCGGAMA FQKVVVPVEA ADPLDDRRLR LGQSQPETGY SDYLRTEHLR
GPFASDEATF YYIGRYEVTR AQQRALAFDC APPSRMDRTA AAGLSWFDAV ALSQLYSEWL
LAEAPDALPP EAEGLAFLRL PTETEWEYAA RGGAATDATQ FASRRYFSEG QIADHAMAQG
SARGEVLPVG LRRPNPLGLH DIYGNAEELM LEPFRLNAVG RPHGQVGGLV TRGGSVLSAP
EELYSAQRRE YPLYRAADGK ALAGATFGLR LVLTRDVTSS DARLRAIRSR WLDLAEAPAA
EASDPLVTLS ALIEEEADPR RQSALTDLQL EFRLARDAAA AAFRESAKST LLSGAVFIAA
LADGAREIDR QTGNVRAMVD QIRVSDGAQR EALIAGAERV NRQLRMLRDL QHTYLLSYRS
ALETLSSEIE GEVVETAFGL LQQELAASGQ TGILSGLEAL NEDLARFAAR PDMVEAELLA
LALER