Gene Rsph17025_4244 details

Gene Information

Locus tag: Rsph17025_4244
Symbol:
ID: 5086539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009431
Strand:
Start bp: 434
End bp: 3787
Gene Length: 3354 bp
Protein Length: 1117 aa
Translation table: 11
GC content: 72%
IMG OID: 640485805
Product: hypothetical protein
Protein accession: YP_001170399
Protein GI: 146280243
COG category:
COG ID:
TIGRFAM ID:
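
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 434
END_BP = 3787
GENE_LENGTH_BP = 3354
PROTEIN_LENGTH_AA = 1117


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3787 - 434 + 1 gives 3354 bp, and 3354 / 3 - 1 gives 1117 residues, which agrees with the reported gene and protein lengths.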


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.172375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAGA CCGGCGCGCT GTCGGGCGAG CTGACCTTTC GGGACCTGTC GCTCCTGCCC 
TCGGGTCTGG GGCGCGGCGC GCGGGCCACC GGGTCGGGCA GCATCCGGTT GCGCGACGGG
CGCCTTTCGG GCGACGGGTC GGCCGAGATC ACCTATCAGG ATCTGGGCAA CGGCAACGTC
CGCTTTGCCT TCACCGAGGC CGGCGCCTTC ACCGCAGAGG GCACCTTCCG CGTCACGCCG
CCCTTCGTCA ACGAGGTCAG CGGCGATCTG GCGGCCGACG AGGCCGGCAA CCTGACCGCG
AACGCGCGGA TCGGCGTGGG CGACATGCGC ACCAGCCTGC CGGGCCTGTC GCTGACGGGG
GGAACGCTGA CGCTCGGCTA CCTGAACGGG CGGCCCTCGG GCGGGATCGA GGGCTTTGCG
GCCACCTACG CGGGCCTCGG CTCGGTGACG CTGGAGGAGG CCACGATCGC CGCCTCGGGC
TTTGCCGGAA CCGGGCGCTT CGCGCTCGAG GTGGCGGGGC TGAACGAGGC CTCGGGCCGG
GTCACGATCC GCAACGGCCG CGTCTCGGGC CTGTTGCGGC TGGGGGCCGA TGCCTTCCCG
GCCGGACTGC CGGTGCGCCG CCCCTCGCTG ACGGTGCGGC TGGCCGAGAC GGGCCGCGTG
GGCGTCGATG GCTCGGCCAC CGTCGATCTG GGCCCGGCCG GCACGGGCAG CTTCTCGGCC
GCCTATTCCG AGACCGGCGC CTTCAGCTTC GGCGGCGACG TGACCCTGAC CATTCCGGGC
CTGACCACCG TGACCGCCCG CGTGGGCTAC GCCAACGGCG AGATCTCGGG CGAGGTTCAG
GTGCCGGTGA ACACGCAGCT TCTGCCCGGT CTCGACGGCT CGGTGACGGT GCGCTACGCG
CAGAACCGCT GGTCGGGCGA GACCACGCTG AACTTCGCGG CCGACAACGG CAAGCTGTCG
GGCACCGTCA CCGTCACCGT GGCGCAGACC GAGGCGGGCG CGCTGGAACT GGGTGGCGAG
GGGCGGGTGA CGGCCCAGCT TGCGCCGCGC CTGCAAGGCA CGCTGACCGC GCGCATCCTG
CCCGAAGGGG CGGTGGACAT CTCGGGCGTG ATCGAGGTGA CCGAGCCGCT GGAGCTGTTC
CCCGAAAAGC GGATGGACCG CGAGCTGTTC CGCTATGCGC AGAACATCCC GCTCTGGGCG
ATCCTCGTCG CGGTGATCCG CATCCGGGCG GGCGTGCGCG CGGGCGTGGG GCCGGGCGTC
TTCCGCAACA TCCGCGTCGA AGGCTCCTAC ACGATCGGCT CGGCCGAGGC CGACCCCTCC
TTCACGATCT CGGGCGAGCT GTTCGTGCCG GCCTTCGTCG AGGGCTATGT GGCGATCGGC
GCGGGGCTCG GGCTCGACGT GGTGCTGGGC GAGCTGACCG GCGGGATCGA GGCGGTGGGG
ACGGCGGGCC TTTACGGCGC GATCTCGGTC GAGCCGGCGC TGACCTACGC CGATGGCGAC
TGGGGCATCG AGGGGGTGGC GACGCTCGCC GCGGGGGCGC GGCTCAAGCT CGGGCTGAAC
GCCTGGGCCG AGATCGAGGC GCTCTGGGTC ACGGTCTGGG ACAAGCAGTG GAAGCTGGCC
GAGGTGGTGA TGCCCGTCGG CCCGGATCTG GGGCTTCAGG CGCGCATGTC CTACAAGTTC
GGCCGCCCCG AGCCGCCGAC CATCGAGATG ACCTCGTCCG AGATCGACAC GGCGCGCCTT
GTGCAGGATG CGATGCCCAA GGACGGGCCC GCCCCCTCGG GCGCGCGGGA GGCGTTGCAG
AACAAGGCCG AATGGAAGGG CCAGCTGCAG GCGCAGCGCG CCGCCGCGGT GCCGCCCGAA
CAGGTGGCGC AGCAGGCCGA GCCCGCCACC CCGCCGCCCG CCCCGCCACG GCCGCCCAAG
GCCGCGGGCG GGCCGCCGGC CGCCACCCCC GCCGTGGGGC CCGCGGCCCC CGCCACGGCG
CAGCAGACGC AACCGGGCGC CAGCCCGAAC GAGGCGAACC TGCCGGCCCG CAGCGACGCC
GTGGATCGGG CCGCCGCCCC CGACAGCAAC ATCCCGGCCG CCGTTCCCGA AGGCTCGCTG
CCCGGGGCCG ACCAGCCACG CTATCCGCAT CCGATCACGC TGAAGATGCT GGACGAACCG
CCGGCCTCGA TCCCGCGCAC CCTCTCGCAG GAGGCCGAGG ATGTCGCGGC CGCCAGCCGG
ATGGTCGAGC TGGCCAGCGC GCAGGCGACC GACAGCGACG CGCTCGACAA TTACTTCCCG
CGGATCAAGC AGCGGTTCGG TCTGGTCTCG CTCGGCTACG AGGGCGACTT CCAGCGCGGC
TTTGCCGTGG TCGGGCGGAT CAACCCGGAG TTCAAGCGCC GGGTGGCCGA ACCCCTGAGC
GGGACCGGGC TGCCGGGCGC GCTGGCCTCG GGGCATGTCA CGAAGATCGT CCACGAGCAC
AGCCAGCTGG GCGGCGCGCA GGTCGGGCTG ACGATGCGGG CCCGGCCGCT GGGGCCCGAC
CACCAGCAGG GCTCGGGGCC CACGGGCCAG GACGCGCTGA TGGCGCAGCT GCCCACGGAC
CCGCGAATCT ATTCCGACAC CGCGCAACGC TATGTCCGCG GCCACCTTCT GAACGATCAC
ATCGGCGGGC TCGGGCATCC GATGAACCTC TTTCCGATCA CCGCCTCGGC CAATGCCCAG
CACGAAAGCG CGGTGGAATC CTACGCCAAG GACTGGGTGA ACAACCGCAA GCTCTGGATC
GACTACACGG TCGAGGTGAA GGCCAGGCCC GAACTGAGCC GGGCGACCGG AGGGCTGAAG
AAGATCGACG CCGTGATCGA CGCGACGGCC GCGGCGCTGG ACACCAACCT CGACCGCATC
CCCAACCTCA CGCGGCATGT CACCATCGCC TCGACCTACC GGACCGCCTC GCAGGCCGAG
GAGGGCGAGT TGAACGACTT CACGCAGGCG CTGGTCGATC CGACCGCGGC GGCGCTGCAG
GCCGAGCGGC CGCAGGATCA GGCCCTGACG GCCCCGCGGT CCTCGCGCGA GACGCCGACC
AGCTTCCCGC CGCACATCGG CGCCGCGATC GCGGCCGCCG TGGTCAAGCT GGGATCGCGG
AGCCGCGTGG CGGCGGTGCT GCAGGATCAT CCGGGCTTCG GCGATGTCTC GGAGGAGGTG
CTGTTCGAGG TCTATGACCG GATCGGCACG ACCGGCGCCA CGGTCGATTT CCTCGGCTCG
CCGGCCGAAA GGGGCGTGCT GACGCGGATC ATCAACGCCT GGGGCGGCAA GGACGGCATC
GGCGCCCGGC TGGAACGGGC GATTGCCTCG GCCGGGTCGG GCGCTGGCCC GTGA
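
The record reports a GC content of 72%. As a minimal sketch, GC content can be recomputed from the gene sequence as the fraction of G and C bases, assuming that is the definition the database uses. The function name `gc_content` is illustrative. Whitespace is stripped so the sequence can be pasted in its 10-base blocks exactly as displayed.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) is ignored so block-formatted
    sequences can be passed in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First displayed line of the gene sequence; paste the full sequence
    # to compare against the reported whole-gene value.
    first_line = "ATGGGCGAGA CCGGCGCGCT GTCGGGCGAG CTGACCTTTC GGGACCTGTC GCTCCTGCCC"
    print(f"{gc_content(first_line):.1%}")
```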
 
Protein sequence
MGETGALSGE LTFRDLSLLP SGLGRGARAT GSGSIRLRDG RLSGDGSAEI TYQDLGNGNV 
RFAFTEAGAF TAEGTFRVTP PFVNEVSGDL AADEAGNLTA NARIGVGDMR TSLPGLSLTG
GTLTLGYLNG RPSGGIEGFA ATYAGLGSVT LEEATIAASG FAGTGRFALE VAGLNEASGR
VTIRNGRVSG LLRLGADAFP AGLPVRRPSL TVRLAETGRV GVDGSATVDL GPAGTGSFSA
AYSETGAFSF GGDVTLTIPG LTTVTARVGY ANGEISGEVQ VPVNTQLLPG LDGSVTVRYA
QNRWSGETTL NFAADNGKLS GTVTVTVAQT EAGALELGGE GRVTAQLAPR LQGTLTARIL
PEGAVDISGV IEVTEPLELF PEKRMDRELF RYAQNIPLWA ILVAVIRIRA GVRAGVGPGV
FRNIRVEGSY TIGSAEADPS FTISGELFVP AFVEGYVAIG AGLGLDVVLG ELTGGIEAVG
TAGLYGAISV EPALTYADGD WGIEGVATLA AGARLKLGLN AWAEIEALWV TVWDKQWKLA
EVVMPVGPDL GLQARMSYKF GRPEPPTIEM TSSEIDTARL VQDAMPKDGP APSGAREALQ
NKAEWKGQLQ AQRAAAVPPE QVAQQAEPAT PPPAPPRPPK AAGGPPAATP AVGPAAPATA
QQTQPGASPN EANLPARSDA VDRAAAPDSN IPAAVPEGSL PGADQPRYPH PITLKMLDEP
PASIPRTLSQ EAEDVAAASR MVELASAQAT DSDALDNYFP RIKQRFGLVS LGYEGDFQRG
FAVVGRINPE FKRRVAEPLS GTGLPGALAS GHVTKIVHEH SQLGGAQVGL TMRARPLGPD
HQQGSGPTGQ DALMAQLPTD PRIYSDTAQR YVRGHLLNDH IGGLGHPMNL FPITASANAQ
HESAVESYAK DWVNNRKLWI DYTVEVKARP ELSRATGGLK KIDAVIDATA AALDTNLDRI
PNLTRHVTIA STYRTASQAE EGELNDFTQA LVDPTAAALQ AERPQDQALT APRSSRETPT
SFPPHIGAAI AAAVVKLGSR SRVAAVLQDH PGFGDVSEEV LFEVYDRIGT TGATVDFLGS
PAERGVLTRI INAWGGKDGI GARLERAIAS AGSGAGP
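
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference is not handled here. The `translate` function is an illustrative helper, not a library API.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    Alternative start codons are not special-cased.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown in this record.
    print(translate("ATGGGCGAGA CCGGCGCGCT GTCGGGCGAG"))
```

Applied to the first 30 bases of the gene sequence, this yields MGETGALSGE, which matches the first ten residues of the protein sequence above.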