Gene Rsph17025_4072 details

Gene Information

Locus tag: Rsph17025_4072
Symbol:
ID: 5086245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 122326
End bp: 124620
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 75%
IMG OID: 640485635
Product: hypothetical protein
Protein accession: YP_001170229
Protein GI: 146280072
COG category:
COG ID:
TIGRFAM ID:
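
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 122326
end_bp = 124620

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # matches "Gene Length" above
print(protein_length_aa, "aa")  # matches "Protein Length" above
```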


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0374389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.138857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATCAG CCCTCCTCGA TCCGACAAGG CCGGTCCGGC GCGCCGCCGC GGCCCTTGTT 
GTTGCCCTTG CTCTTCTGAC GGGCGGATTT TCCGCAAGTC CGGCCCTGGC GTGCGATGAG
GACACCGCGC GGCTGGTGCC GCGCACCATC CTCGCGCTGC ATGACGGTGG CCGGGAAGGC
GCGGTGCGCC AGACCCGCCT TCATCGGGCG GCGGAAATGA TCCTGAACCA TATGGGGTAT
GTCCTCGATT ATCACGACGT GGCGGCCGGA CCGCCACCGC CCCTCCTGCC GGCGGAGGTG
GCGGCGACTC TCTCGTGGTT CGACGCCCCC CTGGCGCGGG ATGCGGACTT CGCCCGCTGG
GCGGCCTCGG TCGAGCGCGA CTGCGGCGGC CTGCGGATGA TCGCGTTCGA CCATGTCGGG
ATCCGCCCCG AACTGGCCTA TGACGGGCAG CGCGACGCCT ATCTGTCGCG GCTGGGGCTC
AAGGTGTCGG GGGGCGAGAC CGCGCTTGGC GTGCTGTCGC GCACGGTGGC GATGGATGCG
GCGATGATCG GCCACGAGAC GCCCTTCGAC ATCGAGCCGG GCCGCCATGC CGCGCTCGAG
GCGGTCCCGC CCGCGCGCAG CCTGCTCCGG GTCGAGGCCG CGCCCGGCGC GCCCGCCTTC
GATCTCGTGG TGCTGGGGCC GCGCGGCGCC TTTGCCGACA GTTCGGCCAC GCTGCGCCAC
GATGCGCGGG GCGGCCTGTC CTTCTGGGTG CTCGATCCCT TCGCCTTCTT CGAGGCCGCG
CTCGGCCAGC CGCCGGCCCC GGTCCCGGAC GTGACGACCG AGGGCGGGCG GCGGCTCTTC
TTCTCGGTCG TGCGCCCCGA GGGCTGGCTG GTGGCCGAGC CCGCGCTCCG CTTCGGGCAG
AAGACGCGCA TCGGCTCGGA ACTGCTGCTC GACCGGCTGG TGGCGCCCTA TCCCGACCTT
CCGGTCAGCA TCGGCGTGCT GACCGGGGAT CTCGATGCCA CCCTCGCCGG GCCCGAGGCG
CCGCGCGGGC GCGTCGCGGC GCGCGCGCTC TTTGCCCTGC CGCAGGTCCG GCCCGGCACC
GCGGGCCACA GCCTGGTGCG GACCTGGGCC TTCTTCGACG GGTATGACCC CGCGCGCGAG
GCGCAGCTGC TGCGCGAGCT GCGCGGGGCG GCCGATCCCG GGGCGGCGGG GCTGATCGGC
GCGGCGGTCC GCACACTGGA GCGGGCGCTG CCCCTGCCTG CGCCGGTGGC CGCCGGCCGG
ATGGCCGAGG CGCCGCGGCG CTACATGCGC GATCCGTTCG ACCTCTGGCG CGAGACGACC
GGCGCCCTGG CCGAGGTGTC CGAGCTGGCC GGGGGCCGCC CGGCGGGCTT CCATCTCTGG
ACGGGGGATG CCAGCCCCCA CGCCGACGCG ATGGCGGCCG TGGCCGGGGC CGGGGCGGCG
GGGCTGGGGG GCGGAGGCGG GATCTACAAC CGCTTCGCCC CGGCGCTGTC GAACCTCTCG
GCCTATGCCG TCCGCGAGGG CGGACAGCTG CAGGTCTACA ACGCCCTGTC GGGCGACGAG
GCCTACACCG GCTTCTGGAC CACGCCCGAG CACGGGTTCC ACGCCTTTGC CGACACGGTC
CGGGCGACCG GAAGCCCCCG GCGCCTGCGC CCGTTCCAGC TGGGCTTTGC CGCCAGTTCG
GCCCTGTCCT TCGGCACGCG GGCGGCGGTC GAGGCGGCGC TCGGGCTGGC ACGCGCCTCG
CCGGTCCTGC CTGTGGCCGC CGCCGATTAC GTCCGCATCG TCGAGGGCTT TCACTCGGCC
CGCCTGCGGC GCGAGGGCGA GCGGGCCTGG CGCATCACGA ACCGGGGCGC GTTGCAGACC
CTGCGGGTCG AGCGGGCGGC GGGGCTTGCG CTCGATCTGG CGGCCAGCCG CGGCGTGCTG
GGGGCGGTGC GCGAAGAGGA CAGGCTCTAC ATCGCGCTGA ACCCGTCGGA CCCCGAGCCG
CTGGTGGCGC TGGCCGAGGA TCCCGCTCCG CTGGGAATGC GCGGCCTGGA CCGTCCGGCC
CTCGTCTCCA GCCGCTTCCG GGTCCTGGAC TTCGAGGCCG GACGCTGCCG GATCGCCCTG
CGGCTGCAGG GATGGGCGCC GGGCGTGATG GACTGGCAGG CCCGGCCCGG AACGCGCTTC
GCGATCGATG TCCGGCGGCT CGGCACGTCC TACGCCGTGG GCAGCGCCAT CGCCGATCCC
GAGGGGCGGC TGTCGATCCA GATCCCCGAT CCGCAGGGTG CGGATCTCGA CGTGACGCTT
GACGGCGACT GCTGA
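
A GC content figure such as the one reported above is conventionally the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, not necessarily how this record's value was derived; the function name gc_fraction is hypothetical. It strips the 10-base grouping spaces used in the listing before counting, and is shown here on the last line of the gene sequence only.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# Last line of the gene sequence above (15 bases), used only as a small sample.
sample = "GACGGCGACT GCTGA"
print(f"{gc_fraction(sample):.1%}")
```

Passing the full gene sequence (all lines joined) to the same function gives the whole-gene value.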
 
Protein sequence
MRSALLDPTR PVRRAAAALV VALALLTGGF SASPALACDE DTARLVPRTI LALHDGGREG 
AVRQTRLHRA AEMILNHMGY VLDYHDVAAG PPPPLLPAEV AATLSWFDAP LARDADFARW
AASVERDCGG LRMIAFDHVG IRPELAYDGQ RDAYLSRLGL KVSGGETALG VLSRTVAMDA
AMIGHETPFD IEPGRHAALE AVPPARSLLR VEAAPGAPAF DLVVLGPRGA FADSSATLRH
DARGGLSFWV LDPFAFFEAA LGQPPAPVPD VTTEGGRRLF FSVVRPEGWL VAEPALRFGQ
KTRIGSELLL DRLVAPYPDL PVSIGVLTGD LDATLAGPEA PRGRVAARAL FALPQVRPGT
AGHSLVRTWA FFDGYDPARE AQLLRELRGA ADPGAAGLIG AAVRTLERAL PLPAPVAAGR
MAEAPRRYMR DPFDLWRETT GALAEVSELA GGRPAGFHLW TGDASPHADA MAAVAGAGAA
GLGGGGGIYN RFAPALSNLS AYAVREGGQL QVYNALSGDE AYTGFWTTPE HGFHAFADTV
RATGSPRRLR PFQLGFAASS ALSFGTRAAV EAALGLARAS PVLPVAAADY VRIVEGFHSA
RLRREGERAW RITNRGALQT LRVERAAGLA LDLAASRGVL GAVREEDRLY IALNPSDPEP
LVALAEDPAP LGMRGLDRPA LVSSRFRVLD FEAGRCRIAL RLQGWAPGVM DWQARPGTRF
AIDVRRLGTS YAVGSAIADP EGRLSIQIPD PQGADLDVTL DGDC
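
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon-to-amino-acid mapping from the standard NCBI ordering (bases in TCAG order), which table 11 shares with the standard code for amino acid assignments, and translates the first 30 and last 15 bases of the gene as a spot check. The names CODON_TABLE and translate are illustrative, and the alternative start codons allowed by table 11 are not handled here.

```python
from itertools import product

# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 bases of the gene sequence, copied from the listing.
first_30 = "ATGAGATCAG CCCTCCTCGA TCCGACAAGG"
last_15 = "GACGGCGACT GCTGA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_15))   # compare with the end of the protein sequence
```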