Gene Rsph17025_3694 details

Gene Information

Locus tag: Rsph17025_3694
Symbol:
ID: 5085560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 596174
End bp: 598507
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 55%
IMG OID: 640485257
Product: hypothetical protein
Protein accession: YP_001169866
Protein GI: 146279708
COG category:
COG ID:
TIGRFAM ID:
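
The start, end, and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(596174, 598507)
print(length)                  # 2334
print(protein_length(length))  # 777
```

Both values match the Gene Length and Protein Length entries listed for this record.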


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.560249
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.910445
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCCAG TGCAGGCTGC CGATGAAATG TACGACTCCA ACCCGCATCC GGACCGGCGC 
CAGCTAGTCT CGAATGGCTT CGAGGTTAAT CTGCCTGACC AGGTCGAAGT GATTGTTCGC
GATCTGCCGG ATCCATCGAA AGTGAAGGAA GAGCGCACCC GGCTGATGGG GTATTGGTTT
GTTCATTGGT TTGACGGGAA GCTTTTTCAC CTCCGCATCA AGGCCGGCGG CCCGAATGTT
GATGGCGAAC ACCGAGCCAT CCGAACGGCT GAACATCCCT GGCTTCTACG AGCCCGACTG
GATGATGCAC TGGAGGAAGC CCTACCGAAA TATGCAGCTG TAAAGAAAAG GCCCTTCACT
TTCCTCGCAC AGAAAGACGA ATTGATCGAT GCTGCGGCTA CGGCGGCCGG GTTGTCCCAC
CGGCTGTTGA ACAGCTTCAA AGTCATCCCC CGCTTCGCGT TGAGCCCGAA AATCTACGAA
CCAGTAGACG GCACAACCCG TGTCGGCGTC TTTGTCACAA TCGGCATGCG CTACGACATA
GAGGCTAGTC TGCGAGACCT TCTTGAAGCC GGTATAGATC TTCGTGGGAT GTATGTCGTC
AGGCGGAAGA GACAGCCTGG TGAGCGAGGA TTGCTGGGCC GCGTTCGAGC AATCAGTGAC
GATATGGTTC AGCTTTTCGA GGAAACGGAT CTGGCCAGTG TCAACGTGAA CGACGCAAAG
CTGGAAGGGT CAAAGGAGAA TTTCACTAGA TGCCTGTCTG CATTGTTGGG CCACAACTAC
AAGAAGCTTT TGAACGCCCT TGATGATCAG GAGGCAGGCT ACCGCACGGG CCCAAGGTTC
GATGATGCTG TCAGGAGAAT GGGTGAGTTT CTAGCGAAAA AACCCATCAG GCTTGCCGAT
AACATAAACG CTCAGGTCGG GGACCGGATT GTGTTCTCCA ATGAGGGACA GGCACGCAAC
GTCCGGTTGG CGCCCAAGGT AGAATATGTG TTCGACCGCA CGGGAGCAAA ATCGGCGGAG
TATGCTTGGA GAGGGCTATC CCAATTTGGT CCTTTTGATC GACCAAGCTT TGCCAATCGT
TCCCCGCGAA TTCTTGTGGT CTATCCCTCC TCCACTCAAG GAAAAGTGGA GAATTTCTTG
TCGGCGTTCC GTGACGGGAT GGGTTCGAAC TACTCGGGTT TCTCGAAGGG TTTTGTCGAC
CTGATGGGTT TGACCAAGGT TGAGTTTGTG ATGTGCCCGG TGGAGGTGAG CAGCGCAGAT
CGAAATGGCG CTCACACCAA GTACAATTCG GCGATTGAAG ACAAGCTCGC CGGGGCGGGA
GAGGTCCATG CCGGTATTGT GGTTCTGTTC GAAGACCACG CACGGCTCCC GGACGACCGA
AACCCATATA TCCATACCAA GTCCCTGCTA CTGACGCTCG GCGTTCCAAC GCAGCAAGTC
AGAATGCCAA CTGTTCTTCT AGAGCCTAAG AGTCTGCAGT ACACACTGCA GAATTTCTCC
ATCGCGACCT ATGCAAAGTT GAATGGCACG CCATGGACCG TGAACCACGA CAAAGCAATC
AACGACGAAC TAGTGGTGGG GATGGGGCTG GCGGAGCTTT CGGGCAGCCG GACAGAAAAG
CGCCAGCGGT TTGTCGGCAT CACCACAGTA TTTGCGGGAG ACGGCTCCTA TCTCCTCGGC
AACGTGTCCA AGGAATGCGA GTACGAAGGC TACTCAGACG CAATTCGTGA GTCCATGACT
GGCATTTTGC GCGAACTTAA GAAGCGCAAT AATTGGCGCC CGGGGGATAC GGTTCGCGTT
GTATTCCACG CCCACCGTCC ACTGAAACGG GTGGACGTAG CCAGCATCGT TTTCGAATGC
ACGCGTGAGA TCGGCAGTGA TCAGAACATC CAGATGGCGT TTGTCACCGT CTCGCATGAC
CACCCATTCG TGCTCATCGA CAGGTCTGAG CGTGGCTTGG AGGCATACAA GGGCAGCACC
GCGCGGAAGG GGGTCTTTGC GCCCCCACGC GGCGCAATAT CGCGTGTGGG TCGGCTGACA
CGCCTTTTGG CGGTTAACTC CCCGCAGTTG ATCAAGAGGG CAAACACACC TTTGCCGACA
CCTCTTCTTG TCTCGCTCCA TCCGGATTCG ACGTTCAAAG ATGTGGACTA TCTCGCTGAG
CAGGCATTGA AATTCACTAG CCTTTCATGG CGCTCCACGC TTCCCGCTGC CACCCCCGTA
ACGATTTTCT ATTCGGAACG AATTGCAGAG CTTTTGGGTC GTCTCAAAAG CATTCCCAAC
TGGTCTTCTG CCAACCTGAA CATCAAGCTC AAGTGGAGCC GCTGGTTCCT ATGA
 
Protein sequence
MAPVQAADEM YDSNPHPDRR QLVSNGFEVN LPDQVEVIVR DLPDPSKVKE ERTRLMGYWF 
VHWFDGKLFH LRIKAGGPNV DGEHRAIRTA EHPWLLRARL DDALEEALPK YAAVKKRPFT
FLAQKDELID AAATAAGLSH RLLNSFKVIP RFALSPKIYE PVDGTTRVGV FVTIGMRYDI
EASLRDLLEA GIDLRGMYVV RRKRQPGERG LLGRVRAISD DMVQLFEETD LASVNVNDAK
LEGSKENFTR CLSALLGHNY KKLLNALDDQ EAGYRTGPRF DDAVRRMGEF LAKKPIRLAD
NINAQVGDRI VFSNEGQARN VRLAPKVEYV FDRTGAKSAE YAWRGLSQFG PFDRPSFANR
SPRILVVYPS STQGKVENFL SAFRDGMGSN YSGFSKGFVD LMGLTKVEFV MCPVEVSSAD
RNGAHTKYNS AIEDKLAGAG EVHAGIVVLF EDHARLPDDR NPYIHTKSLL LTLGVPTQQV
RMPTVLLEPK SLQYTLQNFS IATYAKLNGT PWTVNHDKAI NDELVVGMGL AELSGSRTEK
RQRFVGITTV FAGDGSYLLG NVSKECEYEG YSDAIRESMT GILRELKKRN NWRPGDTVRV
VFHAHRPLKR VDVASIVFEC TREIGSDQNI QMAFVTVSHD HPFVLIDRSE RGLEAYKGST
ARKGVFAPPR GAISRVGRLT RLLAVNSPQL IKRANTPLPT PLLVSLHPDS TFKDVDYLAE
QALKFTSLSW RSTLPAATPV TIFYSERIAE LLGRLKSIPN WSSANLNIKL KWSRWFL
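
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces and line breaks must be removed before any analysis. The sketch below is one possible way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first displayed line of the gene sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed on the page.
first_line = "ATGGCCCCAG TGCAGGCTGC CGATGAAATG TACGACTCCA ACCCGCATCC GGACCGGCGC"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field.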