Gene Rsph17029_3201 details


Gene Information

Locus tag: Rsph17029_3201
Symbol:
ID: 4898687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 242521
End bp: 244353
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 63%
IMG OID: 640113800
Product: hypothetical protein
Protein accession: YP_001045070
Protein GI: 126463957
COG category:
COG ID:
TIGRFAM ID:
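
The coordinate and length fields in this record are related by simple arithmetic, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(242521, 244353)
print(length, protein_length_aa(length))  # prints: 1833 610
```

Both values agree with the Gene Length and Protein Length fields of the record.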


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGTAC CCGTGCTGCC GATGCTCGAG GGCGAGACGC TCATGTCCTA TACCGGCCGC 
GTCGGTCGCT TTCACACGAC GCGCTCACCT CTGCAATTCC TGGAATTGAT CCAGCTTCGT
CGGCGCGCGG TGGTCGAAGG AGCCGAGGAT GCTCTCAGCC GGCTCTCAAT GATGACGGGC
CTGCCGGAGG AGCGGCTTTC CGCAGGGACC ATCGGCAGCG CGGGCGACAG GATATACACG
TTCGGCGCCG AGCAGTTCCA CTCGGTATTC GCCATCCGCG ACACGGCAAC CTTCTGCCCC
GCCTGCCTGC TTGAGGATCG TGTCGACGCA GCAACGAAGG GGCAGCGCGT CGGTCGGCTG
ATTTGGCTTT TCCGGACTAC CCGCACCTGC CCGCGGCATG GCGTAGCGAT GATCCGGCGT
CCGTCGGCCG CAATGGCTTT GGGCGGTTTT CTCGACATGG AAGCGGTCGC CGGATCCGAC
GAAGAGCTTG AGCGGATCGT CCTGCAATTG CCGCGCCGCG ACGTGTCTCC TCTCCAGCGG
TACGTGACCG AGCGCCTGCT CGGCGCGGCG GGTCCAGCCT GGCTCGATGG ACAGCAGATG
GACCATGGCG CCCGTGCCAG CGAAATGCTC GGCGCCTGCC TGGAGTACGG TACCGACTTC
GCGGCGAACG GCCTGAGCGA GGACGACTGG GACAGGGTTG GCCGTACCGG GTACGCGTAT
ACAGCCCGCG GCACGGAAGG CGTCGCCGAG GCACTGCAGC TTCTGCACAC GCAGTTCCTG
CAATCCGGGA AGGACGGGGG GCCCCAGCAG GTCTTCGGCT CGTTGTACAA ATGGCTGCAG
TTCCGCCCCT ACGCGAAGCC AGCCGGGTTG ATCGAGGATC TGGTTCGCGA CTACATCCTT
GATCATTTTT CGGTGGAGCC AGGAAAGAAA CTTCTCGGAG TTCCCGTGGT AAAGCGTCAA
CGGCACAGCA TCGGAAGCCT TGCGGCGGCA ACCGGTCTTC ATCCCCGGAC CTTGAACCGC
GCGCTCATCA TTACCGGAGT GCTTTCGGGT GACCCGGATG TGGTCGACGG ACGCTCGTCA
TTCGATGCAA AGACGGGGGA AGATCTTGCC GGACGCATCC GGAACTCGAT CTCGACCACC
CAACTGCCGA AGTATCTTGG GTGCAATAGA ACTCAGGCGC AGGAACTGGT GCGCAGTGGG
GTGCTGCCCC GGATCGTGAA CCAGGATGGC AAGCAGACGG GGATGCTGTC GAACGTGCCG
CTAGCCGAGG TCGACGACTT CCTACACCGG TTGCGCGCCG CAGGCGTTCT GGTCGATGCG
CCCGGAGCCG GAATGATGGA CATCGTGGCG GCCTCGAGTG CCGCTCGCTG GCCTGCGCTT
GACATCGTGA AACTCGTGCT CGCCGGCGCC CTCGCGCGCG TCGAGGTGCT CGGAACCGAT
CTGAAGTTTC TCTCAGTGCT GGTCGATCCG ATGGAGGTGC GCGCAAAGAC CCATCTCGAG
GAGACTGCCG ACGGCCTCAG TCAGGCGGCC GCCGCTCGCC TCCTCGGCGT GATGACCAGC
GGGCTGACGT ACCTCGTGCA GAACAAAGAC CATGATGGCA AGCCGTTCAT CCCTTACATC
CCGGTGCGCA ACTCGGCAGG CAGGGAACAA CGTTATTTTG ACGCGCGCGA GCTAGCGCGG
TTTTCCGACC GGTACATTCA TCTCAAGGAT GCGGCGCGCC AGGCAGGAAT CTCGTCAAAA
CTGATGCGGC AGCATCTCGC GAGCCGAGGC ATCGAGCCGA TTGCACCGAG AAATGTGCTG
AACGCTCAGA TGTACCGACG ATCCGAGATC TGA
 
Protein sequence
MLVPVLPMLE GETLMSYTGR VGRFHTTRSP LQFLELIQLR RRAVVEGAED ALSRLSMMTG 
LPEERLSAGT IGSAGDRIYT FGAEQFHSVF AIRDTATFCP ACLLEDRVDA ATKGQRVGRL
IWLFRTTRTC PRHGVAMIRR PSAAMALGGF LDMEAVAGSD EELERIVLQL PRRDVSPLQR
YVTERLLGAA GPAWLDGQQM DHGARASEML GACLEYGTDF AANGLSEDDW DRVGRTGYAY
TARGTEGVAE ALQLLHTQFL QSGKDGGPQQ VFGSLYKWLQ FRPYAKPAGL IEDLVRDYIL
DHFSVEPGKK LLGVPVVKRQ RHSIGSLAAA TGLHPRTLNR ALIITGVLSG DPDVVDGRSS
FDAKTGEDLA GRIRNSISTT QLPKYLGCNR TQAQELVRSG VLPRIVNQDG KQTGMLSNVP
LAEVDDFLHR LRAAGVLVDA PGAGMMDIVA ASSAARWPAL DIVKLVLAGA LARVEVLGTD
LKFLSVLVDP MEVRAKTHLE ETADGLSQAA AARLLGVMTS GLTYLVQNKD HDGKPFIPYI
PVRNSAGREQ RYFDARELAR FSDRYIHLKD AARQAGISSK LMRQHLASRG IEPIAPRNVL
NAQMYRRSEI
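
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal example, not the database's own pipeline. It assumes the sequences are pasted in as 10-character blocks separated by whitespace, as they appear in this record. It relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code and differs only in its permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same amino-acid assignments as the standard table;
# alternative start codons are not handled here (this gene starts with ATG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 33 bases of the gene sequence in this record.
first_block = "ATGCTGGTAC CCGTGCTGCC GATGCTCGAG"
last_block = "AACGCTCAGA TGTACCGACG ATCCGAGATC TGA"
print(translate(first_block))  # MLVPVLPMLE
print(translate(last_block))   # NAQMYRRSEI
```

The two printed fragments match the first and last ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked in the same way.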