Gene Rsph17029_4069 details


Gene Information

Locus tag: Rsph17029_4069
Symbol:
ID: 4895024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 5009
End bp: 6781
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 76%
IMG OID: 640110471
Product: TPR repeat-containing protein
Protein accession: YP_001041783
Protein GI: 126464807
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
TIGRFAM ID:
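
The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which is not translated. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the untranslated stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5009, 6781)
print(length_bp)                  # 1773
print(protein_length(length_bp))  # 590
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.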


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value: 0.00017793
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 89
Fosmid unclonability p-value: 0.44147
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGC AGCCCGCCCC GATCCTGCCC ATCGGCTCCG TCTCTCCCCC GCTCACGGCC 
GAGCAGCTGG TGGGTCTGGC CGAGGCCGCC CCCGCCGCGG CGATCGAGAT CTACCGCCGC
TGGCTCGCGC TCCATCCCGA GCGGCCCGAC GCCTGGATCG CCTGGTTCAA TCTCGCGGTG
CTGCTCGAGG CGGCGGGCGA GCCGCAGGGG GCGCTCGGCG CGGCCGCCAC CGCGCTCCGC
CAGAAGCCGG ACCTGTGGCA GGCGGCCCTC GCGGCGGGTC AGGCGGCCGA GGCGCAGGGC
GACCGGACGC AGGCGCTGGC CTTCCTGCGC CAGGTGCTGC CCCCGGCCGA GGGGCGGCGC
CAGCTCCACC GCCAGCTCGG CCGGATGCTC GAGGCCGAGG GCCGGCTCGC GGAGGCCGCC
GAGGAGCTGC GGGCCTCGCT TCTCCTCGAT CCCCGCCAGC CCGAGGTGGT CCAGCATCTC
GTCCATGCCA GCCAGAAGAT GGCGGCCTGG CCCCCGGCCC GGCTCGCCGT CCCCGGCCTG
ACCGAGGCCG AGGCCGAGCT GCGGTGCGGC CCGCTCGCCA CCCTCGCGCT GCATGACGAT
CCCGTGCGGC AGGGCGAGGT GGCCGCGGCC TGGATCGCCC GGCATGTGCC CGATCCGGGC
ATCCGGCTCG CCCCGGCCGG GGGCTACCGC CACGACCGGC TGCGGCTCGG CTATCTCTCG
TCGGACTTCT GCCGCCACGC CATGAGCTTC CTCATCGCCG AACTGCTCGA GCGCCACGAC
CGCAGCCGGT TCGAGGTGGT GGGCTACTGC GCCTCGCCCG AGGACGGCAG CCCCGAGCGC
GCGCGGGTGC TCGCCGCCCT CGACCGGCAT GTGCCGATCG GCCCCCTCTC CGACGAGGCC
GCGGCCCGGC GCATCCGCGC CGACGAGATC GACCTGCTGA TCGATCTCAA CGGGCTGACC
CGCGGCGCGC GGCCGGGCAT CCTGCGCTGG AAGCCCGCCC CGGTGCAGGC GACCTATCTG
GGCTATATCG GGCCGGTCCC GCTGCCCGAG CTCGACTGGC TGATCTGCGA CCGAGTGACC
GTGCCCGAGG CCGAGGCCGC CCATTACCGC CCGGCCCCGC TCCGGCTCGA GGGCTGCTAT
CAGGCCAACG ACGGGCAACG GCCCCTGCTG CCCGCCGTCG ACCGCCCGGG CGAGGGCTTG
CCCGAGGCCG CCTTCGTCTT CGCCTGCGCC TCGCATTTCT ACAAAATCAC CGAGCCCCTC
TTCGCCGCCT GGTGCCGGAT CGTCGCGGCC GTGCCGGGGT CGGTCCTGTG GCTCGTCGCG
GATACGCCCG AGGGGCAGGC GGCGCTGGCC GGCCGCTGGC AGGCGGCGGG CCTCGACCCC
CACCGGCTGA TCTTCGCCCC CCGCGTCGAT CCCGCCCGCT ACCGGGCGCG GCTGGCACTG
GCCGACCTCT TTCTCGACAC GATGCCCTAC AATGCCGGGA CCATCGCCTC GGACGCGCTC
CGGATGGGGC TGCCCGTGCT CACGCTCGCG GGGCGGACCT TCTCGGGCCG GATGGCGGCG
AGCCTCCTCA CGGCGGTGGG GCTGGAAGAT TGCATCGCCC CCGACCTCGA GGCCTATGTC
GCCCGCGCCG TGGCGATCGC CACCGACCCT GCGGCGGCCC CCGCCCTGAC GGGGCCCGCG
CTCGCCGAGC GCTGGAGCCT CACCTTGGGC GACTGCCGCG ATTTCACCCG CCGCTTCGAG
GCGGCCCTGC TCTCGGTCGC CCGCCGCGCC TGA
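
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence contains only A, C, G and T and is broken into whitespace-separated blocks as above. `gc_fraction` is a hypothetical helper name, and the sample input is only the first 20 bases of the gene, so its result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence (a sample, not the full gene).
sample = "ATGACGGTGC AGCCCGCCCC"
print(gc_fraction(sample))  # 0.75
```

Passing the full gene sequence, with its line breaks and spaces left in place, would give the fraction for the whole CDS.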
 
Protein sequence
MTVQPAPILP IGSVSPPLTA EQLVGLAEAA PAAAIEIYRR WLALHPERPD AWIAWFNLAV 
LLEAAGEPQG ALGAAATALR QKPDLWQAAL AAGQAAEAQG DRTQALAFLR QVLPPAEGRR
QLHRQLGRML EAEGRLAEAA EELRASLLLD PRQPEVVQHL VHASQKMAAW PPARLAVPGL
TEAEAELRCG PLATLALHDD PVRQGEVAAA WIARHVPDPG IRLAPAGGYR HDRLRLGYLS
SDFCRHAMSF LIAELLERHD RSRFEVVGYC ASPEDGSPER ARVLAALDRH VPIGPLSDEA
AARRIRADEI DLLIDLNGLT RGARPGILRW KPAPVQATYL GYIGPVPLPE LDWLICDRVT
VPEAEAAHYR PAPLRLEGCY QANDGQRPLL PAVDRPGEGL PEAAFVFACA SHFYKITEPL
FAAWCRIVAA VPGSVLWLVA DTPEGQAALA GRWQAAGLDP HRLIFAPRVD PARYRARLAL
ADLFLDTMPY NAGTIASDAL RMGLPVLTLA GRTFSGRMAA SLLTAVGLED CIAPDLEAYV
ARAVAIATDP AAAPALTGPA LAERWSLTLG DCRDFTRRFE AALLSVARRA