Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_4069 |
Symbol | |
ID | 4895024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | + |
Start bp | 5009 |
End bp | 6781 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640110471 |
Product | TPR repeat-containing protein |
Protein accession | YP_001041783 |
Protein GI | 126464807 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 0.00017793 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 89 |
Fosmid unclonability p-value | 0.44147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTGC AGCCCGCCCC GATCCTGCCC ATCGGCTCCG TCTCTCCCCC GCTCACGGCC GAGCAGCTGG TGGGTCTGGC CGAGGCCGCC CCCGCCGCGG CGATCGAGAT CTACCGCCGC TGGCTCGCGC TCCATCCCGA GCGGCCCGAC GCCTGGATCG CCTGGTTCAA TCTCGCGGTG CTGCTCGAGG CGGCGGGCGA GCCGCAGGGG GCGCTCGGCG CGGCCGCCAC CGCGCTCCGC CAGAAGCCGG ACCTGTGGCA GGCGGCCCTC GCGGCGGGTC AGGCGGCCGA GGCGCAGGGC GACCGGACGC AGGCGCTGGC CTTCCTGCGC CAGGTGCTGC CCCCGGCCGA GGGGCGGCGC CAGCTCCACC GCCAGCTCGG CCGGATGCTC GAGGCCGAGG GCCGGCTCGC GGAGGCCGCC GAGGAGCTGC GGGCCTCGCT TCTCCTCGAT CCCCGCCAGC CCGAGGTGGT CCAGCATCTC GTCCATGCCA GCCAGAAGAT GGCGGCCTGG CCCCCGGCCC GGCTCGCCGT CCCCGGCCTG ACCGAGGCCG AGGCCGAGCT GCGGTGCGGC CCGCTCGCCA CCCTCGCGCT GCATGACGAT CCCGTGCGGC AGGGCGAGGT GGCCGCGGCC TGGATCGCCC GGCATGTGCC CGATCCGGGC ATCCGGCTCG CCCCGGCCGG GGGCTACCGC CACGACCGGC TGCGGCTCGG CTATCTCTCG TCGGACTTCT GCCGCCACGC CATGAGCTTC CTCATCGCCG AACTGCTCGA GCGCCACGAC CGCAGCCGGT TCGAGGTGGT GGGCTACTGC GCCTCGCCCG AGGACGGCAG CCCCGAGCGC GCGCGGGTGC TCGCCGCCCT CGACCGGCAT GTGCCGATCG GCCCCCTCTC CGACGAGGCC GCGGCCCGGC GCATCCGCGC CGACGAGATC GACCTGCTGA TCGATCTCAA CGGGCTGACC CGCGGCGCGC GGCCGGGCAT CCTGCGCTGG AAGCCCGCCC CGGTGCAGGC GACCTATCTG GGCTATATCG GGCCGGTCCC GCTGCCCGAG CTCGACTGGC TGATCTGCGA CCGAGTGACC GTGCCCGAGG CCGAGGCCGC CCATTACCGC CCGGCCCCGC TCCGGCTCGA GGGCTGCTAT CAGGCCAACG ACGGGCAACG GCCCCTGCTG CCCGCCGTCG ACCGCCCGGG CGAGGGCTTG CCCGAGGCCG CCTTCGTCTT CGCCTGCGCC TCGCATTTCT ACAAAATCAC CGAGCCCCTC TTCGCCGCCT GGTGCCGGAT CGTCGCGGCC GTGCCGGGGT CGGTCCTGTG GCTCGTCGCG GATACGCCCG AGGGGCAGGC GGCGCTGGCC GGCCGCTGGC AGGCGGCGGG CCTCGACCCC CACCGGCTGA TCTTCGCCCC CCGCGTCGAT CCCGCCCGCT ACCGGGCGCG GCTGGCACTG GCCGACCTCT TTCTCGACAC GATGCCCTAC AATGCCGGGA CCATCGCCTC GGACGCGCTC CGGATGGGGC TGCCCGTGCT CACGCTCGCG GGGCGGACCT TCTCGGGCCG GATGGCGGCG AGCCTCCTCA CGGCGGTGGG GCTGGAAGAT TGCATCGCCC CCGACCTCGA GGCCTATGTC GCCCGCGCCG TGGCGATCGC CACCGACCCT GCGGCGGCCC CCGCCCTGAC GGGGCCCGCG CTCGCCGAGC GCTGGAGCCT CACCTTGGGC GACTGCCGCG ATTTCACCCG CCGCTTCGAG GCGGCCCTGC TCTCGGTCGC CCGCCGCGCC TGA
|
Protein sequence | MTVQPAPILP IGSVSPPLTA EQLVGLAEAA PAAAIEIYRR WLALHPERPD AWIAWFNLAV LLEAAGEPQG ALGAAATALR QKPDLWQAAL AAGQAAEAQG DRTQALAFLR QVLPPAEGRR QLHRQLGRML EAEGRLAEAA EELRASLLLD PRQPEVVQHL VHASQKMAAW PPARLAVPGL TEAEAELRCG PLATLALHDD PVRQGEVAAA WIARHVPDPG IRLAPAGGYR HDRLRLGYLS SDFCRHAMSF LIAELLERHD RSRFEVVGYC ASPEDGSPER ARVLAALDRH VPIGPLSDEA AARRIRADEI DLLIDLNGLT RGARPGILRW KPAPVQATYL GYIGPVPLPE LDWLICDRVT VPEAEAAHYR PAPLRLEGCY QANDGQRPLL PAVDRPGEGL PEAAFVFACA SHFYKITEPL FAAWCRIVAA VPGSVLWLVA DTPEGQAALA GRWQAAGLDP HRLIFAPRVD PARYRARLAL ADLFLDTMPY NAGTIASDAL RMGLPVLTLA GRTFSGRMAA SLLTAVGLED CIAPDLEAYV ARAVAIATDP AAAPALTGPA LAERWSLTLG DCRDFTRRFE AALLSVARRA
|
| |