Gene Information |
Locus tag | Rsph17029_2444 |
Symbol | |
ID | 4897944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2578803 |
End bp | 2579636 |
Gene Length | 834 bp |
Protein Length | 277 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 640113042 |
Product | hypothetical protein |
Protein accession | YP_001044318 |
Protein GI | 126463204 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
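The coordinate and length fields in this record are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch, assuming 1-based inclusive coordinates (which the listed values imply) and a protein length that excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2578803, 2579636)
print(length)                  # 834
print(protein_length(length))  # 277
```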
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0102823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0183813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGACGCTGG CGGCGGGCCT CAAGGCGCCT TCGGAGGCGA TCCGCCGGGT GCGGGTGGCG GGCCATCTGG GCGAGCTCCT GCAGGGCCGC CTCGGGCCCG ACGGGCCGCT GGCGCTGGTG ACCCTGCCCT GCCCCGGCCT CGGCGCCGAG GCGATCCGGG CGCCGGGTCC GCTGGCCCTC GACCAGCCCG GGGCGCAGAT GCTGACGCTG CCCGCGCTCT GCGACCTGCT GGCCTCTGGC GGAGCCGCCC CGGACGGCCG CTTCCGCCTG ACGCTCGACC TGCCGCCGGG CGGCGGGGCG GGCGCCTCGA CCGCGGCGCG GGCGGCGCTC CTCCATGCGG CGGGCGTGGT TGATCCGGCC ACCGTGGCCC GCGCCTGCCT CGCCTCGGAG GGGGCGAGCG ATCCGCTGAT GTTCGACCGG CCCGAGCGGC TGCTCTGGGC CTCGCGCGAG GGGCGGGTGC TGGCAGAGCT GCCGCCCCTG CCGCCGATGG AGCTGGTGGG CGGCTTCTTC GGTCCGGCCC GGCGCACCGA TCCGGCCGAT CTCGCCTTCC CCGACATCTC GGACCTTCTC GAGGGCTGGG GCGCGGCCGA TCTCGCGGGC GTGGCGCGCC GTGCGAGCCT CTCGGCCGCG CGCTGCCTTC AGCTGCGCGG CCCGGCGGAG GATCCGACCG CGGCCCTGGC CCATGGGCTC GGCGCGCTCG GCTGGGCCAT CGGCCACACC GGCCCCGCCC GCGCGCTGAT CTTCCCGCCG GGCGCCGTTC CGCGCGGGGC GGCCGGGGCA TTGCGGGCCG CGGGCTTTTC CCGCATCACG CGCTTCCGCA TCGGCGGAGC CTGA
Protein sequence | MTLAAGLKAP SEAIRRVRVA GHLGELLQGR LGPDGPLALV TLPCPGLGAE AIRAPGPLAL DQPGAQMLTL PALCDLLASG GAAPDGRFRL TLDLPPGGGA GASTAARAAL LHAAGVVDPA TVARACLASE GASDPLMFDR PERLLWASRE GRVLAELPPL PPMELVGGFF GPARRTDPAD LAFPDISDLL EGWGAADLAG VARRASLSAA RCLQLRGPAE DPTAALAHGL GALGWAIGHT GPARALIFPP GAVPRGAAGA LRAAGFSRIT RFRIGGA
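The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below strips the whitespace, computes GC fraction, and translates a CDS with the standard codon assignments that translation table 11 uses. It assumes the first codon is always read as M (the record's gene starts with TTG, an alternative bacterial initiator codon). All names are illustrative, not taken from any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is emitted as M.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 nt of the gene sequence in this record.
fragment = "TTGACGCTGG CGGCGGGCCT CAAGGCGCCT"
print(translate_cds(fragment))  # MTLAAGLKAP
```

Passing the full gene sequence to `translate_cds` and comparing the result with the cleaned protein sequence gives a quick consistency check. `gc_fraction` can be compared against the listed GC content in the same way.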