Gene Information |
Locus tag | Rsph17029_3566 |
Symbol | |
ID | 4898121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 657090 |
End bp | 658289 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640114175 |
Product | cobalamin synthesis protein, P47K |
Protein accession | YP_001045429 |
Protein GI | 126464316 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
| |
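The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 657090
end_bp = 658289

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which codes for no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1200 399
```

Both results match the reported Gene Length (1200 bp) and Protein Length (399 aa).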
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.107446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0714729 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGATA CCCGCCTTCC CGTGACCGTG CTCTCCGGCT TCCTCGGAGC CGGCAAGACC ACGCTTCTGA ACCATATTCT GGCCAACCGC GAGGGGCTGC GCGTGGCCGT CATCGTCAAT GACATGTCGG AGGTGAACAT CGACGCCGAT CTGGTCCGCG CCGGCGGCAG CCTCTCGCGT GGCGAGGAAC GGCTGGTCGA ACTGACCAAC GGCTGCATCT GCTGCACGCT GCGCGACGAC CTGCTGACCG AGGTGCGCCG GCTGGCCGAG GAGGGGCGCT TCGACTATCT GCTGATCGAA TCCACCGGCG TGTCCGAGCC GCTGCCCGTC GCCGCCACCT TCGAGTTCCG CGACGAGGAG GGCCGGTCGC TCTCGGACAT CGCGCGGCTC GACACGATGG TGACGGTGGT CGATGCGGCC AACCTCACCC GCGACTTTTC GGCGCAGGAT TTCCTGAAGG ACCGCGGTGC GGCGATGGCG GAGGAGGACG AGCGCAGCCT TGTCCAGCTT CTGACCGAGC AGATCGAGTT CGCCGACGTG ATCGTGCTGA ACAAGGTCTC GAGCGCGACG CCCGAACAGC TCGCCAGCGC ACGCGCCATC CTGCGGGCGC TGAATGCCGA TGCCGCGATC CTCGAGACCG ATCACGGCCG CGCCCCGCCG CGCGCCATCG TCGGCACCGG GCGGTTCAGC TTCGAGGCCG CGCACCGCCA TCCGACCTGG GTCAAGGAGC TCTACGGCCA TGCCGACCAT GTGCCCGAGG ATGCCGAATA CGGCATCACC AGCTTCGTCT GGCGGGCCGA ACGGCCGTTC GATCCGACGC GGATCGCCGA CTTCTTCGAC ACGCCGCTGC CCGGCGTGAT CCGCGCCAAG GGCCATTTCT GGATCGCCAC GCGCCCCGAC TGGGCGGGCG AGTTCTCGGT GGCGGGGCCG ATGGTCGAGG TGAAGGGGCT GGGCCTCTGG TGGGCCGCCG TGCCCGAGGC GCACTGGCCG CCCGAGGCGC AGGAGCGGAT GGCTGCGCGC GTGGCCGGAG AGTTCGGCGA CCGGCGGCAG GAGATGGTCT TCATCGGCAT GCTCGGGCAG ATGAACCGGG CGCGGATCTC GGCCATGCTC GAGCGCTGCC TTGTCCCCGA AACCCGGTTC GCGCCCGAGG CATGGACGGC GCTGGAAGAT CCGTTCCCGC GCTGGGGCCG CGCGGCATGA
|
Protein sequence | MTDTRLPVTV LSGFLGAGKT TLLNHILANR EGLRVAVIVN DMSEVNIDAD LVRAGGSLSR GEERLVELTN GCICCTLRDD LLTEVRRLAE EGRFDYLLIE STGVSEPLPV AATFEFRDEE GRSLSDIARL DTMVTVVDAA NLTRDFSAQD FLKDRGAAMA EEDERSLVQL LTEQIEFADV IVLNKVSSAT PEQLASARAI LRALNADAAI LETDHGRAPP RAIVGTGRFS FEAAHRHPTW VKELYGHADH VPEDAEYGIT SFVWRAERPF DPTRIADFFD TPLPGVIRAK GHFWIATRPD WAGEFSVAGP MVEVKGLGLW WAAVPEAHWP PEAQERMAAR VAGEFGDRRQ EMVFIGMLGQ MNRARISAML ERCLVPETRF APEAWTALED PFPRWGRAA
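The gene and protein sequences above can be cross-checked with a minimal sketch. It assumes the spaces in the sequences are only display grouping, and that the first codon is a valid start codon. Under translation table 11, GTG is an accepted start codon and is read as methionine at the initiation position, which is why the protein begins with M although the gene begins with GTG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order. Translation table 11
# uses the same codon-to-residue assignments as the standard code; it
# differs in which codons are accepted as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # Assumption: the first codon is a valid start codon, so it is read
    # as methionine whatever it would encode internally.
    if residues:
        residues[0] = "M"
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "GTGACCGATA CCCGCCTTCC CGTGACCGTG"
print(translate(prefix))  # prints: MTDTRLPVTV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the reported protein sequence and GC content.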