Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3679 |
Symbol | |
ID | 4898485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 782257 |
End bp | 783474 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640114287 |
Product | cobalamin synthesis protein, P47K |
Protein accession | YP_001045541 |
Protein GI | 126464428 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.191921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.393021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCG ATCCCGCCCG TTCCGGCAAA CTGCCCACCG CGATCCTTTC GGGCTTTCTG GGTGCGGGAA AGACGACCCT GCTCAACCAC CTGCTGCGCA ACCGCAGCGG CCTGCGCGTG GCCGTGATCG TGAACGACAT GTCCGAGATC TGCATCGACG CCGAGCTCGT GGCCTCGGGC GGCGCCTCGC TCAGCCACAC CGAGGAGGAG CTGGTCGAGA TGTCGAACGG CTGCATCTGC TGCACGCTGC GCGACGATCT GCTGCGCGAG GTGCGCCGTC TGGCGCTCGA GGGGCGGTTC GATTACCTGC TGATCGAGGC GACCGGCATC TCCGAGCCGC TGCCCATCGC CGCCACCTTC GAATTCTGCG ACGGCGCCGA ATCGAGCCTG AGCGACGTGG CGCGGCTCGA TGCGATGGTC ACGGTGGTGG ATGCGGTCAA TCTCACGCAG GATTACCTGA GCCGCGACCT TCTGCGGGAC CGCGGCGAGG TCCGCGGCAC GGACGACCAG CGCACGCTGG TCGAGCTGCT GGTGGACCAG ATCGAATTCG CCGACATCGT GATCCTCAAC AAGACCCGCA CGGCGGGTCC CGAGCGCACC GCGGCGGCGC GGCGCATCGT GCGGGCGCTC AATCCCGATG CGAAGCTGAT CGAGACCGAA CAGAGCGAGG TCGATCCGCG CGAGATCCTC GACACCGGGC TCTTCGATCC GGCCCGTGCC CGCAGCCATC CGCGCTGGCT GCAGGAGCTC TACGGTTTCG CGGCGCACAA GCCCGAGGAT CAGGAATACG GCATCACCTC CTTCTGCTTC CGCGCGCGGG CGCCCTTCGA CGGCAAGCGC ATCCGCGACG TGCTGACGGG CGAGCTGCCG GGGGTGATCC GGGCCAAGGG GCATTTCTGG ACCGCCGACC ACCCCGACCG CGTGCTCCAG TTCAGCCAGG CGGGCAGCCT GCGCACCATC GCGCAGAGCG GGCGCTGGTG GGCGGCCACG CCGCGCAGCG ACTGGCCCGC GGATCGGCGC GTGATCGAGC GGATCGCGCG GCACTGGCGC CCGCCCTACG GCGACCGGCG TCAGGAGCTC GTGTTCATCG GCACGCGCGA GATGGACTAC GGCAGGATCC ATCCGCTGAT CGACGCCTGC CTGCTGCGGG GCACCCCGGT GCGCCGGCCG GCCATGACGG CCGCGGGCCT CGCGTCTCAG GGCGCCCGAC GGGGCTAG
|
Protein sequence | MSSDPARSGK LPTAILSGFL GAGKTTLLNH LLRNRSGLRV AVIVNDMSEI CIDAELVASG GASLSHTEEE LVEMSNGCIC CTLRDDLLRE VRRLALEGRF DYLLIEATGI SEPLPIAATF EFCDGAESSL SDVARLDAMV TVVDAVNLTQ DYLSRDLLRD RGEVRGTDDQ RTLVELLVDQ IEFADIVILN KTRTAGPERT AAARRIVRAL NPDAKLIETE QSEVDPREIL DTGLFDPARA RSHPRWLQEL YGFAAHKPED QEYGITSFCF RARAPFDGKR IRDVLTGELP GVIRAKGHFW TADHPDRVLQ FSQAGSLRTI AQSGRWWAAT PRSDWPADRR VIERIARHWR PPYGDRRQEL VFIGTREMDY GRIHPLIDAC LLRGTPVRRP AMTAAGLASQ GARRG
|
| |