Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4058 |
Symbol | |
ID | 5086231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 101907 |
End bp | 103544 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640485621 |
Product | hypothetical protein |
Protein accession | YP_001170215 |
Protein GI | 146280058 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00410401 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.150946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCC CCGGCCCCGA TCCGGGCGCC CCGGCGCTCC GGGGCGGCGC CATCCGCCCC GGAGGACGGC CGGGTTGGGC GGCGCGGCTC TCCGCCCTTG CGGGCCCCCT GCTCGCGGCG GCGCTGGTCT TCGCCGGCCC CGCCCGCGCC ACGGACTGGC CGGTGGAGCA GTATGATCCC GGCGCGGCCG AGCGTCCGGC CGACCTGATC CTGCCCATGC CCTGCGGCGG GGCGATGGCT TTCCAGAAGG TGGTCGTGCC GGTCGAGGCC GCCGATCCGC TCGACGACCG CCGCCTGCGC CTCGGCCAGT CGCAGCCCGA GACCGGCTAT TCCGACTATC TGCGCACCGA GCATCTGCGC GGCCCCTTCG CCTCGGACGA GGCCACCTTC TACTACATCG GCCGCTATGA GGTGACGCGC GCGCAGCAGC GGGCGCTGGC CTTCGACTGC GCCCCGCCCA GCCGGATGGA CCGGACCGCG GCGGCGGGGC TGTCGTGGTT CGATGCGGTG GCGCTGTCGC AGCTCTACAG CGAATGGCTG CTGGCCGAGG CCCCGGACGC GCTGCCCCCC GAGGCGGAGG GGCTGGCCTT CCTGCGCCTG CCGACCGAGA CCGAGTGGGA ATATGCCGCC CGTGGCGGGG CCGCGACCGA CGCCACGCAG TTCGCCTCGC GGCGCTATTT CTCCGAAGGT CAGATCGCCG ATCATGCGAT GGCGCAGGGC TCGGCGCGGG GAGAGGTGCT GCCCGTCGGG CTGCGCAGGC CGAACCCGCT GGGGCTCCAT GACATCTACG GCAATGCCGA GGAGCTGATG CTCGAACCCT TCCGGCTGAA TGCGGTCGGG CGCCCGCACG GGCAGGTGGG GGGGCTCGTC ACCCGGGGCG GCTCGGTGCT CTCGGCCCCC GAAGAGCTCT ATTCCGCGCA GCGGCGGGAA TATCCGCTCT ACCGCGCCGC CGACGGCAAG GCGCTGGCCG GGGCCACCTT CGGGCTGCGC CTCGTGCTGA CGCGCGATGT CACCTCGTCG GACGCCCGCC TGCGCGCGAT CCGCAGCCGC TGGCTCGACC TGGCCGAGGC GCCGGCGGCG GAGGCCTCGG ATCCGCTGGT CACGCTCTCG GCGCTGATCG AGGAAGAGGC CGACCCGCGC CGGCAGTCGG CTCTGACCGA CCTCCAGCTC GAATTCCGGC TGGCGCGCGA TGCGGCGGCG GCGGCCTTCC GGGAATCGGC GAAATCCACG CTGCTGAGCG GCGCGGTCTT CATCGCGGCC CTGGCCGACG GCGCGCGCGA GATCGACCGC CAGACCGGCA ATGTCCGCGC CATGGTGGAC CAGATCCGGG TGAGCGACGG GGCGCAGCGC GAGGCGCTCA TCGCGGGGGC CGAGCGGGTG AACCGGCAGC TGAGGATGCT GCGCGACCTG CAGCACACCT ATCTTCTGTC CTACCGCAGC GCGCTCGAGA CCCTCTCGTC GGAGATCGAG GGCGAGGTGG TGGAGACGGC CTTCGGCCTG CTCCAGCAGG AGCTTGCGGC CTCGGGCCAG ACCGGGATCC TGTCGGGGCT GGAGGCGCTG AACGAGGATC TCGCCCGCTT TGCCGCCCGG CCGGACATGG TCGAGGCCGA GCTGCTGGCC CTGGCGCTGG AACGCTAG
|
Protein sequence | MRIPGPDPGA PALRGGAIRP GGRPGWAARL SALAGPLLAA ALVFAGPARA TDWPVEQYDP GAAERPADLI LPMPCGGAMA FQKVVVPVEA ADPLDDRRLR LGQSQPETGY SDYLRTEHLR GPFASDEATF YYIGRYEVTR AQQRALAFDC APPSRMDRTA AAGLSWFDAV ALSQLYSEWL LAEAPDALPP EAEGLAFLRL PTETEWEYAA RGGAATDATQ FASRRYFSEG QIADHAMAQG SARGEVLPVG LRRPNPLGLH DIYGNAEELM LEPFRLNAVG RPHGQVGGLV TRGGSVLSAP EELYSAQRRE YPLYRAADGK ALAGATFGLR LVLTRDVTSS DARLRAIRSR WLDLAEAPAA EASDPLVTLS ALIEEEADPR RQSALTDLQL EFRLARDAAA AAFRESAKST LLSGAVFIAA LADGAREIDR QTGNVRAMVD QIRVSDGAQR EALIAGAERV NRQLRMLRDL QHTYLLSYRS ALETLSSEIE GEVVETAFGL LQQELAASGQ TGILSGLEAL NEDLARFAAR PDMVEAELLA LALER
|
| |