Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0646 |
Symbol | |
ID | 3970624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 700223 |
End bp | 701221 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637923763 |
Product | CBS |
Protein accession | YP_530538 |
Protein GI | 90422168 |
COG category | [R] General function prediction only |
COG ID | [COG0517] FOG: CBS domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCGC GTGACGTAAT GACCACTGAC GTTTCAGTCG TGGGGCCGAA TTCTTCCTCG GCGGAAGTGG CACGCATTCT GCTCGCCACA CGGGTCAGCG CCTTGCCTGT CGTCGACCAC GACGGCGCCC CGATCGGCGT CGTCAGCGAA TGGGATCTGG TCGGCCAGCA TGCGACGGAC CGTGTCGCCA AGCGTGAACG GTGGCTGTCG CACTTGGCTG AGGGGCAACC GTTGGCGGCC GACTTCCTGC AATCGGTCGA TCCGACCAAC CGCACTACGG CCGAAATCAT GCATCAACCA GTGATCGCCG TCCCGGAGAC GACGCCGATC GCTGAAGTCG CGCGCTTAAT TGCCGAACAT CGGATCAAGC GTGTCTTCGT CACGCGTGGT GATCGCCTGG TTGGCGTAGT TTCGCGGATC GACTTGGTAC GGGCGCATCT GCTTGAGGCC ACTCCTGTCG CGCTGCATCC GCGTCGCGTC CCGGTGGGAG AGGAATCTTC TGACGTCTCG ACGGCACGCT CCGGCCCGCC GCCGGTGCCG AACCCAAGCA CTTCGGCACC AACGATTGCA AGCGGACCGA GCGCCGCCGA ATTTGACCTG TTGGTCGCAG CCGCGGAACA AGCCGAACAG ATGCAGCGCA CGGCCGCCGA GCGCATCGCC AGCGATGCGC GGCGCGCGCT GGTTACCGCG CTCCAGCAGA AGGTGCTCAG CCACGCGGCC TGGCAGTCGT TGATCGAGCG CGCTCGTCAC GTCGCGAAGC GCGGCGGCCG GGAATTCCTG CTCATCCGAT TTCCGTCGGA ACTGTGCAGC GACCGGGGAC GCGCGATCAA TGCGGCCGAT CCGAAGTGGC CCGAATCCTT GTGCGGCGAG GCCGCCGACG TGTTCGAACG CTGGCAACGG GAGTTGAAGC CGCAGGGTTT TGGCCTGACC GCTCAGATCC TTGACTTCCC AGGCGGCTTT CCTGGCGACG CAGGCCTCAC ACTAAGGTGG GGGCGTTGA
|
Protein sequence | MLARDVMTTD VSVVGPNSSS AEVARILLAT RVSALPVVDH DGAPIGVVSE WDLVGQHATD RVAKRERWLS HLAEGQPLAA DFLQSVDPTN RTTAEIMHQP VIAVPETTPI AEVARLIAEH RIKRVFVTRG DRLVGVVSRI DLVRAHLLEA TPVALHPRRV PVGEESSDVS TARSGPPPVP NPSTSAPTIA SGPSAAEFDL LVAAAEQAEQ MQRTAAERIA SDARRALVTA LQQKVLSHAA WQSLIERARH VAKRGGREFL LIRFPSELCS DRGRAINAAD PKWPESLCGE AADVFERWQR ELKPQGFGLT AQILDFPGGF PGDAGLTLRW GR
|
| |