Gene Information |
Locus tag | RPC_0471 |
Symbol | |
ID | 3970233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 507582 |
End bp | 508844 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637923587 |
Product | CBS |
Protein accession | YP_530365 |
Protein GI | 90421995 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
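The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 507582
end_bp = 508844

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1263 bp) and Protein Length (420 aa) fields.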
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.603908 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGGATC CCGACCCAGC GCAAGACAAT CCCGTGAGCG ACACGATGCC AAGTAGTAGC AGTCTTCCCG CGGTGGTGCA TCAGGGCGAC GTGATGCGGC CGGCCGCCGA CAATTGGCTG CTGCGGGCGA TCCGCACGCT GTTCGGCTGG AAGCCGGGCT CGGTGCGCGA GGACCTGCAG GTGGTACTCG ATGCTACCAC GCCGGACGAC ACCGGCTTCT CGGCGATCGA GCGCACCATG CTGCGCAACA TTCTCGGTCT GCACGAACGG CGGATCGCCG ACGTCATGGT GCACCGCGCC GACATCGTCG CGATCAAGCA GGACATCACG CTCGGCGAAC TGATGGGGCT GTTCGAGAGC GCCGCGCATT CCCGCCTGGT GGTTTACAAC GAAACCCTCG ACGATCCGGT CGGCATCGTC CACATCCGCG ATCTGTTGGC CTACATGACG GCGCGGGCGC GCGACGAACT ACCGAGCAAG AGCAAGGCGT CGAGCAAGGC GGCGGCCGCT GCGACCCCTG CCGCAGCCGT CAAGACTTCG CCGGCCAAGA CGTCGCCGGC CAAGACGTCG CCCGGCAAAT CCTCGCGCAA GAAACCCTCG CCCAATAGCC TCGATCTGCG CGCCATCGAC CTCAAGATCC CGCTCACCGA GACCGGGATC ATCCGCAAGC TGCTCTACGT GCCCCCCTCG ATGCGGGCGA TCGATCTGTT GGCGCAGATG CAGGCGTCGC GGATTCATCT GGCGCTGGTG GTCGACGAAT ATGGCGGCAC CGACGGCCTG GTTTCGATCG AAGACATCGT CGAGCAGATC GTCGGCGAGA TCGACGACGA GCACGACAGC GCCGAGCCGC CGTCGATCGT GCGGCAGGCC GACAACTCCT TCATCGCCGA CGCCCGCGCC AGCCTCGAGG ACGTCCGCCA GGTGATCGGC GAGGACTTCG TCACCGGCGA GGCCGGCGAG GAGGTCGAGA CGCTGGGCGG CTATCTGGTC ACCCATGTCG GACGCCTGCC GGTGCGCGGC GAAGTGATCT CCGGCCCCGG CAACTACGAG ATCGAAGTGC TCGACGCCGA CCCGCGCCGG GTCAAGCGGC TGCGCATCGG CGTCCGCAAG GAACGCCCGG CCCCGCGGCA ACGCGAATTG CGCCGGCGCG ACGCGCCGAA CGAGCCCGGT CCAGCGCAGG GCAACGAGCC CGGTCCGCCT CAGGGCAACG ACAATGCCAA TGTCGGCCCG GCCACGCCGG GCGACGGAGT CGGCTCGCCG TGA
Protein sequence | MPDPDPAQDN PVSDTMPSSS SLPAVVHQGD VMRPAADNWL LRAIRTLFGW KPGSVREDLQ VVLDATTPDD TGFSAIERTM LRNILGLHER RIADVMVHRA DIVAIKQDIT LGELMGLFES AAHSRLVVYN ETLDDPVGIV HIRDLLAYMT ARARDELPSK SKASSKAAAA ATPAAAVKTS PAKTSPAKTS PGKSSRKKPS PNSLDLRAID LKIPLTETGI IRKLLYVPPS MRAIDLLAQM QASRIHLALV VDEYGGTDGL VSIEDIVEQI VGEIDDEHDS AEPPSIVRQA DNSFIADARA SLEDVRQVIG EDFVTGEAGE EVETLGGYLV THVGRLPVRG EVISGPGNYE IEVLDADPRR VKRLRIGVRK ERPAPRQREL RRRDAPNEPG PAQGNEPGPP QGNDNANVGP ATPGDGVGSP
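The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before the text can be compared with the length and GC content fields. The following is a minimal sketch of that cleanup; the helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one assumed for translation table 11. The example input is only the first two blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """Check whether the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Excerpt only: the first two ten-base blocks of the gene sequence above.
excerpt = clean_sequence("ATGCCGGATC CCGACCCAGC")
excerpt_gc = gc_fraction(excerpt)
```

Applied to the full gene sequence, the cleaned string would be expected to have the length given in the Gene Length field and to end in a stop codon.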