Gene Information |
Locus tag | RPC_2600 |
Symbol | |
ID | 3970709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 2825746 |
End bp | 2828505 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925711 |
Product | glycosyl transferase family protein |
Protein accession | YP_532469 |
Protein GI | 90424099 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5309] Exo-beta-1,3-glucanase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0728813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.398364 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCG TCGTCGCCGT TCTGCTCTTG GTCACCGCTG CTCATGCTGC CATCTGGGGG ATCTTCAGGG AACAGCAGCA GGCACCGGAT TTCCGAGGCA TCCTTCCCAG CGTTTCCTAC GCACCGTTTG ACGGCACCGG GCATCCGGAC GTCGACAACA TTCCCAATGC CGAGCGCATC CGCTCCGACC TGAAGACGCT GGCGCCACTC AGCCGCGCCA TCCGGTTGTA CTCCTCGACC GGCGGCGTCG AACTGGTGCC GCCGATCGCC GGCGAGGTCG GCCTCAAGGT CACCGTGGGC GCCTGGATCG ACAAGAACGT CGATCGCAAC GAGCGCGAGA TGCTGTCGGC GATCGACCTC GTCAAGCACA ACAGCAACGT CAACGGCATC GTGGTCGGCA ACGAAACCAT CTACCGCGGC GAGCAGAAGG TCGAAGACCT CATCAAGCTG ATCCAGCGCG TCAAGGGCCA AGTCAACGTC CCGGTGACCA CCGGCGAGAT CTGGAACATC TGGCTCGAGC ATCCGGAACT CGCCTCCTCG GTCGACTTCA TCGCCGCGCA CATCCTGCCC TATTGGGAAG GTTTTTCCGA CACCAAGGCG GTCGACCAGG CGCTGATCAT CTATCAGAAG CTGCGCGACG CGTTTCCCGG CAAGCGCATC GTGATCGCCG AATTCGGCTG GCCCTCCGCC GGCTACAATC TGAAGGACGC GGTGCCCGGC CCGTTCGAGC AGGCGGTGAC GCTGCGCAAT TTCGTCAACC GCGCCGAAGC CATCGGCATG GAATACAACA TCGTCGAGGC GATCGATCAG CCCTGGAAAT TCTTCGAAGG CGGCGTCGGT CCGTATTGGG GGATCCTGAA CGCGGCGCGC GAACCGAAAT TCGCCTGGAG CGGCCCGGTG GTCGATCCGG CCTATTGGAA GCTCGCCGGC ATCGCGCTGC TGATCGGCAT CCTGCTGTCG ACGCCGATCC TGCGGCTGGC CCAGCCCAGC GCGATGCAGT CGTTCATGCT GTCGGCGGCC GCGCACGGCG TCGGCGCCTG GGTCGCCACC GTATTCGCCT ACTGGAACGG GCATTATTTC GTGTTCGGCT CGGCGCTGGC GCTAACGCTG GGCTTGATCC TGCTGGTGCC GCTGGTGTGT ATCGCGATGG CGCGGATCGA GGAAATCGCC GCGGTTGCGT TCGGCCGCGC GCCGGTCCGG CTGTTGAAGA AGCCGCCGCC GGCGACCGTG GCGTCGCCGC CGCTGGCGCT GGCCGCGTCC TCCGCAGAGG CGGTCCGCGC CGATGCAGAC CAAGTCGGCG ACGTCGTCGC CGAAGCCCCC AAGATGCCGA AGGTGTCGAT CCATATCCCG GCCTATTTCG AGCCGCCGGA TATGTTGAAG CAGACGCTGG ATGCGGTGGC GCGGCTGGAT TATCCGAACT TCGAATGCGT GGTGATCATC AACAACACCC CGGATCCGGA ATTCACTCAG CCGATCCAGG ATCACTGTCG CGAGCTCGGC GAGCGCTTCA AATTCATCAA CGCCGAGAAG GTCGAGGGCT TCAAGGCCGG CGCGCTGCGC ATCGCCATGG AGCGCACCGC CGCCGACGCC GAGATCATCG GCATCATCGA CGCCGACTAT ATGGTCGAGC CGGACTGGCT GAAGGATCTG GTGCCGGCGT TCGATGACCC GCGGGTCGGG TTGGTGCAGG CACCGCAGGA CCATCGCGAC GGCGACCGTT CGCTGATGCA CTACATCATG AACGGCGAAT ATGCCGGGTT CTTCGACATC GGCATGGTGC AGCGCAACGA GCTCAATGCC 
ATCATCGTGC ACGGCACGAT GTGCCTGATC CGCCGCGCCG CGATGGAGAT GGTCGGCGGC TGGGCCGGCG ACACCATCTG CGAGGATAGC GACCTCGGCC TGGAAATCAT CGAGCATGGC TGGCTGACCC ATTACACCAA CCATCGCTAC GGCTATGGCC TGTTGCCGGA CACCTATGAG GCCTTCAAGA AGCAGCGGCA TCGCTGGGCC TATGGCGGCT TCCAGATCAT CAAGAAGCAT TGGCGGCGCT TCCTGCCGGG CGCCAGCCGG CTGACGCCGG ATCAACGGCG GGAATTCGCG CTGGGCTGGC TGAACTGGCT CGGCGCCGAG AGCCTCGGCG TGGTGGTGGC GATCCTCAAT CTGATCTGGG TTCCGATCGT CGCCTTCGCC GACATCGCGA TCCCCGACAA GATCCTGACG CTGCCGATCA TCGCCTCGTT CGTGGTGACG CTGGCGCACT TCCTGGTGCT GTACCGGCTG CGGGTGAAGA TCACCGTGCC GCGGATGCTG GGCGCGATGA TCGCGGCGAT GTCGGTGCAG TGGACGGTGT CGCGCGCGGT GGCGCAGGGC CTGATCACCG AGCACCTCGC CTTCGCCCGC ACCTCCAAGG GCGGCTTTTC GCTGATGTCG GTCGAGTTCC AGGCGTTCTG GGAGGCGGTG ATCGGCGTGC TGCTGCTGGT CGGCGCCGCC GTGCTGGTGG CCTCCAACGC CTATAAGGAA GTCCACGAGA TCTACATCTT CGCCGCGGTC TTGGTGCTGC AGAGCCTGCC GTTCCTGGCC GCGGTGGCGA TCGCGATCCT GGAGAACTCG CGGATCAATT CGTTTGCGTT CTGGAAGAAC ACCGGGGTGC GCACCGCCGA ATTGATCGGG CTGCGGCCGG TGGCGTTGCC CAAGACCGTG CCGGCGGCGA TGCCGGCGCC GCAGCCGGTG GTGTCGGAAA TCCACCGCGA TACCGTGTAA
|
Protein sequence | MRAVVAVLLL VTAAHAAIWG IFREQQQAPD FRGILPSVSY APFDGTGHPD VDNIPNAERI RSDLKTLAPL SRAIRLYSST GGVELVPPIA GEVGLKVTVG AWIDKNVDRN EREMLSAIDL VKHNSNVNGI VVGNETIYRG EQKVEDLIKL IQRVKGQVNV PVTTGEIWNI WLEHPELASS VDFIAAHILP YWEGFSDTKA VDQALIIYQK LRDAFPGKRI VIAEFGWPSA GYNLKDAVPG PFEQAVTLRN FVNRAEAIGM EYNIVEAIDQ PWKFFEGGVG PYWGILNAAR EPKFAWSGPV VDPAYWKLAG IALLIGILLS TPILRLAQPS AMQSFMLSAA AHGVGAWVAT VFAYWNGHYF VFGSALALTL GLILLVPLVC IAMARIEEIA AVAFGRAPVR LLKKPPPATV ASPPLALAAS SAEAVRADAD QVGDVVAEAP KMPKVSIHIP AYFEPPDMLK QTLDAVARLD YPNFECVVII NNTPDPEFTQ PIQDHCRELG ERFKFINAEK VEGFKAGALR IAMERTAADA EIIGIIDADY MVEPDWLKDL VPAFDDPRVG LVQAPQDHRD GDRSLMHYIM NGEYAGFFDI GMVQRNELNA IIVHGTMCLI RRAAMEMVGG WAGDTICEDS DLGLEIIEHG WLTHYTNHRY GYGLLPDTYE AFKKQRHRWA YGGFQIIKKH WRRFLPGASR LTPDQRREFA LGWLNWLGAE SLGVVVAILN LIWVPIVAFA DIAIPDKILT LPIIASFVVT LAHFLVLYRL RVKITVPRML GAMIAAMSVQ WTVSRAVAQG LITEHLAFAR TSKGGFSLMS VEFQAFWEAV IGVLLLVGAA VLVASNAYKE VHEIYIFAAV LVLQSLPFLA AVAIAILENS RINSFAFWKN TGVRTAELIG LRPVALPKTV PAAMPAPQPV VSEIHRDTV
|
| |
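The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration, assuming 1-based inclusive coordinates and a single terminal stop codon (the gene sequence above ends in TAA); the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Minimal sketch: check that the reported lengths agree with the coordinates.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with exactly one stop codon.

START_BP = 2825746  # "Start bp" from the record
END_BP = 2828505    # "End bp" from the record


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2760, matching "Gene Length"
print(protein_length(length_bp))  # 919, matching "Protein Length"
```

Under these assumptions both computed values agree with the "Gene Length" and "Protein Length" fields; the strand field ("-") does not affect the length arithmetic.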