Gene Information |
Locus tag | RPB_1886 |
Symbol | |
ID | 3908081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2158605 |
End bp | 2160410 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637883780 |
Product | glycoside hydrolase 15-like protein |
Protein accession | YP_485505 |
Protein GI | 86749009 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
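The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the listed values. The function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2158605, 2160410)
print(length_bp)                  # 1806
print(protein_length(length_bp))  # 601
```

Under these assumptions the result matches the record: 1806 bp is 602 codons, which gives 601 residues once the stop codon is excluded.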
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.177282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGGGTA GGATCGAAGA CTATGCGCTG ATCGGCGATT GCGAAACCTC GGCACTGGTC GGTCGCAATG GATCGATCGA CTGGCTGTGC TGGCCGTCGT TCGATTCCGA CGCCTGCTTC GCAGCTTTAT TGGGCGATGA AAATCACGGC CACTGGCAGA TCGGCCCGTC GCAGGAGCTC AGGACGACGA CGCGGCGCTA TCGCGGCGAC AGTCTGATCC TCGAGACGCA ATTCGAGACC GACACCGGAA CCGTCACGCT GATCGATTTC ATGCCGATCC GCGGCAAGGC GTCGGACATC GTGCGCCTGG TCCGCGGCGA CGCCGGAAAA GTCGAGATGC GCATGCATCT GGTGCTGCGC TTCGGCTTCG GCGTCAACAT TCCGTGGGTC AAGCGCGGCG AGAAGCCGGG TGAATTGCTG GCGATCTGCG GTCCGGACAT GACGGTGCTG CGCTCGCCCG TGCGCACCCA CGGCGAAGGC CTGACGACGG TCGCTGAATT CGTGGTGAGC GAAGGCGACA CGGTGCCGTT CGTGATGACC TACGGCGCGT CGCACCTGCC GTTGCCCGGC ACGATCGATC CGATCGCCGC ACTGCGCGAC ACCGAGGAGT TCTGGTCGGA CTGGACCGGC CAATGCACGT ACACCGGCGA GTATCGCGAT TTGGTGATGC GGTCGCTGAT CACGCTGAAG GCGCTGACCT TCGCGCCCAC CGGCGGCATC GTCGCGGCGC CGACGACGTC GCTGCCGGAG CAACTGGGCG GCGCGCGCAA TTGGGACTAT CGGTTCTGCT GGCTGCGCGA TGCGACCTTC ACGCTGTTCG CGCTGATGAA CAACGGCTAC ACCGCGGAGG CTGCGGCCTG GCACGGCTGG CTGCTGCGCG CAGCCGCCGG CGCGCCGGCC AAATTGCAGA TCATGTACTC GATCGACGGC CATCGCCGGC TGCTGGAATG GCAGGCCGAC TGGCTCCCCG GCTACGAGGG GGCGCAGCCG GTGCGGATCG GCAATGCGGC CCACGCGCAA CTGCAGCTCG ACGTCTACGG CGAGCTGATC GACGCCTTCC ACCAATGGCG GATGGCTGGG ATCGAACTCG ACGGCGATTC CTGGGCGATG GAATGCGCCG TACTCAAGCA TCTGTCGACG ATCTGGAGTC AGCCGGACAG CGGCATCTGG GAGCGCCGCG GCCCGGCCAA ACATTACGTC TTCTCGAAGG TGATGACCTG GGTCGCTTTC GATCGCGGCA TCAAAAGCGC GGAAAAATTC GGCCTCAAGG CGCCGCTGGC GGCATGGCGC GCGCTGCGCG ACGAGATTCA TCGCGACGTC TGCGCCAAGG GCTTCGATGC GAAACAGAAT GCCTTCGTCG ATCACTACGG CGCCGACGTG CTGGACGCCA GCGTGCTGTT GATCCCCGCG GTCGGCTTCC TGCCGCCGGA CGACCCGCGC GTGCGCGGCA CGGTCGCGGC GATCGAGGCC CACATGATTC ATGATGGATT CGTGCTGCGC CACGATCCGC GCGAAACCCC CGACGAACCG CTTCCGGTCG AAGGCGCGTT CCTCGCCTGC AGCCTGTGGC TGGCCGACGC CTATGTGTTC GACGGCAGGA TCGACCAAGC CAAGGTGCTG TTCGATCGCG TCGTCGGCAT CGCCAACGAC GTCGGCCTTC TCGCCGAGGA ATATGATTCC GTTGCCGGCC GGCAGACGGG CAATTTCCCG CAGGCGCTCA CTCACATCGC GCTGATCATC ACCGCCAATA ATCTCAGCGC GGCGAAGGCC GCAACCGGCA AGCCGGCGGT GCAGCGCTCG 
AAATAG
|
Protein sequence | MPGRIEDYAL IGDCETSALV GRNGSIDWLC WPSFDSDACF AALLGDENHG HWQIGPSQEL RTTTRRYRGD SLILETQFET DTGTVTLIDF MPIRGKASDI VRLVRGDAGK VEMRMHLVLR FGFGVNIPWV KRGEKPGELL AICGPDMTVL RSPVRTHGEG LTTVAEFVVS EGDTVPFVMT YGASHLPLPG TIDPIAALRD TEEFWSDWTG QCTYTGEYRD LVMRSLITLK ALTFAPTGGI VAAPTTSLPE QLGGARNWDY RFCWLRDATF TLFALMNNGY TAEAAAWHGW LLRAAAGAPA KLQIMYSIDG HRRLLEWQAD WLPGYEGAQP VRIGNAAHAQ LQLDVYGELI DAFHQWRMAG IELDGDSWAM ECAVLKHLST IWSQPDSGIW ERRGPAKHYV FSKVMTWVAF DRGIKSAEKF GLKAPLAAWR ALRDEIHRDV CAKGFDAKQN AFVDHYGADV LDASVLLIPA VGFLPPDDPR VRGTVAAIEA HMIHDGFVLR HDPRETPDEP LPVEGAFLAC SLWLADAYVF DGRIDQAKVL FDRVVGIAND VGLLAEEYDS VAGRQTGNFP QALTHIALII TANNLSAAKA ATGKPAVQRS K
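Both sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before the strings are used elsewhere. The following is a minimal sketch of that cleanup together with a GC-percentage helper. The function names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three display blocks of the gene sequence, as shown in the record.
sample = clean_sequence("TTGCCGGGTA GGATCGAAGA CTATGCGCTG")
print(len(sample))         # 30
print(gc_percent(sample))  # GC percentage of the 30-base sample only
```

Running `gc_percent` on the full cleaned gene sequence would give a value to compare with the GC content field. The sample here covers only 30 bases, so its value is not expected to equal the whole-gene figure.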
|
| |