Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1226 |
Symbol | |
ID | 3969103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1341269 |
End bp | 1343089 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637924337 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_531108 |
Protein GI | 90422738 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00014038 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACCAAG TTTCCCCGCC CGCTCCGCCC ACCCGATTCG GCCCCGAGAT CGACGACCAG GGGGTGCTGT TTCGGTTATG GGCGCCAAGC CAAAACCAGG TGACGGTGGT TCTTGAAGGC GGCAGTGCGC TGGCGATGAA CAAAACCGAC CTCGGCTGGC ACAGTTTGCT GGTTCAGGAG GCCGGCCCCG GGACCCGCTA TCGGTTTCGC CTGGCCGACG GCCTGGAGGT GCCCGACCCG GCGTCGCGTT ATCAACCGGA GGACGTTCAC GGTCCGAGCG AAGTGGTCGA TGCCATGGCG TTCGCCTGGT CCGACCACGG CTGGCGCGGC CGGCCCTGGG AAGAAGCGGT GGTCTACGAG CTGCACGTCG GCGCCTTCAC CGAACCAGGT ACCTTTGCCG CCGCGATCGA CCGGCTGGAT TATCTCGCCG AACTCGGCGT CACCGCGATC GAATTGATGC CGGTGGCGGA GTTTCCCGGC GCCCGCAATT GGGGCTACGA TGGCGTGCTG CTGTTTGCGC CCGACGCCAG CTACGGGCGG CCGGACGACA TGAAGGCGCT GGTCAACGCC GCCCATGCCA AGGGCATCAT GGTGTTTCTC GACGTGGTCT ATAACCATTT CGGGCCCGAC GGCAATTATC TCGCCGCCTA CGCGCCGATC TTCAACGACC AGCACCAGAC CCCGTGGGGC GCGGCGGTCA ATTACGACGC GGCCGGATCG GAGACGGTGC GGGAATTCGT GATCCAGAAC GCGATCTACT GGATCGACGA GTTTCACCTC GATGGACTGC GGTTCGACGC GGTGCATGCC ATCAAGGACG ACAGCGACCC GCATCTGTTG GCGGAGATCC CCTCGCGATT GCGCGCCAGC GGCATCAACC GGCCGATCCA TCTGATGCTG GAGAACGAGG AGAACGAGGT GGCGAGGCTG GCCCGCGACG TCGACGGCGA GCCGCAGCAA TACACCGCGC AGTGGAACGA CGACCTGCAC CACGTGCTGC ACACCGCCGC CAGCGGCGAA CGGTCGGGCT ACTACGCCGA ATATGCCGGC GACACCGAAA AACTCGGTCG CGCCTTGGCT GAGGGCTTCG CGTTCCAGGG CGACCATATG CGTTACCGCG ACCGCAGCCG CGGCGCGCCC AGCCGTCACT TGCCGCCCAC CGCGTTCGTC GGCTTCATCC AAAACCACGA CCAGATCGGC AATCGCGCCT TTGGCGAACG GCTGACCGCG TTCGCGCCGG CGGTCGCCGT GCAAGCGGTG GCGGCGGTCT ATCTACTGCT CCCGCAGATT CCGATGCTGT TCATGGGCGA GGAATTCGGC TCGTCGCGGC CGTTTCCGTT CTTCTGCGAT TTTTCCGGCG ATCTCGCCGA CGCGGTCCGC GAGGGACGCC GCAAGGAGTT CGCGCGGTTT CCGGAATTCG CCGATCCGCA GATGCGCGAA GCCATCCCCG ACCCCGGCGC CGCCGGGACC TTCGCGGCCG CAAAGCTCGA CTGGAACGAA CCCCCACGCG AGCTCCATGC GCAATGGCTA GAGTTTTATC GCGCCCTGCT GGCGATCCGG CATCGCGAGA TCGTGCCGAG GCTGCAAGCC ATCGGCGGCA ACGCCGGCCG GTTCAAGGTT CTCGGACGCC TGGCGGTGCA GGTCGAGTGG ACCCTGGCGG ATGGAGCACA TTTGACGCTG ATGGCTAACC TGACCGACCA GCCGCTCCCG GGGATCTCGC GGCCGAGCGG CCGCCGCTTA TGGCCGACCG CGCCCGTGGC GTCGTCGACG CTGGAGCCCT GGCAGGTGGT GTGGTCGATC GCCGATGCGG GTCAAGCCTG A
|
Protein sequence | MDQVSPPAPP TRFGPEIDDQ GVLFRLWAPS QNQVTVVLEG GSALAMNKTD LGWHSLLVQE AGPGTRYRFR LADGLEVPDP ASRYQPEDVH GPSEVVDAMA FAWSDHGWRG RPWEEAVVYE LHVGAFTEPG TFAAAIDRLD YLAELGVTAI ELMPVAEFPG ARNWGYDGVL LFAPDASYGR PDDMKALVNA AHAKGIMVFL DVVYNHFGPD GNYLAAYAPI FNDQHQTPWG AAVNYDAAGS ETVREFVIQN AIYWIDEFHL DGLRFDAVHA IKDDSDPHLL AEIPSRLRAS GINRPIHLML ENEENEVARL ARDVDGEPQQ YTAQWNDDLH HVLHTAASGE RSGYYAEYAG DTEKLGRALA EGFAFQGDHM RYRDRSRGAP SRHLPPTAFV GFIQNHDQIG NRAFGERLTA FAPAVAVQAV AAVYLLLPQI PMLFMGEEFG SSRPFPFFCD FSGDLADAVR EGRRKEFARF PEFADPQMRE AIPDPGAAGT FAAAKLDWNE PPRELHAQWL EFYRALLAIR HREIVPRLQA IGGNAGRFKV LGRLAVQVEW TLADGAHLTL MANLTDQPLP GISRPSGRRL WPTAPVASST LEPWQVVWSI ADAGQA
|
| |