Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1880 |
Symbol | |
ID | 3908075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2145200 |
End bp | 2146963 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883774 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_485499 |
Protein GI | 86749003 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGGGC GGAGCTTCGG CCCGCGGCTG ACCGAGAACG GCGCCGAATT CCGGCTGTGG GCTCCCAACG CAACGCGCGT CGACGTCGTG CTCGATCGTC CGCATGCGAT GCAACGCGAC AAGGACGGCT GGTTCAAGGT CGAGATCGAC GGGGCGCGCG GCGGCACGCG CTATCGCTTC CGCATCGACG ATACGACCGA CGTTCCTGAT CCCGCATCCG GCTTCCAGCC CGAGGATATT CAGGGCCCGA GCGAAGTGAT CGATCATGCG GCCTATCCGT GGCGCGCCGG CGAATGGCGT GGTCGGCCGT GGCACGAGAC GGTGCTGCTG GAAGCGCATG TCGGCAGCTT CACGCCGCAG GGGACGTTCC GCGCGATGAT CGACCGGCTC GATCATCTGG TCGACACCGG CGTGACCGCG CTGGAGCTGA TGCCGCTGGC GGATTTTCCC GGGCGCCGCA ATTGGGGTTA TGACGGCGTG CTGTGGTACG CACCCGACAG CGCCTATGGG CGGCCGGAGG ATCTCAAGGC GCTGATCGAC GCGGCGCACG AGCGCGGCCT GATGATGTTC CTCGACGTCG TCTACAATCA TTTCGGCCCC GAAGGAAACT ACATCGGCCA ATACGCGCCG CCGTTCTTCT CGGATTCGCA CACGCCGTGG GGCAACGGGA TCAATTACGA CGTCGAACAG GTCCGGGCCT TCGCGATCGA GAATGCCGTG TACTGGCTAC GCGAATATCA TTTCGACGGG TTGCGCCTCG ATGCCGTGCA CGCCATCCCG GATCAGGGCG AAATCCCGAT GCTGCACGAA CTGAGCCGGG AAGTCGGCAA GCTCGCCGCA GAGACCGGCC GCCACATCCA CCTGGTGCTC GAAAACGACG ACAACATCGC CGCCGTCCTC GATCCTGTCG TCGATCCCCC GCGCGGACAG TATCGCGCGC AATGGAACGA CGACTATCAC CACGCCTGGC ACGTCGCCCT GACCGGCGAG AAGCAGGGGT ACTATTCCGA CTATGCAGAC GCGCCGCTGA ACGCCATCGC CCGCGCGCTG GGCTCGGGCT TCGTCTATCA GGGCGAGCCG TCCGGGCATC GCGGCGGTCA GCCGCGCGGC GAGCCCAGCG GCAAGCTGCC GCCGCTCGCT TTCGTCAATT TCCTGCAGAA CCACGATCAG ATCGGCAATC GCGCGCTCGG CGACAGGCTG GAAAGCCTGG CGAAGCCGAA GGCGGTCGAA GCCGCGCTCG CGATCACGCT GCTGGCGCCG ACGATCCCGA TGTTGTTCAT GGGCGAGGAA TGGGGATCGC AGGCGCCGTT TCCGTTCTTC TGCGATTTCC ACGGCGAACT CGCCGATGCC GTCCGCCGGG GTCGACGCAA GGAATTCGCC GGCGCCTACG AGACATACGG CGACGAGGTC CCCGACCCGC TCGACGAATC GACCTTCAAG AGCGCGGTCA TCGACTGGAG CGAGCGCGAC GACGGCCGCG GCGCAGCGCG GCTGGCGCTC GTGAAGCGGC TGCTCGACAT CCGGCGCAAC ACTCTCGTGC CGCGCCTGCC AGGCGCCCGC TTCGGCAATG CCGAGATCGC CGAGGACGGC CTGCTCCGCG CCCGCTGGCG GCTCGGAGAC GGCGCCACGC TCAAGCTCGT CGCCAATCTG TCGGACCACG ACGTCGCCTT CAAGGCACCG CCCGATGGAA CTAATGTGTG GGGCGACGAT TGGAACGGGA TGATTCCGCC GTGGGCGGTG ACCTGGCGCC TCGAAGAGTC CTGA
|
Protein sequence | MKGRSFGPRL TENGAEFRLW APNATRVDVV LDRPHAMQRD KDGWFKVEID GARGGTRYRF RIDDTTDVPD PASGFQPEDI QGPSEVIDHA AYPWRAGEWR GRPWHETVLL EAHVGSFTPQ GTFRAMIDRL DHLVDTGVTA LELMPLADFP GRRNWGYDGV LWYAPDSAYG RPEDLKALID AAHERGLMMF LDVVYNHFGP EGNYIGQYAP PFFSDSHTPW GNGINYDVEQ VRAFAIENAV YWLREYHFDG LRLDAVHAIP DQGEIPMLHE LSREVGKLAA ETGRHIHLVL ENDDNIAAVL DPVVDPPRGQ YRAQWNDDYH HAWHVALTGE KQGYYSDYAD APLNAIARAL GSGFVYQGEP SGHRGGQPRG EPSGKLPPLA FVNFLQNHDQ IGNRALGDRL ESLAKPKAVE AALAITLLAP TIPMLFMGEE WGSQAPFPFF CDFHGELADA VRRGRRKEFA GAYETYGDEV PDPLDESTFK SAVIDWSERD DGRGAARLAL VKRLLDIRRN TLVPRLPGAR FGNAEIAEDG LLRARWRLGD GATLKLVANL SDHDVAFKAP PDGTNVWGDD WNGMIPPWAV TWRLEES
|
| |