Gene Information |
Locus tag | Rsph17029_1113 |
Symbol | |
ID | 4895159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1155161 |
End bp | 1156939 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111699 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_001042995 |
Protein GI | 126461881 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
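The coordinate and length fields in the table above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1155161
END_BP = 1156939


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1779 592
```

Under these assumptions the results agree with the Gene Length (1779 bp) and Protein Length (592 aa) fields listed for Rsph17029_1113.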
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.618647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.551148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACT GGGCCTGGGG ACCGCGGATC GAGGATGGCC TCGGCCGCTT CCGGCTCTGG GCGCCGTCGC AGGAGCGGCT GGTGCTGAGA CTAGACGGAA CGGATCACCC GATGACACGC AGCGACGACG GGTGGTTCGA GCTGCAGGTG CCCGCAGAGG CGGGCATGGA CTATGGCTTC GTCCTGGAGA GCGGGCAGGT GGTGCCCGAT CCGGCGGCAC GGGCGCAGGC GGGCGATGTG CACGGCCTGT CGCGTCTGGT CGCGCCGTCC TTCGACTGGC GGCACGACTG GACGGGGCGG CCCTGGGCCG AGACCGTGGT GATGGAGCTG CATATCGGCA CCTTCACCGA GGAAGGCACC TTCCGCGCCG CCATCGAGCA TCTGCCGCAT CTGGCCGAAA TCGGGATCAC GATGATCGAG CTGATGCCCG TGGCGCAGTT CGGCGGCAAC CGCGGCTGGG GCTACGACGG GGTGCTGCTC TACGCGCCCC ATCCCGCCTA TGGCACGCCC GAGGATCTGA AGGCTCTGGT CGACGCGGCC CACGGCCTCG GCATGAGCGT GGTGCTCGAT GTGGTCTACA ACCATTTCGG ACCGGACGGG AATTACCTTG GGGCCTATGC CGCGGACTTC TTCGACCCGG AACGGCACAC GCCCTGGGGC AGCGCCATCG CCTATCACCT GCCGCCCGTG CGCCGCTTCT TCCTCGACAA TGCGCTCTAC TGGCTGACCG AGTTCCGCTT CGACGGGCTG CGCATCGACG CGGCCGACCA TATCCGCGAT CCGGACTCGG ACCCCGAGGT GCTGGTGGAT CTGGCCCGCG CCATAAGGCA GCGCATCCCC GACCGGCCGA TCCATCTGAC CACCGAGGAC AACCGCAACA TCACGCGGTT GCACGAGCGG GGACCCGAGG GACAGGTGGT GCTGCACACG GCCGAATGGA ACGACGACCT GCACAATGTG GCCCATGTTC TGCTGACGGG CGAGACCGAG GGCTATTACT GCGACTTCGT GAAGGACCAC TGGCGCAAAT ATGCCCGCGC GCTGGCAGAA GGCTTCGTCT ATCAGGGCGA GCATTCCGAG CATGAGGGCG AGCCCCGCGG AAAGCCGTCG GGCCATCTGC CGCCGCTCGC CTTCGTCGAT TTCCTCCAGA ACCACGACCA GATCGGCAAC CGCGCCTTCG GCGAGCGGCT GACGACGCTC GCGCCCGAGG CGCGGCTGCG CGCGATGATG GCGGTCCTGC TGCTGTCGCC GCATGTGCCG CTGCTCTTCA TGGGCGAGGA ATGGGGCGAG TCGCGGCCCT TCACCTTCTT CACCGACTTT CACGGAGAGC TCGCCGATGC CGTCCGCAAC GGGCGGCGCA AGGAGTTCGC GCATTTCTCC GCCTTCCAGG GGCTGGATCT CGACCGGACG GTGCCCGATC CGAATGCCGA GGGCACGTTC CTCTCCTCGA AGCTCGACTG GCTGCACCGC GAGACCGAGC GCGGCCGGTC CTGGATGGCC TTCGTCAAGG ATCTGCTCGC CACCCGTGCC CGCGAGATCG CGCCACGGCT CGAGCGCGCA CCCGGAAACG GCGGGCGCAT CGTGGCGGTC TCGGACGATC TGGTGGCGGT CGACTGGCAG CTCGACGGGG CGGTCCTGCG CCTCCGCGCC AATTTCGACG ACCGGCCGCA GGACTTGCCG GAGGCCGGGG GCCGTGTGAT CCATGCCTCT CCCGGCACCG AGGCTGGCGG TCCGCTGCCG CCCTGTTCGG TGCTGGTCAC GCTCGAGGAG ACCGCATGA
|
Protein sequence | MNDWAWGPRI EDGLGRFRLW APSQERLVLR LDGTDHPMTR SDDGWFELQV PAEAGMDYGF VLESGQVVPD PAARAQAGDV HGLSRLVAPS FDWRHDWTGR PWAETVVMEL HIGTFTEEGT FRAAIEHLPH LAEIGITMIE LMPVAQFGGN RGWGYDGVLL YAPHPAYGTP EDLKALVDAA HGLGMSVVLD VVYNHFGPDG NYLGAYAADF FDPERHTPWG SAIAYHLPPV RRFFLDNALY WLTEFRFDGL RIDAADHIRD PDSDPEVLVD LARAIRQRIP DRPIHLTTED NRNITRLHER GPEGQVVLHT AEWNDDLHNV AHVLLTGETE GYYCDFVKDH WRKYARALAE GFVYQGEHSE HEGEPRGKPS GHLPPLAFVD FLQNHDQIGN RAFGERLTTL APEARLRAMM AVLLLSPHVP LLFMGEEWGE SRPFTFFTDF HGELADAVRN GRRKEFAHFS AFQGLDLDRT VPDPNAEGTF LSSKLDWLHR ETERGRSWMA FVKDLLATRA REIAPRLERA PGNGGRIVAV SDDLVAVDWQ LDGAVLRLRA NFDDRPQDLP EAGGRVIHAS PGTEAGGPLP PCSVLVTLEE TA
|
| |
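The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the sequence is given in space-separated blocks as shown above, and it builds the codon table by hand: translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), so a standard table suffices for a gene starting with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_30 = "ATGAACGACT GGGCCTGGGG ACCGCGGATC"
print(translate(first_30))  # MNDWAWGPRI
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would be one way to cross-check the GC content and Protein sequence fields.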