Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4168 |
Symbol | |
ID | 6411852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4470404 |
End bp | 4473190 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714050 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_001993139 |
Protein GI | 192292534 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.487417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCCTG CGATACCCAC TGCCACCTAT CGCCTGCAAC TCACCGCTGA TTTTGGATTC GACGCCGCCA CGGCGATTGT CCCCTATCTG AAACGGCTCG GCATTTCGCA CGTCTATGCC TCGCCGTTCA TGAAAGCCCG CAAGGGCTCG ACCCACGGCT ACGACATCGT CGACCACACC AAGCTCAATC CTGAGCTCGG CGGCGAAGAG GGGTTCGCCC GCCTCAGCGC CGCACTGAAA CAGCACGACA TCGGCCTGAT CCTCGACTTC GTGCCCAATC ATGTCGGCGT GCACTATGCC GACAATCCGT GGTGGCTCGA CGTGCTGGAA TGGGGTCCGG CCTCGCCGCA CGCCGCATCG TTCGACATCG ATTGGGAGAT GCTGCCGTTC CGCGCCCGCG GCGGCGTGCT GCTGCCGATC ATCGGCTCGT CCTACGGCAA GACGCTGGAA GCCGGCGAGA TCGAGCTGCG CTATGACGCC GCCGACGGCA GCTTCTCGGC GTGGTACTAC GAACACCGGC TGCCGATCGC GCCGCAGCGC TACAGCGAGA TCCTGCGCAC CATCGTGCGC GAGGCCGGCG CGGAGAACAG CGACGCGGGT CGCGCGATCC TCGACCTCGC CGCGCGCTAC ACCGGACTGG GCCATCCGTC GCGAAAGGAA GCACCCGAGT TCAAGGCGGC GCTGCAAGCG ATCCCCGGCG CCGCCGAGGT AATCACGCGC GGGCTCAACG CTTACCGGGC CGGCGAGGGC CGCATCCAGC AGATCCAGGC GCTCCACAAT CTTTTGGAGC GCCAGCACTA CAAGCTCGGC CATTGGCAAC TGGCGTCGAG CGAGATCAAC TATCGCCGGT TCTTCGACGT CAACACCCTC GCCGGCTTGC GGGTCGAAGA CGCCGGCACC TTCGAGGCGA TCCACAGCCG GGTGAAGAAG CTGCTGGCCG ACGGCCAGTT GCAGGGCCTG CGCCTCGATC ACATCGATGG CCTGCGCGAT CCGGCGCAGT ACTTCCAGCG GCTGCGTCGG CTGGCGCGGG ACGCGCAGGG CGCGGGTGCT CCGCCGCTCT ACACCGTGAT CGAGAAGATC CTCGGCGAAG GCGAAGCGCT GCACCGCTTC GCCGGCGTCC ACGGCACCAC CGGCTATGAA TGGCTCAACG TCATCACCCG CGTGCTGCTC GACGGCCGCG GCCTGAAGCC GCTGGACGAA ACCTGGCGGC AGGCCAGCAA CCTGTCGCCC GCATTCGATC CGGTGCTCAA GGCCGCCAAG CGCCGCGTGC TGGAAACGCT TCTGCTCAGC GAATTCACCG TGCTGACACG TCTGCTAGCA CGGATCGCCT CCGGCCACTA CTCGACCCGG GATTTTTCCG CCGACAATCT GCGGCAGGTG TTCGAGCTCT ACGTGCTGCA CTTCCCGGTG TATCGCACCT ACCTGACCGG CAACAGCCCG ACCCAGCTCG ACCGCAAGCT GATCGAAGAC ACCATCGCCA AGGCGCGGGC CGACTGGTTC GGCGCCGACG ACGGCATCTT CGAATTCCTC AAGGATGTGC TGACGATGGA CCTGGTGAAG CCGGGCCGCG CGCTGCATTC CAAGCCGCGG GTGCGCCGGT TCGCGCTGAA GGTGCAGCAG TTCACCGGGC CGACCATGGC CAAGTCGCTG GAGGACACCG CGTTCTATCG CTATCACCGG CTGCTCGCGC TCAATGAAGT CGGCGGCGAT CCGGCCGCGC CGGAGATGCC GATCGCGGCG TTTCACGACG CGATGCAGAG CCGCGCCAAG GACTGGCCGC ACGGCATGAC CGCGACGATG ACGCACGATG CCAAGCGCGG CGAAGATGCG CGCGCGCGGC TGTTGTCGCT CGCCGAGATC CCCGGCGAGT GGGCGAGCGC GGTCGGCAAA TGGAAGCTGC TCAACGCGCC GCATCTGGTC GTCGACGGCG ACATGCGCGC GCCTTCGCCG GCGTTCGAAT ACATGCTGTA CCAGGCCCTG ATCGGTGCAT GGCCGCTGTC ACCCGATCCA GATTTCACCG ACCGCTTCCA GGGCTTCGCG CTGAAGGCAG CGCGCGAAGG CAAGCAGGAA ACTAACTGGC TCAACCCCAA CCTCGCCTAT GAGGAAGGCA TTCGCATCTT CATCGATCGC CTGCTCGATC CGAAGCTGTC GGGCCCGTTC CTGGAATCGG TCGACAGCCT GCACCGACGG CTGTCGCTGC TCGGCGCTTT GAACGGCCTC AGCCAGCTGA CGTTGAAGGC GACGATGCCC GGCGTCCCCG ATTTCTATCA GGGCACCGAG TTCTGGGACT TCTCGCTGGT CGATCCCGAC AACCGCCGCC CGGTTGATTT CGACGCGCGT GCAGCGGCGC TGGAGTCATT GGACGACAAG CCGGACTGGA AAGCCCTGAC TGCGAAGTGG AGCGACGGCC GCGTCAAGCT GGCCTGGACG CACCATCTAC TCAAGCTGCG CCGCGACCAC GCCGCGTTGT TCAGTGACGG CGACTACCGG CCGCTGGCAG TGAAGGGCGC ACACCGCGAC CACATCGTCG CCTTCGCCCG CACCAGCGGC AGCGAGGCGG TGATCGTCGT CGTGGCAAAG GGCCTCGCTG CGTTATCCGA TGAAGGCCGG CAATGGCCAA CCGGCGATGC GTTCGACGGC GCGATCGAGA CCAAAGGCTA CGCCGTGGAA ATCGGCGACG GCGAGACCAC GTCTGGCGAG TTGCAGCTCC GCGACCTGTT CCGCCACCTC CCTGTCTCCG TGCATCGCGC GCGTCTCACA GGCGCGAAGC GAGTCCGGCG GGGCTGA
|
Protein sequence | MPPAIPTATY RLQLTADFGF DAATAIVPYL KRLGISHVYA SPFMKARKGS THGYDIVDHT KLNPELGGEE GFARLSAALK QHDIGLILDF VPNHVGVHYA DNPWWLDVLE WGPASPHAAS FDIDWEMLPF RARGGVLLPI IGSSYGKTLE AGEIELRYDA ADGSFSAWYY EHRLPIAPQR YSEILRTIVR EAGAENSDAG RAILDLAARY TGLGHPSRKE APEFKAALQA IPGAAEVITR GLNAYRAGEG RIQQIQALHN LLERQHYKLG HWQLASSEIN YRRFFDVNTL AGLRVEDAGT FEAIHSRVKK LLADGQLQGL RLDHIDGLRD PAQYFQRLRR LARDAQGAGA PPLYTVIEKI LGEGEALHRF AGVHGTTGYE WLNVITRVLL DGRGLKPLDE TWRQASNLSP AFDPVLKAAK RRVLETLLLS EFTVLTRLLA RIASGHYSTR DFSADNLRQV FELYVLHFPV YRTYLTGNSP TQLDRKLIED TIAKARADWF GADDGIFEFL KDVLTMDLVK PGRALHSKPR VRRFALKVQQ FTGPTMAKSL EDTAFYRYHR LLALNEVGGD PAAPEMPIAA FHDAMQSRAK DWPHGMTATM THDAKRGEDA RARLLSLAEI PGEWASAVGK WKLLNAPHLV VDGDMRAPSP AFEYMLYQAL IGAWPLSPDP DFTDRFQGFA LKAAREGKQE TNWLNPNLAY EEGIRIFIDR LLDPKLSGPF LESVDSLHRR LSLLGALNGL SQLTLKATMP GVPDFYQGTE FWDFSLVDPD NRRPVDFDAR AAALESLDDK PDWKALTAKW SDGRVKLAWT HHLLKLRRDH AALFSDGDYR PLAVKGAHRD HIVAFARTSG SEAVIVVVAK GLAALSDEGR QWPTGDAFDG AIETKGYAVE IGDGETTSGE LQLRDLFRHL PVSVHRARLT GAKRVRRG
|
| |