Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1879 |
Symbol | |
ID | 3908074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2142414 |
End bp | 2145200 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883773 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_485498 |
Protein GI | 86749002 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCCG CGATCCCGAC TGCGACTTAC CGTATCCAGC TCACCGCCGC TTTCGGCTTC GACGATGCGG CCGCGATCGT GCCGTATCTC AAGGCGCTCG GGATTTCGCA TCTCTACGCG TCGCCCTTCA CCAAGGCGCG CCGCGGATCG ACCCACGGCT ACGACATCGT CGATCACACC ACGCTCAACC CCGAACTCGG CGGCGAAGAG GCGTTCGCGC GACTGTCCGC GGCGCTGAAG AGCCACGATA TCGGCCTGAT CCTCGACTTC GTCCCCAACC ATGTCGGCGT TCACTTCGCC GACAATCCAT GGTGGCTGGA CGTTCTGGAA TGGGGCCCGG CATCGCCGCA TGCCGCCTCG TTCGACATCG ACTGGGAGAT GCTGCCGTTC CGCAACCGCG GCGGGGTGTT GCTGCCGATC ATCGGAACCT CCTACGGCAA GGCGCTGGAG AGCGGCGAGA TCGGGCTACG CTACGACGCC GGCGACGGCA GTTTCTCGGC TTGGTACTTC GAACACCGGT TGCCGATCGC GCCGCAGCGC TACAGCGAGA TCCTGCGTAC GATCGTACGC GAGGCCGATG CCACCGATCA TCCCGCCGGC AAGGCGATCC TCGCGCTCGC CGCGCGCTAT CGCGGATTGC GCCACCCGGA TCGCAAGGAA GCGCCGGACT TCAAGGCGGC GCTGAAGGCT GTTCCGGGCA GCGCCGACCT GATCGACAAG GGCCTCGCCG CCTATCGGGC CGGCGAAGGC CGCAATACGC AGATTCAGGC GCTGCACAAT CTGCTCGAAC GCCAGCACTA CAAGCTCGGC CATTGGCAAC TCGCCGCGAG CGAGATCAAC TATCGCCGCT TCTTCGACGT CAACACCCTC GCCGGCTTGC GCGTCGAGGA CGCCGGCACG TTCGAGGGGA TCCACACGCT GGTGAAGCGG CTGATCGCCA ACGGTCAGCT ACAGGGCCTG CGGCTCGACC ACATCGACGG CCTGCGCGAC CCTGCGCAAT ATTTCCAGCG CCTGCGCCGG CTCACCCGCG AGGCGCAGGG GCCTGCCGCG CCGCCGCTCT ACATGGTGAT CGAGAAGATC CTCGGCGACG GCGAGCCGTT GCGGCGCTTC GCCGGTGTCC ACGGCACCAC CGGCTACGAA TGGCTGAATG TCATCACCCA GGCGCTGGTC GACGGCGCAG GCCTGCAGCC GCTCGACGAG GTCTGGCGGC AGGTGAGCAA CACCTCGCCG GATTTTCCGC CGGTGCTGAT GCGCGCCAAG CGCCGCGTGC TGGAGACGCT GCTGCTCAGC GAATTCACCG TGCTGACGCG GCTGCTGGCC CGGATCGCCA GCGGGCACTA TTCGACGCGC GATTTCTCCG CCGACAATCT GCGGCAGGTG TTCGAACTCT ACGTGCTGCA CTTCCCGGTG TATCGCACCT ATATCAGCGG GTCCGGCCCG AACGGACCCG ATCGCGAACT GATCGCCCAG ACCATCGAGA AGGCGCGCGC CGACTGGTTC GGCGCGGACG ACGGCATTTT CGACTTCCTG CAGGACGCGC TGACGATGGA CCTGCTGAAG GGCCGGGCCG CGCACAGCAA GCCGCGGGTG CGCCGCTTCG CGCTCAAGGT CCAGCAATTC ACCGGGCCGA CCATGGCGAA GTCGCTAGAG GACACCGCCT TCTATCGCTA CCATCGCCTG CTCGCGCTCA ACGAGGTCGG CGGCGAACCC GCCGCGCACG CGCTGGCGCC GGATGCCTTC CATCAGCTGA TGACGCAGCG GGCGCAGGAC TGGCCGCACG GCATGACCGC GACCATGACC CACGACGCCA AGCGCGGCGA AGACGCGCGG ACGCGGCTGC TGGCGCTGGC GGAGATGCCG GGCGAATGGG CCAGCCTGGT CGCCAAATGG AAGCTGCTGA ACGCGGCCCA TCTGGTGACG GACGGCGCGA TGCGGGCCCC GTCGGCGACG TTCGAATACA TGCTGTATCA GGGCCTGCTC GGCGCCTGGC CGCTCGAACC CGACGCCGAC TTCACCGACC GGATTCAGGG CTACGCGCTG AAGGCCGCGC GCGAAGGCAA AGAAGAGACC AACTGGATCA ACCCGAACCT CGCTTATGAG GAAGGCATCC GCATCTTCGT CGACCGCATT CTCGACCCCG CCCAGTCCGG CGCGTTCCTC GACTCGCTGC AGCGCGCGTC CGAGCGCGTC TCCGTGATCG GCGCGCTGAA TTCGCTGAGC CAGGTCACGC TGAAGACGAT GATGCCCGGC GTCCCTGATT TGTATCAGGG CACGGAGTTC TGGGACTTCT CTCTGGTCGA TCCCGACAAT CGCCGCCCGG TCGATTTTGC CGCCCGCGAA AAGGCGCTCG CAGCGCTTGC CGAGCCGGAC TGGGACGCGC TGCTGCGCAA CTGGAGCGAC GGCCGCGTCA AGCTGGCCTG GACCCGGCAG TTGCTCGCGA TCCGCAACGA ACTCCGCAGC GTGTTCACCG ACGGGGATTA TCGGCCGCTC GCGATCTCGG GTCCGCATCG CGACCATGCG ATCGCTTTCG CCCGCACCCG CGGCGCTCAG GCCGTGATCG TGGTAGTTGG AAAGAACTTC GCACCGCTGT CGGACAACGG CCGGCGATGG CCGCGCGGCG ACGCGTTCGA CGCCACAGTG GATGTTTCCG GATTCACCGT CGAAGGCACG ACCGGAAGCG ACGTGAAGCT CTCCGAGCTG TTTCGCAATC TGCCGGTCGC GGTCCGTCAG GCGCGCCTGA GCAATCTCGC CGGCCCGGCT CGGCCACGCC GAAAGACAAG CGCCTGA
|
Protein sequence | MPPAIPTATY RIQLTAAFGF DDAAAIVPYL KALGISHLYA SPFTKARRGS THGYDIVDHT TLNPELGGEE AFARLSAALK SHDIGLILDF VPNHVGVHFA DNPWWLDVLE WGPASPHAAS FDIDWEMLPF RNRGGVLLPI IGTSYGKALE SGEIGLRYDA GDGSFSAWYF EHRLPIAPQR YSEILRTIVR EADATDHPAG KAILALAARY RGLRHPDRKE APDFKAALKA VPGSADLIDK GLAAYRAGEG RNTQIQALHN LLERQHYKLG HWQLAASEIN YRRFFDVNTL AGLRVEDAGT FEGIHTLVKR LIANGQLQGL RLDHIDGLRD PAQYFQRLRR LTREAQGPAA PPLYMVIEKI LGDGEPLRRF AGVHGTTGYE WLNVITQALV DGAGLQPLDE VWRQVSNTSP DFPPVLMRAK RRVLETLLLS EFTVLTRLLA RIASGHYSTR DFSADNLRQV FELYVLHFPV YRTYISGSGP NGPDRELIAQ TIEKARADWF GADDGIFDFL QDALTMDLLK GRAAHSKPRV RRFALKVQQF TGPTMAKSLE DTAFYRYHRL LALNEVGGEP AAHALAPDAF HQLMTQRAQD WPHGMTATMT HDAKRGEDAR TRLLALAEMP GEWASLVAKW KLLNAAHLVT DGAMRAPSAT FEYMLYQGLL GAWPLEPDAD FTDRIQGYAL KAAREGKEET NWINPNLAYE EGIRIFVDRI LDPAQSGAFL DSLQRASERV SVIGALNSLS QVTLKTMMPG VPDLYQGTEF WDFSLVDPDN RRPVDFAARE KALAALAEPD WDALLRNWSD GRVKLAWTRQ LLAIRNELRS VFTDGDYRPL AISGPHRDHA IAFARTRGAQ AVIVVVGKNF APLSDNGRRW PRGDAFDATV DVSGFTVEGT TGSDVKLSEL FRNLPVAVRQ ARLSNLAGPA RPRRKTSA
|
| |