Gene Information |
Locus tag | RPB_1883 |
Symbol | |
ID | 3908078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2151185 |
End bp | 2154487 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637883777 |
Product | trehalose synthase |
Protein accession | YP_485502 |
Protein GI | 86749006 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
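The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon (the gene sequence listed in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2151185
end_bp = 2154487
reported_gene_length = 3303      # bp
reported_protein_length = 1100   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # drop the stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 3303 1100
```

Both computed values agree with the reported Gene Length and Protein Length, which is consistent with the inclusive-coordinate assumption.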
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.12501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTGA TGTCGCCGAT CGACTCGACC GATTCCCGCG CCCAAGCCGA CGCCACCAAC GAGCTTTGGT ACAAGGACGC GATCATCTAC CAGCTCCACG TCAAGGCCTT CGCCGACAGC AACAATGACG GCATCGGCGA CTTCGCCGGC CTCACCGAGA AGCTGGATTA CCTCCAGGAC CTCGGCGTCA CCGCGCTGTG GCTGCTGCCG TTCTACCCCT CGCCGCAGCG CGACGACGGC TACGACATCG CCGACTACGG CTCGATCAAT CCGGACTTCG GCACGATGAA GGACTTCCGC CGCTTCATCG TCGAGGCCAA GAAGCGCAAT CTGCGCGTCA TCACCGAACT CGTCATCAAT CACACCTCCG ACCAGCACGA CTGGTTCAAG CGGGCGCGAC GCAGCGGCAA GGGCTCCAGC GCACGCGACT GGTACGTCTG GAGCGACAGC GACCAGAAAT ATCAGGGCAC CCGGATCATC TTCACCGACA CCGAGAAGTC GAACTGGACC TGGGATCCGG AAGCCGGCCA GTACTACTGG CACCGCTTTT TCTCGCACCA GCCCGACCTC AACTTCGACA ATCCGCACGT CGTCGGCGCG GTCGTCAAGG TGATGAAGCG CTGGCTCGAT ACCGGCGTCG ACGGCTTCCG ACTCGATGCG ATTCCCTATC TGTGCGAGCG CGACGGCACC AACAACGAAA ATCTCCCCGA GACCCACGCC GTCATCAAGA CGCTGCGCGC GGAGCTCGAC GCCTACGCCA AGGGCAAGGT GCTGCTGGCC GAGGCCAATC AGTGGCCGGA GGACGTGCAG GAATATTTCG GCGACAGTGA CGAGTGCCAC ATGGCCTATC ACTTCCCGCT GATGCCGCGG ATCTACATGG CGATCGCCCA GGAGGATCGC TTCCCGATCA CCGACATCAT GCGGCAGACC CCGGAAATTC CCGCGAACTG CCAGTGGGCG ATGTTCCTGC GCAACCACGA CGAGCTGACG CTTGAAATGG TCACCGACGT CGAGCGCGAC TATCTGTGGA CCACCTACGC GGCCGATCCG CGCGCGCGCA TCAACGTCGG CATTCGCCGC AGGCTCGCGC CGCTGATGGA CAACGACCGC CGCAAGATCG AGCTGATGAA TTCGCTGCTG CTGTCGTTTC CCGGCACGCC GATCATCTAC TACGGCGACG AAATCGGGAT GGGCGACAAC ATCTATCTCG GCGACCGCAA CGGCGTGCGC ACGCCGATGC AATGGTCGTC GGATCGCAAC GGCGGCTTCT CGCGAGCCGA TCCGGCGCGG CTCTACGCCC CGCCGATCAT GGACCCGGTC TACGGCTATG CTTCGGTCAA CGTCGAGGCC CAGGCGCGCA GCCTGTCGTC GCTGTTGAGC GCCACCAAGC GGCTGATCTC GGTCCGCAAA TCCACCCTCG CCTTCGGGCG TGGCACGATG ACCTTCATCA GGCCGGTGAA CCGTTCGGTG CTGTCCTATG TCCGGCAGTA CGAGGACGAG GTGATCCTCT GCGTCGCCAA TCTGTCGCGC TCGGCGCAGG CCACCGAGCT CGACCTGTCG CCGTGGAAGG ATCGCGTGCC GCAGGAGATG CTCGGCCGCA CCAAATTTCC GGCGATCGGC GAACTGCCCT ATATGATCAC GCTCGCGCCC TACGGCTTCT ATTGGTTCAA GCTCGAGGAG CGCGACACAT CTGAGCACGT CGCGCCCGCC GCGACGGTGC CTGAGTTCGA GACCCTGGTG GTGCCGCTGG GCTCGACCTG GATGACACTG GCGCGGACCC GCGGCGTGTT CGAGCGCGAC 
GTGCTGCCGG CCTATCTGTC GCGAACCCGA TGGTTTCCGG AACGTTCGCC GCGCGCGATC CAGCCGCATT TGACCTCGGC GATCCCCTTC TCGATCACGC ATGACAACCG GCCCTGGCTG ACGTTCTTCG AAGCCACCGT GCGCGGCGTA AACACCCGCT ACGTGCTGCC GATGCAGATC GACTGGGTCC GCTTCGATCG CGAGCGCTAC AATCCGCGCG CCTTCGCGGC GGTCCGCCAG GGCGCGCGCG AAGGAACGCT GCTCGACGTC GCCGCCGACA CCGAATTCAC CACGCTGCTG CTCGACAATC TGCGTGAATC GCTCGTCGTC GAGAACGACG GCGACCGGCT GGAATTCAGG CCCGGCTCCC GACTCGCCGA CAAGCCGGCC GGTCCCTACA ACCACATTCG CGCGGTGGAC ACCGAGCAGT CGAACTCGAC GGCGCTGGTC GACGAGAGTT ACGTCGTCAA GCTGTATCGC CGGCTCGAGA GCGGCATCAA TCCCGAGATC GAGATGGGCC GCTTCCTCTC CGAGGTCGCC GGCTATTCCA ACACCCCGTC GTTGCTCGGC AGTGTCGAAC TGGTCGAGGG CGACAAGGTC AGCGCGATCG CGGTGGTGCA CGATTTCGTC GCCAATCAGG GCGACGGCTG GACCGTGACG TCCGGCTATC TCGACCGCTA TGTCGACGAC CAGCGACTGC TGATCAATAC CGAGGAAGAT AGCGCCAGCG ACGAACTCGC GCCGTATCTG CGCTACATGC AGCAGACCGG CAAGCGCGTC GCCGAGATGC ACATCGCCCT CGCCGGCCAT CCCGAGGTCG ACGATTTCGC GCCGGTCCCG ATTGCGGACG ACGATGCGCG GAGTTGGACC GAGGCCGTGA CGGCCAACGC CGGACGCGTG CTCGACGAAC TGGCGCGGAA GCGCGACGGT CTCAGGGACG CCGACAGGGC CCTGATCGAC GATCTGCTGG CGCAGCGCAA CGGCCTGTCG GAGGGGCTCC GTGGCCTTTT CGGCAGTGCC GGCGGCCTGA AGATCCGGCA TCATGGCGAC TTCCACCTCG GCCAGATGCT GATCGTCAAG GACGACATCT TCATCATCGA CTTCGAAGGA GAACCGCGGC GGTCCCAGGC CGAGCGGCGG GCCAAGGCGC CGGCTGCGCG CGATGTCGCC GGACTGATCC GCTCGATCGA CTATTCCACG ACCGCGGCGC TGGAGCGCGC GCAGAAGGCG CTGGTGGACG AGTCCGGCAA GATCGCGGCC GCGCTCGATG TCTGGCGGAC GCGCTCGACG GAGGCGTTCC TGGCTGCCTA TCGCGAGACG ATGGCCGACA GCCCGGTGTG GCCCGTGGAT CGTGCGGCAG CCGATCAGAT CCTGGACTTC TTCCTGATCG AAAAGGCGCT ATACGAGATC GAATACGAAC TCGCCTATCG TCCCGATTGG CTCCGCGTGC CGCTGGCTGG CATTCTTCGC ATCCTGACTC GGCAGCCCGA GGAGAATTCA TGA
|
Protein sequence | MNVMSPIDST DSRAQADATN ELWYKDAIIY QLHVKAFADS NNDGIGDFAG LTEKLDYLQD LGVTALWLLP FYPSPQRDDG YDIADYGSIN PDFGTMKDFR RFIVEAKKRN LRVITELVIN HTSDQHDWFK RARRSGKGSS ARDWYVWSDS DQKYQGTRII FTDTEKSNWT WDPEAGQYYW HRFFSHQPDL NFDNPHVVGA VVKVMKRWLD TGVDGFRLDA IPYLCERDGT NNENLPETHA VIKTLRAELD AYAKGKVLLA EANQWPEDVQ EYFGDSDECH MAYHFPLMPR IYMAIAQEDR FPITDIMRQT PEIPANCQWA MFLRNHDELT LEMVTDVERD YLWTTYAADP RARINVGIRR RLAPLMDNDR RKIELMNSLL LSFPGTPIIY YGDEIGMGDN IYLGDRNGVR TPMQWSSDRN GGFSRADPAR LYAPPIMDPV YGYASVNVEA QARSLSSLLS ATKRLISVRK STLAFGRGTM TFIRPVNRSV LSYVRQYEDE VILCVANLSR SAQATELDLS PWKDRVPQEM LGRTKFPAIG ELPYMITLAP YGFYWFKLEE RDTSEHVAPA ATVPEFETLV VPLGSTWMTL ARTRGVFERD VLPAYLSRTR WFPERSPRAI QPHLTSAIPF SITHDNRPWL TFFEATVRGV NTRYVLPMQI DWVRFDRERY NPRAFAAVRQ GAREGTLLDV AADTEFTTLL LDNLRESLVV ENDGDRLEFR PGSRLADKPA GPYNHIRAVD TEQSNSTALV DESYVVKLYR RLESGINPEI EMGRFLSEVA GYSNTPSLLG SVELVEGDKV SAIAVVHDFV ANQGDGWTVT SGYLDRYVDD QRLLINTEED SASDELAPYL RYMQQTGKRV AEMHIALAGH PEVDDFAPVP IADDDARSWT EAVTANAGRV LDELARKRDG LRDADRALID DLLAQRNGLS EGLRGLFGSA GGLKIRHHGD FHLGQMLIVK DDIFIIDFEG EPRRSQAERR AKAPAARDVA GLIRSIDYST TAALERAQKA LVDESGKIAA ALDVWRTRST EAFLAAYRET MADSPVWPVD RAAADQILDF FLIEKALYEI EYELAYRPDW LRVPLAGILR ILTRQPEENS
|
| |
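As a rough consistency check between the gene and protein sequences, the first and last few codons can be translated by hand or with a short script. The sketch below is an assumption-laden illustration, not the method used to produce this record: it builds the codon table from the standard amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs only in its permitted start codons, which this sketch does not handle), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; alternative start
# codons are NOT handled here (assumption: the CDS starts with ATG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence listed in this record.
first_bases = "ATGAACGTGA TGTCGCCGAT CGACTCGACC"
last_bases = "GA GGAGAATTCA TGA"

print(translate(first_bases))  # MNVMSPIDST
print(translate(last_bases))   # EENS
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content; that result is not stated here because it has not been computed for this sketch.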