Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3483 |
Symbol | |
ID | 4023997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3867244 |
End bp | 3870546 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637963687 |
Product | trehalose synthase-like |
Protein accession | YP_570607 |
Protein GI | 91977948 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTGA TGTCAACGAT CGACGCGACC AATTCCAAGC CCGAGCTCGA GGCGACCGAC GAGCTGTGGT ACAAAGACGC GATCATCTAC CAGCTCCACG TCAAGGCGTT CGCCGACAGC AACAATGACG GCATCGGCGA TTTCGCCGGC CTGACGGAGA AGCTGGACTA TCTGCAGGAC CTCGGCGTCA CCGCGCTGTG GCTGCTGCCG TTCTATCCCT CGCCGCAGCG CGACGACGGC TACGACATCG GCGATTACGG CTCGATCAAT CCCGACTTCG GCACGATGAA GGATTTCCGC CGCTTCATCG TCGAGGCGAA GAAGCGCAAT CTGCGCGTCA TCACCGAGCT GGTCATCAAC CACACGTCCG ACCAGCACGA CTGGTTCAAG CGCGCCCGGC GCAGCGAAAA GGGCTCGAGC GCACGCAACT GGTATGTCTG GAGCGACACC GACCAGAAAT ATCAGGGCAC CCGGATCATC TTCACCGACA CCGAGAAGTC GAACTGGACC TGGGATCCCG AGGCCGGCCA ATATTATTGG CACCGCTTCT TTTCGCATCA GCCCGACCTG AATTTCGACA ATCCGCATGT CGTCAGCGCC CTCGTCAAGG TGATGAAGCG CTGGCTCGAC ACCGGCGTCG ACGGCTTCCG GCTCGACGCA ATTCCCTATC TGTGCGAGCG CGACGGCACC AACAACGAGA ACCTTCCCGA GACCCACGCC GTCATCAAGC AGTTGCGCGC CGAGCTCGAT GCTTACGCCA AGGGCAAGGT GCTGCTGGCG GAGGCCAATC AATGGCCGGA GGACGTCCAG GAATATTTCG GCCATAGCGA CGAATGTCAC ATGGCCTATC ACTTCCCGCT GATGCCGCGG ATCTACATGG CGATCGCCCA GGAAGATCGC TTTCCGATCA CCGACATCAT GCGGCAGACA CCGGAAATCC CCGCCAACTG TCAATGGGCG ATGTTCCTGC GCAATCACGA CGAGCTGACG CTGGAAATGG TCACCGACGT CGAGCGCGAT TATCTGTGGA AGACCTATGC GGCCGATCCG CGCGCTCGCA TCAATGTCGG CATCCGCCGC CGCCTCGCGC CGCTGATGGA CAATGACCGC CGCAAGATCG AGCTGATGAA CTCGCTGCTG CTGTCGTTCC CCGGCACGCC GATCATCTAT TACGGCGACG AAATCGGCAT GGGCGACAAC ATCTATCTCG GCGACCGCAA CGGCGTCCGC ACGCCGATGC AATGGTCGTC GGATCGCAAT GGCGGCTTCT CACGCGCCGA CCCGGCGCGA CTCTATGCCC CGACCATCAT GGACCCGGTC TACGGCTACG CGTCGGTCAA TGTCGAGGCG CAGGCGCGCA GCCTCTCGTC GCTGCTCAGC GCGACCAAGC GGCTGATTTC GGTCCGCAAA TCCACCTTCG CCTTCGGCCG CGGCACGATG ACGTTCATCC GGCCCGCGAA CCGCTCGGTG CTCGCTTATG TCCGGCAGTA CGAAGACGAG GTGATCCTCT GCGTCGCCAA TCTGTCGCGT TCGGCGCAGG CCACCGAACT CGATCTGTCG CCGTGGAAGG ACCGCGTGCC GCAGGAAATG CTCGGACGCA CGCGTTTTCC GGCGATCGGC GAACTGCCTT ACATGATCAC GCTGGCGCCC TACGGCTTCT ACTGGTTCAA GCTGGAGGAG CGCGACACAT CCCAGCACGT CGCGCCCGCA GCCGCCGTGC CCGAGTTCGA AACCCTGGTG GTGCCTGTCG GCGCAACATG GATGTCGCTC GCCCGCACCC GCGGCGTGTT CGAGCGCGAT GTGCTGCCGG CGCATCTGTC GCGCACCCGG TGGTTTCCAG AGCGTTCGCC GCGTTCGATC CATCCGCGAG TCACCTCGGC CATTCCATTC TCGAACACCC ATGAGAACCG CCCGTGGCTG GCGTTCTTCG AGGCGACCGT GCGCGGCGTC GATGCCCGCT ATGTGCTGCC GATGCAGATC GACTGGGTGC GGTTCGATCG CGAGCGCTAC AATCCGCACG CCTTCGCCGC CGTGCGTCAG GGCGCACGCG AGGGCACGCT GCTCGATGTC GCCACCGACG TCGAATTCGT CACCCTGCTC CTCGACAATC TGCGCGAGTC GGTCGTGGTG GAGAATGAGG GCGTCCGACT GCAATTCCGC CCCGGCTCGC GTCTGGCCGA GAAGCCCGCC GTCGCGTATC ACGATATCCG CGCGGTCGAG ACCGAGCAGT CGAATTCCAC GGCGCTGGTC GGCGAACACT ACGTCATCAA GCTGTATCGC CGGCTGCAAA GCGGCATCAA TCCGGAAATC GAAATGGGCC GCTTCCTCAC CGAGGTCGCG GGCTATACCA ACACCCCGTC TCTGCTCGGC AGCGTCGAAC TGGTCGAGGG CGACAAGGTC AGCGCGATCG CGGTGGTGCA TGATTTCGTC GCCAACCAGG GCGACGGCTG GATCGTGACA TCGGGTTATC TCGACCGCTA TGTCGACGAC CAGCGGCTGC TGATCAATTC CGAAGAACAG CACGTCAGCG AGGAGCTGGC GCCGTACCGG CACTACATGC AGCAGACCGG CAAACGCGTC GCCGAAATGC ACATCGCGCT GTCAAGCCAT CCGGAGCAGC CCGACTTTGC TCCCGCGCCG ATCAGCAGCG ACGACGCCGA GCGCTGGACC GGTATCGTCA CCGAGAACGC CAAGCGGGTG CTCGACGAGT TGCAGCACAA GCGCGAGAGC CTGAAAGACG CCGAACGGGC CATAGTGGAC GAGCTGCTGG CGCAACGCGA CGGGATTTTC GAGCGGCTGC GCGGACTGTT CGGCAGCGAC GGCGGCCTGA ACATCCGCCA TCATGGCGAC TTCCATCTCG GCCAGATGCT GATCGTCAAG GACGACGTCT TCATCATCGA TTTCGAAGGC GAGCCGCGGC GGTCGCAGGC CGAGCGCCGC GCCAAGGCAC CCGCCGCGCG CGACGTCGCG GGGCTGGTGC GCTCGATCGA CTATTCCACC ACCGCAGCGC TGGAGCGCGC GCTGAAAGTC TCGACCGACG AAACCGGCAA GATCGCCGCG GCACTCGATG GCTGGCGGAT TCGGTCGACC GAAGCGTTCC TCACCGCCTA CCGCGAAACG ATGGGCGACA GCCTGGTCTG GCCCGCCGAT CGCGCCGCAG CCGATCGGTT ACTGGACTTT TTCCTGATCG AGAAGGCATT GTATGAGATC GAATACGAAC TCGCCCACCG TCCCGACTGG CTCCGCGTGC CGCTGGCTGG CATCCTTCGT ATCCTGACCC GGCAGCCCGA GGAGATTTCA TGA
|
Protein sequence | MNLMSTIDAT NSKPELEATD ELWYKDAIIY QLHVKAFADS NNDGIGDFAG LTEKLDYLQD LGVTALWLLP FYPSPQRDDG YDIGDYGSIN PDFGTMKDFR RFIVEAKKRN LRVITELVIN HTSDQHDWFK RARRSEKGSS ARNWYVWSDT DQKYQGTRII FTDTEKSNWT WDPEAGQYYW HRFFSHQPDL NFDNPHVVSA LVKVMKRWLD TGVDGFRLDA IPYLCERDGT NNENLPETHA VIKQLRAELD AYAKGKVLLA EANQWPEDVQ EYFGHSDECH MAYHFPLMPR IYMAIAQEDR FPITDIMRQT PEIPANCQWA MFLRNHDELT LEMVTDVERD YLWKTYAADP RARINVGIRR RLAPLMDNDR RKIELMNSLL LSFPGTPIIY YGDEIGMGDN IYLGDRNGVR TPMQWSSDRN GGFSRADPAR LYAPTIMDPV YGYASVNVEA QARSLSSLLS ATKRLISVRK STFAFGRGTM TFIRPANRSV LAYVRQYEDE VILCVANLSR SAQATELDLS PWKDRVPQEM LGRTRFPAIG ELPYMITLAP YGFYWFKLEE RDTSQHVAPA AAVPEFETLV VPVGATWMSL ARTRGVFERD VLPAHLSRTR WFPERSPRSI HPRVTSAIPF SNTHENRPWL AFFEATVRGV DARYVLPMQI DWVRFDRERY NPHAFAAVRQ GAREGTLLDV ATDVEFVTLL LDNLRESVVV ENEGVRLQFR PGSRLAEKPA VAYHDIRAVE TEQSNSTALV GEHYVIKLYR RLQSGINPEI EMGRFLTEVA GYTNTPSLLG SVELVEGDKV SAIAVVHDFV ANQGDGWIVT SGYLDRYVDD QRLLINSEEQ HVSEELAPYR HYMQQTGKRV AEMHIALSSH PEQPDFAPAP ISSDDAERWT GIVTENAKRV LDELQHKRES LKDAERAIVD ELLAQRDGIF ERLRGLFGSD GGLNIRHHGD FHLGQMLIVK DDVFIIDFEG EPRRSQAERR AKAPAARDVA GLVRSIDYST TAALERALKV STDETGKIAA ALDGWRIRST EAFLTAYRET MGDSLVWPAD RAAADRLLDF FLIEKALYEI EYELAHRPDW LRVPLAGILR ILTRQPEEIS
|
| |