Gene RPD_3483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3483 
Symbol 
ID4023997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3867244 
End bp3870546 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content63% 
IMG OID637963687 
Producttrehalose synthase-like 
Protein accessionYP_570607 
Protein GI91977948 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGA TGTCAACGAT CGACGCGACC AATTCCAAGC CCGAGCTCGA GGCGACCGAC 
GAGCTGTGGT ACAAAGACGC GATCATCTAC CAGCTCCACG TCAAGGCGTT CGCCGACAGC
AACAATGACG GCATCGGCGA TTTCGCCGGC CTGACGGAGA AGCTGGACTA TCTGCAGGAC
CTCGGCGTCA CCGCGCTGTG GCTGCTGCCG TTCTATCCCT CGCCGCAGCG CGACGACGGC
TACGACATCG GCGATTACGG CTCGATCAAT CCCGACTTCG GCACGATGAA GGATTTCCGC
CGCTTCATCG TCGAGGCGAA GAAGCGCAAT CTGCGCGTCA TCACCGAGCT GGTCATCAAC
CACACGTCCG ACCAGCACGA CTGGTTCAAG CGCGCCCGGC GCAGCGAAAA GGGCTCGAGC
GCACGCAACT GGTATGTCTG GAGCGACACC GACCAGAAAT ATCAGGGCAC CCGGATCATC
TTCACCGACA CCGAGAAGTC GAACTGGACC TGGGATCCCG AGGCCGGCCA ATATTATTGG
CACCGCTTCT TTTCGCATCA GCCCGACCTG AATTTCGACA ATCCGCATGT CGTCAGCGCC
CTCGTCAAGG TGATGAAGCG CTGGCTCGAC ACCGGCGTCG ACGGCTTCCG GCTCGACGCA
ATTCCCTATC TGTGCGAGCG CGACGGCACC AACAACGAGA ACCTTCCCGA GACCCACGCC
GTCATCAAGC AGTTGCGCGC CGAGCTCGAT GCTTACGCCA AGGGCAAGGT GCTGCTGGCG
GAGGCCAATC AATGGCCGGA GGACGTCCAG GAATATTTCG GCCATAGCGA CGAATGTCAC
ATGGCCTATC ACTTCCCGCT GATGCCGCGG ATCTACATGG CGATCGCCCA GGAAGATCGC
TTTCCGATCA CCGACATCAT GCGGCAGACA CCGGAAATCC CCGCCAACTG TCAATGGGCG
ATGTTCCTGC GCAATCACGA CGAGCTGACG CTGGAAATGG TCACCGACGT CGAGCGCGAT
TATCTGTGGA AGACCTATGC GGCCGATCCG CGCGCTCGCA TCAATGTCGG CATCCGCCGC
CGCCTCGCGC CGCTGATGGA CAATGACCGC CGCAAGATCG AGCTGATGAA CTCGCTGCTG
CTGTCGTTCC CCGGCACGCC GATCATCTAT TACGGCGACG AAATCGGCAT GGGCGACAAC
ATCTATCTCG GCGACCGCAA CGGCGTCCGC ACGCCGATGC AATGGTCGTC GGATCGCAAT
GGCGGCTTCT CACGCGCCGA CCCGGCGCGA CTCTATGCCC CGACCATCAT GGACCCGGTC
TACGGCTACG CGTCGGTCAA TGTCGAGGCG CAGGCGCGCA GCCTCTCGTC GCTGCTCAGC
GCGACCAAGC GGCTGATTTC GGTCCGCAAA TCCACCTTCG CCTTCGGCCG CGGCACGATG
ACGTTCATCC GGCCCGCGAA CCGCTCGGTG CTCGCTTATG TCCGGCAGTA CGAAGACGAG
GTGATCCTCT GCGTCGCCAA TCTGTCGCGT TCGGCGCAGG CCACCGAACT CGATCTGTCG
CCGTGGAAGG ACCGCGTGCC GCAGGAAATG CTCGGACGCA CGCGTTTTCC GGCGATCGGC
GAACTGCCTT ACATGATCAC GCTGGCGCCC TACGGCTTCT ACTGGTTCAA GCTGGAGGAG
CGCGACACAT CCCAGCACGT CGCGCCCGCA GCCGCCGTGC CCGAGTTCGA AACCCTGGTG
GTGCCTGTCG GCGCAACATG GATGTCGCTC GCCCGCACCC GCGGCGTGTT CGAGCGCGAT
GTGCTGCCGG CGCATCTGTC GCGCACCCGG TGGTTTCCAG AGCGTTCGCC GCGTTCGATC
CATCCGCGAG TCACCTCGGC CATTCCATTC TCGAACACCC ATGAGAACCG CCCGTGGCTG
GCGTTCTTCG AGGCGACCGT GCGCGGCGTC GATGCCCGCT ATGTGCTGCC GATGCAGATC
GACTGGGTGC GGTTCGATCG CGAGCGCTAC AATCCGCACG CCTTCGCCGC CGTGCGTCAG
GGCGCACGCG AGGGCACGCT GCTCGATGTC GCCACCGACG TCGAATTCGT CACCCTGCTC
CTCGACAATC TGCGCGAGTC GGTCGTGGTG GAGAATGAGG GCGTCCGACT GCAATTCCGC
CCCGGCTCGC GTCTGGCCGA GAAGCCCGCC GTCGCGTATC ACGATATCCG CGCGGTCGAG
ACCGAGCAGT CGAATTCCAC GGCGCTGGTC GGCGAACACT ACGTCATCAA GCTGTATCGC
CGGCTGCAAA GCGGCATCAA TCCGGAAATC GAAATGGGCC GCTTCCTCAC CGAGGTCGCG
GGCTATACCA ACACCCCGTC TCTGCTCGGC AGCGTCGAAC TGGTCGAGGG CGACAAGGTC
AGCGCGATCG CGGTGGTGCA TGATTTCGTC GCCAACCAGG GCGACGGCTG GATCGTGACA
TCGGGTTATC TCGACCGCTA TGTCGACGAC CAGCGGCTGC TGATCAATTC CGAAGAACAG
CACGTCAGCG AGGAGCTGGC GCCGTACCGG CACTACATGC AGCAGACCGG CAAACGCGTC
GCCGAAATGC ACATCGCGCT GTCAAGCCAT CCGGAGCAGC CCGACTTTGC TCCCGCGCCG
ATCAGCAGCG ACGACGCCGA GCGCTGGACC GGTATCGTCA CCGAGAACGC CAAGCGGGTG
CTCGACGAGT TGCAGCACAA GCGCGAGAGC CTGAAAGACG CCGAACGGGC CATAGTGGAC
GAGCTGCTGG CGCAACGCGA CGGGATTTTC GAGCGGCTGC GCGGACTGTT CGGCAGCGAC
GGCGGCCTGA ACATCCGCCA TCATGGCGAC TTCCATCTCG GCCAGATGCT GATCGTCAAG
GACGACGTCT TCATCATCGA TTTCGAAGGC GAGCCGCGGC GGTCGCAGGC CGAGCGCCGC
GCCAAGGCAC CCGCCGCGCG CGACGTCGCG GGGCTGGTGC GCTCGATCGA CTATTCCACC
ACCGCAGCGC TGGAGCGCGC GCTGAAAGTC TCGACCGACG AAACCGGCAA GATCGCCGCG
GCACTCGATG GCTGGCGGAT TCGGTCGACC GAAGCGTTCC TCACCGCCTA CCGCGAAACG
ATGGGCGACA GCCTGGTCTG GCCCGCCGAT CGCGCCGCAG CCGATCGGTT ACTGGACTTT
TTCCTGATCG AGAAGGCATT GTATGAGATC GAATACGAAC TCGCCCACCG TCCCGACTGG
CTCCGCGTGC CGCTGGCTGG CATCCTTCGT ATCCTGACCC GGCAGCCCGA GGAGATTTCA
TGA
 
Protein sequence
MNLMSTIDAT NSKPELEATD ELWYKDAIIY QLHVKAFADS NNDGIGDFAG LTEKLDYLQD 
LGVTALWLLP FYPSPQRDDG YDIGDYGSIN PDFGTMKDFR RFIVEAKKRN LRVITELVIN
HTSDQHDWFK RARRSEKGSS ARNWYVWSDT DQKYQGTRII FTDTEKSNWT WDPEAGQYYW
HRFFSHQPDL NFDNPHVVSA LVKVMKRWLD TGVDGFRLDA IPYLCERDGT NNENLPETHA
VIKQLRAELD AYAKGKVLLA EANQWPEDVQ EYFGHSDECH MAYHFPLMPR IYMAIAQEDR
FPITDIMRQT PEIPANCQWA MFLRNHDELT LEMVTDVERD YLWKTYAADP RARINVGIRR
RLAPLMDNDR RKIELMNSLL LSFPGTPIIY YGDEIGMGDN IYLGDRNGVR TPMQWSSDRN
GGFSRADPAR LYAPTIMDPV YGYASVNVEA QARSLSSLLS ATKRLISVRK STFAFGRGTM
TFIRPANRSV LAYVRQYEDE VILCVANLSR SAQATELDLS PWKDRVPQEM LGRTRFPAIG
ELPYMITLAP YGFYWFKLEE RDTSQHVAPA AAVPEFETLV VPVGATWMSL ARTRGVFERD
VLPAHLSRTR WFPERSPRSI HPRVTSAIPF SNTHENRPWL AFFEATVRGV DARYVLPMQI
DWVRFDRERY NPHAFAAVRQ GAREGTLLDV ATDVEFVTLL LDNLRESVVV ENEGVRLQFR
PGSRLAEKPA VAYHDIRAVE TEQSNSTALV GEHYVIKLYR RLQSGINPEI EMGRFLTEVA
GYTNTPSLLG SVELVEGDKV SAIAVVHDFV ANQGDGWIVT SGYLDRYVDD QRLLINSEEQ
HVSEELAPYR HYMQQTGKRV AEMHIALSSH PEQPDFAPAP ISSDDAERWT GIVTENAKRV
LDELQHKRES LKDAERAIVD ELLAQRDGIF ERLRGLFGSD GGLNIRHHGD FHLGQMLIVK
DDVFIIDFEG EPRRSQAERR AKAPAARDVA GLVRSIDYST TAALERALKV STDETGKIAA
ALDGWRIRST EAFLTAYRET MGDSLVWPAD RAAADRLLDF FLIEKALYEI EYELAHRPDW
LRVPLAGILR ILTRQPEEIS