Gene RPB_1883 details

Gene Information

Locus tag: RPB_1883
Symbol:
ID: 3908078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2151185
End bp: 2154487
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 65%
IMG OID: 637883777
Product: trehalose synthase
Protein accession: YP_485502
Protein GI: 86749006
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
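
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2151185
end_bp = 2154487
gene_length_bp = 3303
protein_length_aa = 1100

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values in the table.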


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.12501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTGA TGTCGCCGAT CGACTCGACC GATTCCCGCG CCCAAGCCGA CGCCACCAAC 
GAGCTTTGGT ACAAGGACGC GATCATCTAC CAGCTCCACG TCAAGGCCTT CGCCGACAGC
AACAATGACG GCATCGGCGA CTTCGCCGGC CTCACCGAGA AGCTGGATTA CCTCCAGGAC
CTCGGCGTCA CCGCGCTGTG GCTGCTGCCG TTCTACCCCT CGCCGCAGCG CGACGACGGC
TACGACATCG CCGACTACGG CTCGATCAAT CCGGACTTCG GCACGATGAA GGACTTCCGC
CGCTTCATCG TCGAGGCCAA GAAGCGCAAT CTGCGCGTCA TCACCGAACT CGTCATCAAT
CACACCTCCG ACCAGCACGA CTGGTTCAAG CGGGCGCGAC GCAGCGGCAA GGGCTCCAGC
GCACGCGACT GGTACGTCTG GAGCGACAGC GACCAGAAAT ATCAGGGCAC CCGGATCATC
TTCACCGACA CCGAGAAGTC GAACTGGACC TGGGATCCGG AAGCCGGCCA GTACTACTGG
CACCGCTTTT TCTCGCACCA GCCCGACCTC AACTTCGACA ATCCGCACGT CGTCGGCGCG
GTCGTCAAGG TGATGAAGCG CTGGCTCGAT ACCGGCGTCG ACGGCTTCCG ACTCGATGCG
ATTCCCTATC TGTGCGAGCG CGACGGCACC AACAACGAAA ATCTCCCCGA GACCCACGCC
GTCATCAAGA CGCTGCGCGC GGAGCTCGAC GCCTACGCCA AGGGCAAGGT GCTGCTGGCC
GAGGCCAATC AGTGGCCGGA GGACGTGCAG GAATATTTCG GCGACAGTGA CGAGTGCCAC
ATGGCCTATC ACTTCCCGCT GATGCCGCGG ATCTACATGG CGATCGCCCA GGAGGATCGC
TTCCCGATCA CCGACATCAT GCGGCAGACC CCGGAAATTC CCGCGAACTG CCAGTGGGCG
ATGTTCCTGC GCAACCACGA CGAGCTGACG CTTGAAATGG TCACCGACGT CGAGCGCGAC
TATCTGTGGA CCACCTACGC GGCCGATCCG CGCGCGCGCA TCAACGTCGG CATTCGCCGC
AGGCTCGCGC CGCTGATGGA CAACGACCGC CGCAAGATCG AGCTGATGAA TTCGCTGCTG
CTGTCGTTTC CCGGCACGCC GATCATCTAC TACGGCGACG AAATCGGGAT GGGCGACAAC
ATCTATCTCG GCGACCGCAA CGGCGTGCGC ACGCCGATGC AATGGTCGTC GGATCGCAAC
GGCGGCTTCT CGCGAGCCGA TCCGGCGCGG CTCTACGCCC CGCCGATCAT GGACCCGGTC
TACGGCTATG CTTCGGTCAA CGTCGAGGCC CAGGCGCGCA GCCTGTCGTC GCTGTTGAGC
GCCACCAAGC GGCTGATCTC GGTCCGCAAA TCCACCCTCG CCTTCGGGCG TGGCACGATG
ACCTTCATCA GGCCGGTGAA CCGTTCGGTG CTGTCCTATG TCCGGCAGTA CGAGGACGAG
GTGATCCTCT GCGTCGCCAA TCTGTCGCGC TCGGCGCAGG CCACCGAGCT CGACCTGTCG
CCGTGGAAGG ATCGCGTGCC GCAGGAGATG CTCGGCCGCA CCAAATTTCC GGCGATCGGC
GAACTGCCCT ATATGATCAC GCTCGCGCCC TACGGCTTCT ATTGGTTCAA GCTCGAGGAG
CGCGACACAT CTGAGCACGT CGCGCCCGCC GCGACGGTGC CTGAGTTCGA GACCCTGGTG
GTGCCGCTGG GCTCGACCTG GATGACACTG GCGCGGACCC GCGGCGTGTT CGAGCGCGAC
GTGCTGCCGG CCTATCTGTC GCGAACCCGA TGGTTTCCGG AACGTTCGCC GCGCGCGATC
CAGCCGCATT TGACCTCGGC GATCCCCTTC TCGATCACGC ATGACAACCG GCCCTGGCTG
ACGTTCTTCG AAGCCACCGT GCGCGGCGTA AACACCCGCT ACGTGCTGCC GATGCAGATC
GACTGGGTCC GCTTCGATCG CGAGCGCTAC AATCCGCGCG CCTTCGCGGC GGTCCGCCAG
GGCGCGCGCG AAGGAACGCT GCTCGACGTC GCCGCCGACA CCGAATTCAC CACGCTGCTG
CTCGACAATC TGCGTGAATC GCTCGTCGTC GAGAACGACG GCGACCGGCT GGAATTCAGG
CCCGGCTCCC GACTCGCCGA CAAGCCGGCC GGTCCCTACA ACCACATTCG CGCGGTGGAC
ACCGAGCAGT CGAACTCGAC GGCGCTGGTC GACGAGAGTT ACGTCGTCAA GCTGTATCGC
CGGCTCGAGA GCGGCATCAA TCCCGAGATC GAGATGGGCC GCTTCCTCTC CGAGGTCGCC
GGCTATTCCA ACACCCCGTC GTTGCTCGGC AGTGTCGAAC TGGTCGAGGG CGACAAGGTC
AGCGCGATCG CGGTGGTGCA CGATTTCGTC GCCAATCAGG GCGACGGCTG GACCGTGACG
TCCGGCTATC TCGACCGCTA TGTCGACGAC CAGCGACTGC TGATCAATAC CGAGGAAGAT
AGCGCCAGCG ACGAACTCGC GCCGTATCTG CGCTACATGC AGCAGACCGG CAAGCGCGTC
GCCGAGATGC ACATCGCCCT CGCCGGCCAT CCCGAGGTCG ACGATTTCGC GCCGGTCCCG
ATTGCGGACG ACGATGCGCG GAGTTGGACC GAGGCCGTGA CGGCCAACGC CGGACGCGTG
CTCGACGAAC TGGCGCGGAA GCGCGACGGT CTCAGGGACG CCGACAGGGC CCTGATCGAC
GATCTGCTGG CGCAGCGCAA CGGCCTGTCG GAGGGGCTCC GTGGCCTTTT CGGCAGTGCC
GGCGGCCTGA AGATCCGGCA TCATGGCGAC TTCCACCTCG GCCAGATGCT GATCGTCAAG
GACGACATCT TCATCATCGA CTTCGAAGGA GAACCGCGGC GGTCCCAGGC CGAGCGGCGG
GCCAAGGCGC CGGCTGCGCG CGATGTCGCC GGACTGATCC GCTCGATCGA CTATTCCACG
ACCGCGGCGC TGGAGCGCGC GCAGAAGGCG CTGGTGGACG AGTCCGGCAA GATCGCGGCC
GCGCTCGATG TCTGGCGGAC GCGCTCGACG GAGGCGTTCC TGGCTGCCTA TCGCGAGACG
ATGGCCGACA GCCCGGTGTG GCCCGTGGAT CGTGCGGCAG CCGATCAGAT CCTGGACTTC
TTCCTGATCG AAAAGGCGCT ATACGAGATC GAATACGAAC TCGCCTATCG TCCCGATTGG
CTCCGCGTGC CGCTGGCTGG CATTCTTCGC ATCCTGACTC GGCAGCCCGA GGAGAATTCA
TGA
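
The record reports a GC content of 65% for this gene. A value like this can be recomputed from the sequence as printed, in space-separated blocks of ten bases. The sketch below is one possible way to do it, assuming GC content is the share of G and C among all bases; `gc_content` is a hypothetical helper name, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
fragment = "ATGAACGTGA TGTCGCCGAT"
print(gc_content(fragment))
```

Passing the full gene sequence to the same function gives a figure that can be compared with the reported value.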
 
Protein sequence
MNVMSPIDST DSRAQADATN ELWYKDAIIY QLHVKAFADS NNDGIGDFAG LTEKLDYLQD 
LGVTALWLLP FYPSPQRDDG YDIADYGSIN PDFGTMKDFR RFIVEAKKRN LRVITELVIN
HTSDQHDWFK RARRSGKGSS ARDWYVWSDS DQKYQGTRII FTDTEKSNWT WDPEAGQYYW
HRFFSHQPDL NFDNPHVVGA VVKVMKRWLD TGVDGFRLDA IPYLCERDGT NNENLPETHA
VIKTLRAELD AYAKGKVLLA EANQWPEDVQ EYFGDSDECH MAYHFPLMPR IYMAIAQEDR
FPITDIMRQT PEIPANCQWA MFLRNHDELT LEMVTDVERD YLWTTYAADP RARINVGIRR
RLAPLMDNDR RKIELMNSLL LSFPGTPIIY YGDEIGMGDN IYLGDRNGVR TPMQWSSDRN
GGFSRADPAR LYAPPIMDPV YGYASVNVEA QARSLSSLLS ATKRLISVRK STLAFGRGTM
TFIRPVNRSV LSYVRQYEDE VILCVANLSR SAQATELDLS PWKDRVPQEM LGRTKFPAIG
ELPYMITLAP YGFYWFKLEE RDTSEHVAPA ATVPEFETLV VPLGSTWMTL ARTRGVFERD
VLPAYLSRTR WFPERSPRAI QPHLTSAIPF SITHDNRPWL TFFEATVRGV NTRYVLPMQI
DWVRFDRERY NPRAFAAVRQ GAREGTLLDV AADTEFTTLL LDNLRESLVV ENDGDRLEFR
PGSRLADKPA GPYNHIRAVD TEQSNSTALV DESYVVKLYR RLESGINPEI EMGRFLSEVA
GYSNTPSLLG SVELVEGDKV SAIAVVHDFV ANQGDGWTVT SGYLDRYVDD QRLLINTEED
SASDELAPYL RYMQQTGKRV AEMHIALAGH PEVDDFAPVP IADDDARSWT EAVTANAGRV
LDELARKRDG LRDADRALID DLLAQRNGLS EGLRGLFGSA GGLKIRHHGD FHLGQMLIVK
DDIFIIDFEG EPRRSQAERR AKAPAARDVA GLIRSIDYST TAALERAQKA LVDESGKIAA
ALDVWRTRST EAFLAAYRET MADSPVWPVD RAAADQILDF FLIEKALYEI EYELAYRPDW
LRVPLAGILR ILTRQPEENS
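
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this with a hand-built codon table, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the usual NCBI layout: bases ordered T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAACGTGA TGTCGCCGAT CGACTCGACC"))  # MNVMSPIDST
```

The output matches the first block of the protein sequence listed above.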