Gene RPB_1880 details

Gene Information

Locus tag: RPB_1880
Symbol:
ID: 3908075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2145200
End bp: 2146963
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 66%
IMG OID: 637883774
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_485499
Protein GI: 86749003
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
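
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 2145200
END_BP = 2146963
GENE_LENGTH_BP = 1764
PROTEIN_LENGTH_AA = 587


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1764
print(coding_codons(GENE_LENGTH_BP))   # 587
```

Under these assumptions both results agree with the recorded gene length (1764 bp) and protein length (587 aa).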


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGGGGC GGAGCTTCGG CCCGCGGCTG ACCGAGAACG GCGCCGAATT CCGGCTGTGG 
GCTCCCAACG CAACGCGCGT CGACGTCGTG CTCGATCGTC CGCATGCGAT GCAACGCGAC
AAGGACGGCT GGTTCAAGGT CGAGATCGAC GGGGCGCGCG GCGGCACGCG CTATCGCTTC
CGCATCGACG ATACGACCGA CGTTCCTGAT CCCGCATCCG GCTTCCAGCC CGAGGATATT
CAGGGCCCGA GCGAAGTGAT CGATCATGCG GCCTATCCGT GGCGCGCCGG CGAATGGCGT
GGTCGGCCGT GGCACGAGAC GGTGCTGCTG GAAGCGCATG TCGGCAGCTT CACGCCGCAG
GGGACGTTCC GCGCGATGAT CGACCGGCTC GATCATCTGG TCGACACCGG CGTGACCGCG
CTGGAGCTGA TGCCGCTGGC GGATTTTCCC GGGCGCCGCA ATTGGGGTTA TGACGGCGTG
CTGTGGTACG CACCCGACAG CGCCTATGGG CGGCCGGAGG ATCTCAAGGC GCTGATCGAC
GCGGCGCACG AGCGCGGCCT GATGATGTTC CTCGACGTCG TCTACAATCA TTTCGGCCCC
GAAGGAAACT ACATCGGCCA ATACGCGCCG CCGTTCTTCT CGGATTCGCA CACGCCGTGG
GGCAACGGGA TCAATTACGA CGTCGAACAG GTCCGGGCCT TCGCGATCGA GAATGCCGTG
TACTGGCTAC GCGAATATCA TTTCGACGGG TTGCGCCTCG ATGCCGTGCA CGCCATCCCG
GATCAGGGCG AAATCCCGAT GCTGCACGAA CTGAGCCGGG AAGTCGGCAA GCTCGCCGCA
GAGACCGGCC GCCACATCCA CCTGGTGCTC GAAAACGACG ACAACATCGC CGCCGTCCTC
GATCCTGTCG TCGATCCCCC GCGCGGACAG TATCGCGCGC AATGGAACGA CGACTATCAC
CACGCCTGGC ACGTCGCCCT GACCGGCGAG AAGCAGGGGT ACTATTCCGA CTATGCAGAC
GCGCCGCTGA ACGCCATCGC CCGCGCGCTG GGCTCGGGCT TCGTCTATCA GGGCGAGCCG
TCCGGGCATC GCGGCGGTCA GCCGCGCGGC GAGCCCAGCG GCAAGCTGCC GCCGCTCGCT
TTCGTCAATT TCCTGCAGAA CCACGATCAG ATCGGCAATC GCGCGCTCGG CGACAGGCTG
GAAAGCCTGG CGAAGCCGAA GGCGGTCGAA GCCGCGCTCG CGATCACGCT GCTGGCGCCG
ACGATCCCGA TGTTGTTCAT GGGCGAGGAA TGGGGATCGC AGGCGCCGTT TCCGTTCTTC
TGCGATTTCC ACGGCGAACT CGCCGATGCC GTCCGCCGGG GTCGACGCAA GGAATTCGCC
GGCGCCTACG AGACATACGG CGACGAGGTC CCCGACCCGC TCGACGAATC GACCTTCAAG
AGCGCGGTCA TCGACTGGAG CGAGCGCGAC GACGGCCGCG GCGCAGCGCG GCTGGCGCTC
GTGAAGCGGC TGCTCGACAT CCGGCGCAAC ACTCTCGTGC CGCGCCTGCC AGGCGCCCGC
TTCGGCAATG CCGAGATCGC CGAGGACGGC CTGCTCCGCG CCCGCTGGCG GCTCGGAGAC
GGCGCCACGC TCAAGCTCGT CGCCAATCTG TCGGACCACG ACGTCGCCTT CAAGGCACCG
CCCGATGGAA CTAATGTGTG GGGCGACGAT TGGAACGGGA TGATTCCGCC GTGGGCGGTG
ACCTGGCGCC TCGAAGAGTC CTGA
 
Protein sequence
MKGRSFGPRL TENGAEFRLW APNATRVDVV LDRPHAMQRD KDGWFKVEID GARGGTRYRF 
RIDDTTDVPD PASGFQPEDI QGPSEVIDHA AYPWRAGEWR GRPWHETVLL EAHVGSFTPQ
GTFRAMIDRL DHLVDTGVTA LELMPLADFP GRRNWGYDGV LWYAPDSAYG RPEDLKALID
AAHERGLMMF LDVVYNHFGP EGNYIGQYAP PFFSDSHTPW GNGINYDVEQ VRAFAIENAV
YWLREYHFDG LRLDAVHAIP DQGEIPMLHE LSREVGKLAA ETGRHIHLVL ENDDNIAAVL
DPVVDPPRGQ YRAQWNDDYH HAWHVALTGE KQGYYSDYAD APLNAIARAL GSGFVYQGEP
SGHRGGQPRG EPSGKLPPLA FVNFLQNHDQ IGNRALGDRL ESLAKPKAVE AALAITLLAP
TIPMLFMGEE WGSQAPFPFF CDFHGELADA VRRGRRKEFA GAYETYGDEV PDPLDESTFK
SAVIDWSERD DGRGAARLAL VKRLLDIRRN TLVPRLPGAR FGNAEIAEDG LLRARWRLGD
GATLKLVANL SDHDVAFKAP PDGTNVWGDD WNGMIPPWAV TWRLEES