Gene RPB_1879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1879 
Symbol 
ID3908074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2142414 
End bp2145200 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content67% 
IMG OID637883773 
Productmalto-oligosyltrehalose synthase 
Protein accessionYP_485498 
Protein GI86749002 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCCCG CGATCCCGAC TGCGACTTAC CGTATCCAGC TCACCGCCGC TTTCGGCTTC 
GACGATGCGG CCGCGATCGT GCCGTATCTC AAGGCGCTCG GGATTTCGCA TCTCTACGCG
TCGCCCTTCA CCAAGGCGCG CCGCGGATCG ACCCACGGCT ACGACATCGT CGATCACACC
ACGCTCAACC CCGAACTCGG CGGCGAAGAG GCGTTCGCGC GACTGTCCGC GGCGCTGAAG
AGCCACGATA TCGGCCTGAT CCTCGACTTC GTCCCCAACC ATGTCGGCGT TCACTTCGCC
GACAATCCAT GGTGGCTGGA CGTTCTGGAA TGGGGCCCGG CATCGCCGCA TGCCGCCTCG
TTCGACATCG ACTGGGAGAT GCTGCCGTTC CGCAACCGCG GCGGGGTGTT GCTGCCGATC
ATCGGAACCT CCTACGGCAA GGCGCTGGAG AGCGGCGAGA TCGGGCTACG CTACGACGCC
GGCGACGGCA GTTTCTCGGC TTGGTACTTC GAACACCGGT TGCCGATCGC GCCGCAGCGC
TACAGCGAGA TCCTGCGTAC GATCGTACGC GAGGCCGATG CCACCGATCA TCCCGCCGGC
AAGGCGATCC TCGCGCTCGC CGCGCGCTAT CGCGGATTGC GCCACCCGGA TCGCAAGGAA
GCGCCGGACT TCAAGGCGGC GCTGAAGGCT GTTCCGGGCA GCGCCGACCT GATCGACAAG
GGCCTCGCCG CCTATCGGGC CGGCGAAGGC CGCAATACGC AGATTCAGGC GCTGCACAAT
CTGCTCGAAC GCCAGCACTA CAAGCTCGGC CATTGGCAAC TCGCCGCGAG CGAGATCAAC
TATCGCCGCT TCTTCGACGT CAACACCCTC GCCGGCTTGC GCGTCGAGGA CGCCGGCACG
TTCGAGGGGA TCCACACGCT GGTGAAGCGG CTGATCGCCA ACGGTCAGCT ACAGGGCCTG
CGGCTCGACC ACATCGACGG CCTGCGCGAC CCTGCGCAAT ATTTCCAGCG CCTGCGCCGG
CTCACCCGCG AGGCGCAGGG GCCTGCCGCG CCGCCGCTCT ACATGGTGAT CGAGAAGATC
CTCGGCGACG GCGAGCCGTT GCGGCGCTTC GCCGGTGTCC ACGGCACCAC CGGCTACGAA
TGGCTGAATG TCATCACCCA GGCGCTGGTC GACGGCGCAG GCCTGCAGCC GCTCGACGAG
GTCTGGCGGC AGGTGAGCAA CACCTCGCCG GATTTTCCGC CGGTGCTGAT GCGCGCCAAG
CGCCGCGTGC TGGAGACGCT GCTGCTCAGC GAATTCACCG TGCTGACGCG GCTGCTGGCC
CGGATCGCCA GCGGGCACTA TTCGACGCGC GATTTCTCCG CCGACAATCT GCGGCAGGTG
TTCGAACTCT ACGTGCTGCA CTTCCCGGTG TATCGCACCT ATATCAGCGG GTCCGGCCCG
AACGGACCCG ATCGCGAACT GATCGCCCAG ACCATCGAGA AGGCGCGCGC CGACTGGTTC
GGCGCGGACG ACGGCATTTT CGACTTCCTG CAGGACGCGC TGACGATGGA CCTGCTGAAG
GGCCGGGCCG CGCACAGCAA GCCGCGGGTG CGCCGCTTCG CGCTCAAGGT CCAGCAATTC
ACCGGGCCGA CCATGGCGAA GTCGCTAGAG GACACCGCCT TCTATCGCTA CCATCGCCTG
CTCGCGCTCA ACGAGGTCGG CGGCGAACCC GCCGCGCACG CGCTGGCGCC GGATGCCTTC
CATCAGCTGA TGACGCAGCG GGCGCAGGAC TGGCCGCACG GCATGACCGC GACCATGACC
CACGACGCCA AGCGCGGCGA AGACGCGCGG ACGCGGCTGC TGGCGCTGGC GGAGATGCCG
GGCGAATGGG CCAGCCTGGT CGCCAAATGG AAGCTGCTGA ACGCGGCCCA TCTGGTGACG
GACGGCGCGA TGCGGGCCCC GTCGGCGACG TTCGAATACA TGCTGTATCA GGGCCTGCTC
GGCGCCTGGC CGCTCGAACC CGACGCCGAC TTCACCGACC GGATTCAGGG CTACGCGCTG
AAGGCCGCGC GCGAAGGCAA AGAAGAGACC AACTGGATCA ACCCGAACCT CGCTTATGAG
GAAGGCATCC GCATCTTCGT CGACCGCATT CTCGACCCCG CCCAGTCCGG CGCGTTCCTC
GACTCGCTGC AGCGCGCGTC CGAGCGCGTC TCCGTGATCG GCGCGCTGAA TTCGCTGAGC
CAGGTCACGC TGAAGACGAT GATGCCCGGC GTCCCTGATT TGTATCAGGG CACGGAGTTC
TGGGACTTCT CTCTGGTCGA TCCCGACAAT CGCCGCCCGG TCGATTTTGC CGCCCGCGAA
AAGGCGCTCG CAGCGCTTGC CGAGCCGGAC TGGGACGCGC TGCTGCGCAA CTGGAGCGAC
GGCCGCGTCA AGCTGGCCTG GACCCGGCAG TTGCTCGCGA TCCGCAACGA ACTCCGCAGC
GTGTTCACCG ACGGGGATTA TCGGCCGCTC GCGATCTCGG GTCCGCATCG CGACCATGCG
ATCGCTTTCG CCCGCACCCG CGGCGCTCAG GCCGTGATCG TGGTAGTTGG AAAGAACTTC
GCACCGCTGT CGGACAACGG CCGGCGATGG CCGCGCGGCG ACGCGTTCGA CGCCACAGTG
GATGTTTCCG GATTCACCGT CGAAGGCACG ACCGGAAGCG ACGTGAAGCT CTCCGAGCTG
TTTCGCAATC TGCCGGTCGC GGTCCGTCAG GCGCGCCTGA GCAATCTCGC CGGCCCGGCT
CGGCCACGCC GAAAGACAAG CGCCTGA
 
Protein sequence
MPPAIPTATY RIQLTAAFGF DDAAAIVPYL KALGISHLYA SPFTKARRGS THGYDIVDHT 
TLNPELGGEE AFARLSAALK SHDIGLILDF VPNHVGVHFA DNPWWLDVLE WGPASPHAAS
FDIDWEMLPF RNRGGVLLPI IGTSYGKALE SGEIGLRYDA GDGSFSAWYF EHRLPIAPQR
YSEILRTIVR EADATDHPAG KAILALAARY RGLRHPDRKE APDFKAALKA VPGSADLIDK
GLAAYRAGEG RNTQIQALHN LLERQHYKLG HWQLAASEIN YRRFFDVNTL AGLRVEDAGT
FEGIHTLVKR LIANGQLQGL RLDHIDGLRD PAQYFQRLRR LTREAQGPAA PPLYMVIEKI
LGDGEPLRRF AGVHGTTGYE WLNVITQALV DGAGLQPLDE VWRQVSNTSP DFPPVLMRAK
RRVLETLLLS EFTVLTRLLA RIASGHYSTR DFSADNLRQV FELYVLHFPV YRTYISGSGP
NGPDRELIAQ TIEKARADWF GADDGIFDFL QDALTMDLLK GRAAHSKPRV RRFALKVQQF
TGPTMAKSLE DTAFYRYHRL LALNEVGGEP AAHALAPDAF HQLMTQRAQD WPHGMTATMT
HDAKRGEDAR TRLLALAEMP GEWASLVAKW KLLNAAHLVT DGAMRAPSAT FEYMLYQGLL
GAWPLEPDAD FTDRIQGYAL KAAREGKEET NWINPNLAYE EGIRIFVDRI LDPAQSGAFL
DSLQRASERV SVIGALNSLS QVTLKTMMPG VPDLYQGTEF WDFSLVDPDN RRPVDFAARE
KALAALAEPD WDALLRNWSD GRVKLAWTRQ LLAIRNELRS VFTDGDYRPL AISGPHRDHA
IAFARTRGAQ AVIVVVGKNF APLSDNGRRW PRGDAFDATV DVSGFTVEGT TGSDVKLSEL
FRNLPVAVRQ ARLSNLAGPA RPRRKTSA