Gene RPB_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1021 
Symbol 
ID3909145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1170697 
End bp1172913 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content66% 
IMG OID637882914 
ProductAlpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_484642 
Protein GI86748146 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCT ACATGGCAGC TCTGAAATAT CTCGGCCTCG GCCTGCTGGT GATTCTGCTG 
CTCGGATTGG CGATCGCGCC GTTTTCCAAA CGCGTCGTCG AGCAATGGTC GCGCAGCGAC
GTCGAGGGCC GCTCGCGGCT GGTCTACAAC GCGATCCAGG GTCAGATCGT CGGCGCGATG
GCCGACGACA ACGTCGTGCA GCTCTCGGTG AGTTTCGAGG CGGTCGCGCT CGATCAGCGC
ATTCTCGCGG TCGGCCTGTG CAGCAGTCAG GGCCAGCTGA TCGCGCCGAC CCGGTTGATG
CCGGCGACGT TTTCCTGCGA GAAGGTGGCC CGCTCCGAGG GCGAGAGTTT CTCGACCATC
GTGATCGATG GCCGCCGGGT GATGGTCGGC GCGTTCCCGA TCGTCGATCG CGCGCGCAAA
TCCTATCTGG TCATCCTGCA CGATCTCAGC TTCGTCGACG TCCGCTCCAG CGAGGCACAG
GGCTTCATGA TCGCGGCGCT GGTCGGCGTG GCGCTGATGA TCGCCGGCAC CGCGGCGATC
ATCGTCGTGA TGGTGCTGCG CGGCTGGATG GCGGCGCTGC GGCGGGCGGT CGACGACGTG
CGTTCCGGCG CGGCGCCGCC ATTGCGGCAA CGCGACCCGT CGGCGATCGA CCTGCAGATC
AAGAAACTGC TGAGCGAAGT CGAAACCGGC CGCCATTCGA TCAGTTCGGC GCAGATCGAC
TGGTCGCCGG CGACGCTGCA GCAATTGCTG CATTCGACCC TGCCGGACGC CGAAGTGCTG
ATCGTCTCCA ACCGCGAGCC CTACATCCAC AATCGCGACG GCGACCGCAC CGAGGTGCAG
ATCCCGGCGA GCGGCCTGGT GTCGGCGCTG GAACCTGTGA TGCGCGCCTG CGGCGGCACC
TGGATCGCGC ACGGCAGCGG CAATGCCGAC CGCGACACGG TGGATTCCCA CGACCGCATC
GAGGTGCCGC CGGATCATCC GTCCTACAGG CTGCGCCGGA TCTGGATCAC CGACGAGGAA
CAGGACGGCT TCTATTACGG CTTCGCCAAT GAGGGCATGT GGCCGCTGTG CCATATCGCC
TTCGTACGGC CGACGTTCCG GGAATCCGAC TGGAAGGCCT ATCAGCGGAT CAACGAGCGC
TTCGCCGCCG CCGTCGTCGA GGAGGCGAAG ACCGACAACC CGATCGTGCT GGTGCAGGAC
TATCACTTCG CGCTGGCGCC GCGGATGATC CGCGATCGTC TACCGAAGGC GACCATCATC
ACCTTCTGGC ACATCCCGTG GCCGAACGCC GAGACCTTCA GCATCTGTCC GTGGAAGGAG
CAGATCATCG ACGGCCTGCT CGGCTCCACC ATCCTCGGCT TCCACACCCA GTTCCACTGC
AACAATTTCT TCGAGACCGT CGATCGCTTC GTCGAGAGCC GGATCGACCG CGAGCACGCC
ACCGTGACGC TGTCGGGCCA CGAAACCATG GTGCGGGCCT ATCCGATCTC GATCGAATGG
CCGCCGGCGG CGCTGGACGG CCAGCCGCCG GTCGAAACCT GCCGCCGCGA GGTGCGCGAG
GCGCTCGGGC TCGCCGCCGA CGTCAGGATC GCGGTCGGCA TCGAGCGCTT CGACTACACC
AAGGGGATTC TCGACCGCAT GAAGGCGGTC GACGATCTGC TGACGCGGCA GCCGCAATGG
AAACGCCAGC TGGTGTTCGT CCAGGTCGCG GCGCCGACGC GCAGCAAATT GTCGAGCTAC
AGCACGCTGC AGGACGACGC GGTGGCGCTC GCCGACGACA TCAACCGGCG GCACGGATCG
GACGGCTACA AGCCGATCGT GCTGCTGATC CGGCATCACA GCGCGCGCGA GGTGTTCAAG
CTGTTCCGCG CCGCCGACGT CTGCATCGTC AGCAGCCTGC ACGACGGCAT GAACCTCGTC
GCCAAGGAAT TCGCCGCCGC CCGCGACGAC GAGCGCGGCG TGCTGGTGCT GTCGAGCTTC
ACCGGCGCCT CGCGCGAACT GTCCGAGGCG CTGATCGTCA ATCCGTATCA CGTCCACGAA
ACCGCGACCG CGCTCGACAC CGCGCTGCGG ATGCCGGAGC ACGAACAGCA GGAGCGGATG
CGCGCGATGC GTCAGCAGAT CCGCGAGTGG AACGTGTATC GCTGGGCCGG CCGGATGCTG
ATCGACGCCG CCACCAGCCG CCGCCGCCAG CGCATTCTCG ATCTCGCCGA GGGCTGA
 
Protein sequence
MPRYMAALKY LGLGLLVILL LGLAIAPFSK RVVEQWSRSD VEGRSRLVYN AIQGQIVGAM 
ADDNVVQLSV SFEAVALDQR ILAVGLCSSQ GQLIAPTRLM PATFSCEKVA RSEGESFSTI
VIDGRRVMVG AFPIVDRARK SYLVILHDLS FVDVRSSEAQ GFMIAALVGV ALMIAGTAAI
IVVMVLRGWM AALRRAVDDV RSGAAPPLRQ RDPSAIDLQI KKLLSEVETG RHSISSAQID
WSPATLQQLL HSTLPDAEVL IVSNREPYIH NRDGDRTEVQ IPASGLVSAL EPVMRACGGT
WIAHGSGNAD RDTVDSHDRI EVPPDHPSYR LRRIWITDEE QDGFYYGFAN EGMWPLCHIA
FVRPTFRESD WKAYQRINER FAAAVVEEAK TDNPIVLVQD YHFALAPRMI RDRLPKATII
TFWHIPWPNA ETFSICPWKE QIIDGLLGST ILGFHTQFHC NNFFETVDRF VESRIDREHA
TVTLSGHETM VRAYPISIEW PPAALDGQPP VETCRREVRE ALGLAADVRI AVGIERFDYT
KGILDRMKAV DDLLTRQPQW KRQLVFVQVA APTRSKLSSY STLQDDAVAL ADDINRRHGS
DGYKPIVLLI RHHSAREVFK LFRAADVCIV SSLHDGMNLV AKEFAAARDD ERGVLVLSSF
TGASRELSEA LIVNPYHVHE TATALDTALR MPEHEQQERM RAMRQQIREW NVYRWAGRML
IDAATSRRRQ RILDLAEG