Gene RPC_1943 details


Gene Information

Locus tag: RPC_1943
Symbol:
ID: 3973575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2113151
End bp: 2115403
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 60%
IMG OID: 637925054
Product: Alpha,alpha-trehalose-phosphate synthase (UDP-forming)
Protein accession: YP_531819
Protein GI: 90423449
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID:
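
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, and the sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive at both ends, which the record does not state but which is consistent with the listed length. It also assumes the CDS is complete and ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2113151
END_BP = 2115403


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2253 750"
```

Both results match the Gene Length (2253 bp) and Protein Length (750 aa) fields.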


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTACC GATACAAGAC AGCCGCGAGG TATCTTGGAG TTGGAGTGGC TTTGACGCTG 
TTACTCGGCG TCGCAATTGC CCCGTTTTCA AAGCGGCTGG TTGAACAGTG GTCGAGGAGC
GACGTCGAGG CGCGATCGCG ACTGGTCTAC AATGCCATCC AGGACCAAGT CATGCGCGCA
ATGGCTGACA ATGATAACGT GCAGCTTTCG GTTATCTTTG AAGCGATCGC GCTGGATCAA
CGGATCTTGG CGGTTGGACT CTGCAACGAC GCGGGCCGGT TGACGGCGCC GACCAAACTG
ATGCCGGCGA CGTTCTCGTG TGAAAAAGTC GCGCGGTCGG AAGCCGAGAG CTTCTCCAGC
ATCGCCAGCG ACGGCCGCCG CATCTTGGTC AGCTCGTTTC CGATCGTCGA TCGAAACCAC
AAATCCTATC TGGTCGTTCT TCATGATCTG AGCTTTATCG ATCAACGTTC GAGCCAGGCG
CAAGGGCTGA TGATCGCCGG CTTGATCGGT ATGACCATCG CCATCGCCGG CGGCGCGGCG
CTGTTCGTGG TCATGATGCT TCGGGGCTGG ATGCATTCGC TGCGACGCGC GATCGAGGAC
CTGCGATCGG GGGCGACATC GCCGACGGGA AGCGACCAAA CCGCGATCGG CCTGCAGATC
AAGAAGCTGC TCGGCGAGGT TGAGGTTGGG CAGAATTCAA TTCGTTCGGC GCAGATCGAT
TGGTCGCCGG CAACGTTGCA GGAGTTGCTG CGCACCACGC TGCCCGGCGC CGAAGTGTTG
ATCGTCTCGA ATCGCGAGCC CTACATCCAC AACCACGTCA GCGGTGGAAC GCACATCGCC
GTCCAAATCC CGGCGAGCGG CCTCGTCTCG GCGCTGGAGC CGGTGATCCG GGCGTGCGGC
GGCACCTGGA TCGCGCATGG CAGCGGCGAC GCCGACCGCG AAACCGTCGA TCACGACGAC
CGGGTCAGGG TGCCGCCCGA TCATCCAAGT TACACCTTGC GTCGGATCTG GATCTCCGAC
GAAGAGCAGG ATGGTTTTTA TTACGGGTTT TCCAACGAGG GGCTCTGGCC GCTGTGCCAC
ATCGCCTTCA TCAGACCGTC GTTTCGAGTA TCGGATTGGG AACAGTACCA GGCGATCAAT
GAACGGTTCG CGATGGCTGT CGTCGAGGAG GCAAAGTCGG ACGATCCGAT CGTTCTGGTG
CAGGACTATC ACTTCGCGTT AGCGCCACAG ATGATTCGCC GACGGTTGCC GAAGGCGACC
ATCGTCACCT TCTGGCATAT CCCGTGGCCC AACGCCGAGA CCTTCAGTAT CTGTCCTTGG
AAGGAGCAAA TCATCGAGGG TCTGCTCGGG AGCACGATTC TCGGCTTTCA CACCCAGTTT
CACTGCAACA ACTTCTTCGA AACGGTCGAC CGGTTTGTCG AAAGCCGGAT CGACCGCGAA
CATGCGACCG TCTCGCTCGG GGGCCACGAG ACGATGGTGC GCGCCTATCC GATTTCGATC
GAATGGCCGC CCGCGGCGTT GCAAGGCCAG CCCGCGATGG CGACTTGCCG CGAAGAGGTC
AGGCAGTCGC TCAACATCGC TGCCGACATG AAGCTGGCGG TCGGCATCGA GCGCTTCGAT
TATACCAAGG GCATCCTGGA TCGCATGAAT GCCATCGACG ATCTATTGTC GCGCTACCCG
GAGTGGAAGG GGCGGCTGGT CTTTGTTCAG GTTGCCGCCC CGACCCGCAG CAAACTCGCC
GCCTACAGCA CGCTGCAAGC CGATGCGGTG TCGCTCGCCG AATCGATCAA CCGTCGGCAC
GGCTCGGCTT CCTACACACC GATCCGGTTG TTGATCCGGC ATCATGGGGC AGATGAGGTG
TTCAAATTGT TCCGGGCGGC CGATGTCTGT ATCGTGAGCA GTCTGCACGA TGGCATGAAC
CTGGTCGCCA AGGAATTCGT TGCCGCCCGC GAAGATGAAA GTGGGGTGTT GTTGCTGTCG
AGTTTCACCG GAGCGTCGCG GGAGTTGTCC GAGGCGTTGA TCGTCAATCC CTACCACGTC
CACGAGATGT CTGGCGCGCT TGATGCCGCG CTCCGAATGC CGCTGCTCGA GCAACAGGAG
CGTATGCGGG TGATGCGACA GCAAATCAAG GAATGGAACG TCTATCGTTG GGCCGGCCGG
ATGCTGATCG ACGCCGCCAA CAAGAGGCGG CAGCAACGCA TCATGAACCT CGCCAATCTC
GGCCGGCTGC CATCGGCGGA CGCTCTCCGC TAA
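
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. A minimal sketch of that cleanup, with a GC-percentage helper of the kind that could reproduce the GC content field, might look like the following. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGTCGTACC GATACAAGAC AGCCGCGAGG TATCTTGGAG TTGGAGTGGC TTTGACGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Running `gc_percent` on the full cleaned sequence, rather than on one line, gives the value to compare against the GC content field.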
 
Protein sequence
MSYRYKTAAR YLGVGVALTL LLGVAIAPFS KRLVEQWSRS DVEARSRLVY NAIQDQVMRA 
MADNDNVQLS VIFEAIALDQ RILAVGLCND AGRLTAPTKL MPATFSCEKV ARSEAESFSS
IASDGRRILV SSFPIVDRNH KSYLVVLHDL SFIDQRSSQA QGLMIAGLIG MTIAIAGGAA
LFVVMMLRGW MHSLRRAIED LRSGATSPTG SDQTAIGLQI KKLLGEVEVG QNSIRSAQID
WSPATLQELL RTTLPGAEVL IVSNREPYIH NHVSGGTHIA VQIPASGLVS ALEPVIRACG
GTWIAHGSGD ADRETVDHDD RVRVPPDHPS YTLRRIWISD EEQDGFYYGF SNEGLWPLCH
IAFIRPSFRV SDWEQYQAIN ERFAMAVVEE AKSDDPIVLV QDYHFALAPQ MIRRRLPKAT
IVTFWHIPWP NAETFSICPW KEQIIEGLLG STILGFHTQF HCNNFFETVD RFVESRIDRE
HATVSLGGHE TMVRAYPISI EWPPAALQGQ PAMATCREEV RQSLNIAADM KLAVGIERFD
YTKGILDRMN AIDDLLSRYP EWKGRLVFVQ VAAPTRSKLA AYSTLQADAV SLAESINRRH
GSASYTPIRL LIRHHGADEV FKLFRAADVC IVSSLHDGMN LVAKEFVAAR EDESGVLLLS
SFTGASRELS EALIVNPYHV HEMSGALDAA LRMPLLEQQE RMRVMRQQIK EWNVYRWAGR
MLIDAANKRR QQRIMNLANL GRLPSADALR
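
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the standard NCBI base ordering (TCAG). Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons. Because this gene begins with ATG, the sketch does not handle alternative start codons. The `translate` function is an illustrative helper, not part of any library.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino-acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace from the 10-base block layout is removed first.
    Codons containing unexpected characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGTACC GATACAAGAC AGCCGCGAGG"))  # prints "MSYRYKTAAR"
```

The output matches the first block of the protein sequence. Translating the final line of the gene sequence (`GGCCGGCTGC CATCGGCGGA CGCTCTCCGC TAA`) in the same way gives `GRLPSADALR`, the last block of the protein, with the terminal TAA read as the stop codon.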