Gene RPB_0635 details

Gene Information

Locus tag: RPB_0635
Symbol:
ID: 3908328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 720399
End bp: 721613
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 67%
IMG OID: 637882524
Product: tryptophan synthase subunit beta
Protein accession: YP_484257
Protein GI: 86747761
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
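
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 1215 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
length = gene_length_bp(720399, 721613)
residues = protein_length_aa(length)
print(length, residues)
```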


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.874907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0737735
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGA TCCTGCCGAA CTCGTTTCGA TCCGGTCCCG ACGAGCGCGG GCATTTCGGC 
ATCTTCGGCG GCCGCTTCGT CGCCGAGACG CTGATGCCGC TGATCCTCGC GCTGGAAAAG
GCCTACGCGG AAGCCAAGGA CGATCCGGCG TTCCGCGCCG AGATGGACGG CTACCTCAAG
CACTATGTCG GCCGGCCGTC GCCGCTGTAT TTCGCCGAGC GGCTGACCGA GCATTTCGGC
GGCGCCAAGA TCTACTTCAA GCGCGAGGAC CTCAACCACA CCGGCGCCCA CAAGGTGAAC
AACGTGCTCG GCCAGATCAT GCTGGCGCGG CGGATGGGCA AGCCGCGGAT CATCGCCGAA
ACCGGCGCCG GCATGCACGG CGTCGCCACC GCGACGATGT GCGCGAAATT CGGCCTGCAA
TGCGTCGTCT ATATGGGCGC GGTCGACGTC GACCGGCAGC AGCCCAACGT GCTGCGGATG
AAGGCGCTCG GCGCCGAAGT CCGCCCGGTG ACGTCCGGCG CCGCCACGCT CAAGGACGCG
ATGAACGAGG CGCTGCGCGA CTGGGTCACC AACGTCCACG ACACGTTCTA TTGCATCGGC
ACCGTCGCCG GCCCGCACCC CTATCCGATG ATGGTGCGCG ACTTCCAGGC GGTGATCGGC
CAGGAAGTGC GCGCGCAGAT CATGGAAGCC GAAGGCCGGC TGCCGGATTC GCTGATCGCC
TGCATCGGCG GCGGCTCCAA TGCGATGGGA CTGTTTCATC CCTTCCTGGA TGATAGCAGC
GTCGCGATCT ACGGCGTCGA GGCCGCGGGC CACGGCCTCA GCAAGCTGCA TGCGGCGTCG
ATCGCCGGCG GCAAGCCCGG CGTTCTGCAC GGCAACCGCA CCTATCTGCT GATGGACACC
GATGGCCAGA TCCAGGAAGC GCATTCGATC TCGGCCGGCC TCGACTATCC GGGCATCGGC
CCGGAACACG CCTGGCTGCA CGATGTCGGC CGCGTCGAGT TCATGTCCGC CACCGACACC
GAGGCGCTCG ACGCCTTCAA GCTGTGCTGC CGGCTGGAGG GCATCATCCC GGCGCTGGAG
CCGGCCCATG CGCTGGCGAA AGTCGGCGAC CTCGCCCCGC CCCTGCCGAA GGATCATGTG
ATGGTGCTCA ACATGTCGGG CCGCGGCGAC AAGGATCTCG CTTCGGTCGC CGAACATCTC
GGGGGCCAGT TCTGA
 
Protein sequence
MNQILPNSFR SGPDERGHFG IFGGRFVAET LMPLILALEK AYAEAKDDPA FRAEMDGYLK 
HYVGRPSPLY FAERLTEHFG GAKIYFKRED LNHTGAHKVN NVLGQIMLAR RMGKPRIIAE
TGAGMHGVAT ATMCAKFGLQ CVVYMGAVDV DRQQPNVLRM KALGAEVRPV TSGAATLKDA
MNEALRDWVT NVHDTFYCIG TVAGPHPYPM MVRDFQAVIG QEVRAQIMEA EGRLPDSLIA
CIGGGSNAMG LFHPFLDDSS VAIYGVEAAG HGLSKLHAAS IAGGKPGVLH GNRTYLLMDT
DGQIQEAHSI SAGLDYPGIG PEHAWLHDVG RVEFMSATDT EALDAFKLCC RLEGIIPALE
PAHALAKVGD LAPPLPKDHV MVLNMSGRGD KDLASVAEHL GGQF
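
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon-to-amino-acid mapping from the compact 64-letter string used for the standard genetic code, which translation table 11 shares for elongation codons. Table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle ambiguous bases.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAACCAGA TCCTGCCGAA CTCGTTTCGA"))
print(translate("GGGGGCCAGT TCTGA"))
```

The two calls reproduce the first ten residues (MNQILPNSFR) and the last four residues (GGQF) of the protein sequence listed above.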