Gene Rpal_0072 details

Gene Information

Locus tag: Rpal_0072
Symbol:
ID: 6407715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 74866
End bp: 76080
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 66%
IMG OID: 642709981
Product: tryptophan synthase subunit beta
Protein accession: YP_001989110
Protein GI: 192288505
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
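
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive (an assumption that matches the numbers in this record but is not stated on the page). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 74866
end_bp = 76080
gene_length_bp = 1215
protein_length_aa = 404

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final (stop)
# codon is not translated, so residues = codons - 1.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span == gene_length_bp)         # span agrees with "Gene Length"
print(remainder == 0)                 # length is a multiple of three
print(residues == protein_length_aa)  # agrees with "Protein Length"
```

Under that assumption, the three fields (coordinates, gene length, protein length) agree with one another.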


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00296113
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAAG CTTTGCCGAA TTCCTTCCGG TCCGGCCCCG ACGAGCGCGG GCATTTCGGT 
ATCTATGGCG GCCGCTTCGT CGCCGAGACG CTGATGCCGC TGATCCTCGA TCTGGAAAAG
GCCTATGCGG AAGCCAAGGC CGACCCGGCG TTCCGCGCCG AGATGGACAA CCATCTCAAG
CACTATGTCG GACGTCCGTC GGCTTTGTAT TTCGCCGAGC GGCTGACCGA GCATTTCGGC
GGCGCCAAGA TCTACTTCAA GCGCGAAGAT CTCAATCACA CCGGCGCTCA CAAGGTGAAC
AACGTGCTCG GCCAGATCAT GCTGGCCAAG CGCATGGGCA AGCCGCGGGT GATCGCCGAG
ACCGGCGCCG GCATGCACGG CGTCGCCACC GCGACGATGT GCGCCAAATT CGGCCTCGAA
TGCGTGGTGT TCATGGGCGC GGTCGACGTC GAACGCCAGC AGCCCAACGT GCTGCGGATG
AAGGCGCTCG GCGCAGAAGT CCGCCCCGTC ACCTCCGGCG CCAACACGCT GAAGGACGCG
ATGAACGAGG CGCTGCGTGA CTGGGTCACC AACGTCCACG ACACGTTCTA TTGCATCGGC
ACGGTCGCGG GTCCGCATCC CTATCCGATG ATGGTGCGCG ACTTCCAGGC GGTGATCGGT
CAGGAAGTCC GCGAGCAGAT CATGCAGGCC GAAGGTCGCC TGCCCGACTC GCTGGTCGCC
TGCATCGGCG GCGGCTCCAA CGCGATGGGG CTGTTCCATC CGTTCCTCGA CGATCCGGGC
GTCGCGATCT ACGGCGTCGA AGCTGCGGGC CATGGGCTCG ACAAGCTGCA CGCGGCGTCG
ATCGCCGGCG GCAAGCCGGG CGTGCTGCAC GGCAACCGCA CCTATCTGCT GATGGATGCG
GACGGCCAGA TCGAGGAAGC GCATTCGATC TCCGCCGGCC TCGACTATCC GGGCGTCGGC
CCCGAGCACT CCTGGCTGCA CGACGTCGGC CGCGTCAACT TCCTGTCCGC CACCGACACC
GAAGCGCTCG ACGCGTTCAA GCTGTGCTGC CGACTCGAAG GCATCATCCC GGCGCTGGAG
CCGAGCCACG CGCTCGCCAA GGTCGCCGAC CTCGCGCCCA AGCTGCCGAA GGATCACCTG
ATGGTCGTGA ACATGTCCGG CCGCGGCGAC AAGGACCTCG CGTCGGTCGC AGAACATCTC
GGGGGCAAGT TCTGA
 
Protein sequence
MNQALPNSFR SGPDERGHFG IYGGRFVAET LMPLILDLEK AYAEAKADPA FRAEMDNHLK 
HYVGRPSALY FAERLTEHFG GAKIYFKRED LNHTGAHKVN NVLGQIMLAK RMGKPRVIAE
TGAGMHGVAT ATMCAKFGLE CVVFMGAVDV ERQQPNVLRM KALGAEVRPV TSGANTLKDA
MNEALRDWVT NVHDTFYCIG TVAGPHPYPM MVRDFQAVIG QEVREQIMQA EGRLPDSLVA
CIGGGSNAMG LFHPFLDDPG VAIYGVEAAG HGLDKLHAAS IAGGKPGVLH GNRTYLLMDA
DGQIEEAHSI SAGLDYPGVG PEHSWLHDVG RVNFLSATDT EALDAFKLCC RLEGIIPALE
PSHALAKVAD LAPKLPKDHL MVVNMSGRGD KDLASVAEHL GGKF
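
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the coding sequence. It is an illustration, not the database's own method: the function names are hypothetical, and the codon table used is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first and last rows of the gene sequence are included, to keep the example short.

```python
# Build the standard codon table in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the listing.
first_row = clean_sequence(
    "ATGAACCAAG CTTTGCCGAA TTCCTTCCGG TCCGGCCCCG ACGAGCGCGG GCATTTCGGT"
)
last_row = clean_sequence("GGGGGCAAGT TCTGA")

print(translate(first_row))  # MNQALPNSFRSGPDERGHFG
print(translate(last_row))   # GGKF
print(gc_fraction(first_row))
```

The two translated fragments match the start and end of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give the value to compare against the GC content field.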