Gene RPB_3714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3714 
Symbol 
ID3911516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4252697 
End bp4254061 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content67% 
IMG OID637885616 
Producttryptophan synthase subunit beta 
Protein accessionYP_487320 
Protein GI86750824 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC AGATCAAGTA CGTCCTGGAC GAGGAGAACA TCCCGAAGTC CTGGTACAAT 
CTCAATGCCG ACTTTCCCAA GCCGGTACCC GACGTGCTGC ATCCGGGCAC GCATCAGCCG
GTCGGCCCGT CCGATCTGGA GCCGCTGTTT CCGATGGAGC TGATCCTCCA GGAGGTCGCC
ACCGATCGCT ACATCGACAT TCCCGCGCCG GTGCGCGACG TGTTCCGGAT GTGGCGGCCG
TCGCCGCTGG TGCGTGCGCG CCGCCTCGAG CAGGCGCTCG GCACGCCGGC CAAGATCTAC
TACAAATACG AAGGCGTCTC GCCGGCCGGC TCGCACAAGC CGAACACCGC GGTGCCGCAG
GCCTGGTACA ACAAAGAGGC CGGCATCAAG AAGCTGTCGA CCGAGACCGG TGCCGGGCAG
TGGGGCTCGT CGCTGGCCTT CGCCGGCTCG CTGTTCGGGC TCGACGTGCT GGTGTTCCAG
GTCCGCGTCT CGTTCGACCA GAAACCGTAT CGCCGCGCGC TGATGGAAAC CTACGGCGCG
CGCTGCATCG CCTCGCCCTC GACCGAGACC GAGTCCGGCC GCGCCATCCT GGCGCAGCAT
CCCGACAGCC CCGGCTCGCT CGGCATCGCG ATCTCCGAAG CCGTCGAAGT CGCGGCGAAG
AATCCGGATA TCAAATACGC GCTCGGCTCG GTGCTCAATC ACGTCATGCT GCACCAGACC
ATCATCGGCC AGGAAGCGAT CAAGCAGTGC GAAATGGCCG GTGACGATCC CGACGTGATC
ATCGGCTGCG CCGGCGGCGG CTCGAATTTC GCAGGCCTTG CGTTCCCGTT CCTCGGCCTG
CAGCTGCGCG GCGGCCGGTC GCGGCGGATC ATCGCGGTCG AGCCCGCGGC GTGCCCGACG
CTGACGCGCG GCACCTACGC CTATGATTTC GGCGACACCG CGCATCTGAC GCCCTTGGTG
AAGATGCACA CGCTGGGCTC GACCTTCATT CCGCCGGGCT TCCACGCCGG CGGCCTGCGC
TATCACGGCA TGAGCGGGAT GGTGTCGCAC GCCTACGAGC TCGGCCTGAT CGAGGCGCGT
GCCTATCACC AGGTGAAGTG CTTCGAAGCC GGCGTGCAGT TCGCCCGCAA CGAGGGCATC
GTGCCGGCGC CGGAATCGAC CCACGCGGTG CGCTGCGCGA TCGACGAGGC GCTGCGCTGC
AAGGCGGAGG GCAAGGCGGA GACGATCCTG TTCAACCTCT CGGGTCACGG CCATTTCGAC
ATGCAGGCCT ACATCAACTA CTACGAAGGC AAGCTCGTCG ACGTCGACTA CAACGAAGCC
GACCTCGCGA CCGCGCTGGC GGGCCTGCCG GCGGTGGCGG CTTAG
 
Protein sequence
MSDQIKYVLD EENIPKSWYN LNADFPKPVP DVLHPGTHQP VGPSDLEPLF PMELILQEVA 
TDRYIDIPAP VRDVFRMWRP SPLVRARRLE QALGTPAKIY YKYEGVSPAG SHKPNTAVPQ
AWYNKEAGIK KLSTETGAGQ WGSSLAFAGS LFGLDVLVFQ VRVSFDQKPY RRALMETYGA
RCIASPSTET ESGRAILAQH PDSPGSLGIA ISEAVEVAAK NPDIKYALGS VLNHVMLHQT
IIGQEAIKQC EMAGDDPDVI IGCAGGGSNF AGLAFPFLGL QLRGGRSRRI IAVEPAACPT
LTRGTYAYDF GDTAHLTPLV KMHTLGSTFI PPGFHAGGLR YHGMSGMVSH AYELGLIEAR
AYHQVKCFEA GVQFARNEGI VPAPESTHAV RCAIDEALRC KAEGKAETIL FNLSGHGHFD
MQAYINYYEG KLVDVDYNEA DLATALAGLP AVAA