Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3714 |
Symbol | |
ID | 3911516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4252697 |
End bp | 4254061 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885616 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_487320 |
Protein GI | 86750824 |
COG category | [R] General function prediction only |
COG ID | [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) |
TIGRFAM ID | [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC AGATCAAGTA CGTCCTGGAC GAGGAGAACA TCCCGAAGTC CTGGTACAAT CTCAATGCCG ACTTTCCCAA GCCGGTACCC GACGTGCTGC ATCCGGGCAC GCATCAGCCG GTCGGCCCGT CCGATCTGGA GCCGCTGTTT CCGATGGAGC TGATCCTCCA GGAGGTCGCC ACCGATCGCT ACATCGACAT TCCCGCGCCG GTGCGCGACG TGTTCCGGAT GTGGCGGCCG TCGCCGCTGG TGCGTGCGCG CCGCCTCGAG CAGGCGCTCG GCACGCCGGC CAAGATCTAC TACAAATACG AAGGCGTCTC GCCGGCCGGC TCGCACAAGC CGAACACCGC GGTGCCGCAG GCCTGGTACA ACAAAGAGGC CGGCATCAAG AAGCTGTCGA CCGAGACCGG TGCCGGGCAG TGGGGCTCGT CGCTGGCCTT CGCCGGCTCG CTGTTCGGGC TCGACGTGCT GGTGTTCCAG GTCCGCGTCT CGTTCGACCA GAAACCGTAT CGCCGCGCGC TGATGGAAAC CTACGGCGCG CGCTGCATCG CCTCGCCCTC GACCGAGACC GAGTCCGGCC GCGCCATCCT GGCGCAGCAT CCCGACAGCC CCGGCTCGCT CGGCATCGCG ATCTCCGAAG CCGTCGAAGT CGCGGCGAAG AATCCGGATA TCAAATACGC GCTCGGCTCG GTGCTCAATC ACGTCATGCT GCACCAGACC ATCATCGGCC AGGAAGCGAT CAAGCAGTGC GAAATGGCCG GTGACGATCC CGACGTGATC ATCGGCTGCG CCGGCGGCGG CTCGAATTTC GCAGGCCTTG CGTTCCCGTT CCTCGGCCTG CAGCTGCGCG GCGGCCGGTC GCGGCGGATC ATCGCGGTCG AGCCCGCGGC GTGCCCGACG CTGACGCGCG GCACCTACGC CTATGATTTC GGCGACACCG CGCATCTGAC GCCCTTGGTG AAGATGCACA CGCTGGGCTC GACCTTCATT CCGCCGGGCT TCCACGCCGG CGGCCTGCGC TATCACGGCA TGAGCGGGAT GGTGTCGCAC GCCTACGAGC TCGGCCTGAT CGAGGCGCGT GCCTATCACC AGGTGAAGTG CTTCGAAGCC GGCGTGCAGT TCGCCCGCAA CGAGGGCATC GTGCCGGCGC CGGAATCGAC CCACGCGGTG CGCTGCGCGA TCGACGAGGC GCTGCGCTGC AAGGCGGAGG GCAAGGCGGA GACGATCCTG TTCAACCTCT CGGGTCACGG CCATTTCGAC ATGCAGGCCT ACATCAACTA CTACGAAGGC AAGCTCGTCG ACGTCGACTA CAACGAAGCC GACCTCGCGA CCGCGCTGGC GGGCCTGCCG GCGGTGGCGG CTTAG
|
Protein sequence | MSDQIKYVLD EENIPKSWYN LNADFPKPVP DVLHPGTHQP VGPSDLEPLF PMELILQEVA TDRYIDIPAP VRDVFRMWRP SPLVRARRLE QALGTPAKIY YKYEGVSPAG SHKPNTAVPQ AWYNKEAGIK KLSTETGAGQ WGSSLAFAGS LFGLDVLVFQ VRVSFDQKPY RRALMETYGA RCIASPSTET ESGRAILAQH PDSPGSLGIA ISEAVEVAAK NPDIKYALGS VLNHVMLHQT IIGQEAIKQC EMAGDDPDVI IGCAGGGSNF AGLAFPFLGL QLRGGRSRRI IAVEPAACPT LTRGTYAYDF GDTAHLTPLV KMHTLGSTFI PPGFHAGGLR YHGMSGMVSH AYELGLIEAR AYHQVKCFEA GVQFARNEGI VPAPESTHAV RCAIDEALRC KAEGKAETIL FNLSGHGHFD MQAYINYYEG KLVDVDYNEA DLATALAGLP AVAA
|
| |