Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4175 |
Symbol | |
ID | 3911983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4746044 |
End bp | 4747159 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886079 |
Product | branched-chain amino acid aminotransferase |
Protein accession | YP_487778 |
Protein GI | 86751282 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01123] branched-chain amino acid aminotransferase, group II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.121006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTCT CGTTCGACAA CAAGGTCGAC GGCTCGGTCT GGCTGAATTT CGACATTCAG CCGTCGGCGC ATCCAGTGTC GGAGACCGAG CGGACCGCGA AGCTCGCGGA CCCGGGCTTC GGCCGCGTCT TCACCGACCA CATGGCGATC GTCCGCTACG ATCAGGCCAA GGGCTGGTAC GGCGCGCGCG TCGAGTCCCG CGCCAATTTC CCGCTCGATC CGGCGACCGC CGTGCTGCAC TACGCGCAGG AGATCTTCGA GGGGCTGAAG GCCTACAAGC GTGCCGATGG CGGCGTGAAC CTGTTCCGCC CCGACGCCAA TGCGCGGCGC TTCCGCGATT CCGCCGACCG CATGGCGATG GCGCAGCTTC CCGAGCCGGT GTTCATCGAG GCGATCGAAC AGCTGGTCCG GATCGACCGC GCGTGGATCC CGGGCGGCGA CGGCAGCCTG TATCTGCGGC CGTTCATGAT CGCCAGCGAG GTGTTCCTCG GCGTCAAGCC GTCGGCCGAA TACATCTTCG CGGTGATCGC CTCGCCGGTC GGCTCCTACT TCAAGGGCGG CCCCGCGCCG GTGTCGATCT GGGTGTCGGA GAACTACACC CGCGCCGCGA TCGGCGGCAC CGGCAGCGTC AAATGCGGCG GCAACTACGC GGCGTCGCTG CGCGCCCAGG CCGAGGCGAT CGCGCATGGC TGCGATCAGG TGGTGTTCCT CGACGCGATC GAGCGCCGCT ATATCGAGGA ACTCGGCGGC ATGAACGTGT TCTTCGTGTT CGACGACGGC TCGCTGTCGA CGCCGCCGCT CGGCACCATC CTGCCCGGCA TCACCCGCGA TTCGATCATC GCGCTGGCGC GGCAGGCCGG CCGCACCGTG CGCGAGGAAG CCTACACGAT CGAGCAATGG CGCGCCGATG CGGCCAGCGG CAAGCTGAAA GAGGCGTTCG CCTGCGGCAC CGCGGCGGTG ATCTCGCCGA TCGGCACCGT GCGCTCGGCG AGCGGCGACT TCACCATCAA CGGCGGCGTC GCCGGCGAGG TGGCGATGGG CCTGCGCAAG CAGCTCGTCG ACATCCAATA CGGCCGCGCC GAGGACAAGC ACGGCTGGAT CAGGGACGTG GCGTAA
|
Protein sequence | MGLSFDNKVD GSVWLNFDIQ PSAHPVSETE RTAKLADPGF GRVFTDHMAI VRYDQAKGWY GARVESRANF PLDPATAVLH YAQEIFEGLK AYKRADGGVN LFRPDANARR FRDSADRMAM AQLPEPVFIE AIEQLVRIDR AWIPGGDGSL YLRPFMIASE VFLGVKPSAE YIFAVIASPV GSYFKGGPAP VSIWVSENYT RAAIGGTGSV KCGGNYAASL RAQAEAIAHG CDQVVFLDAI ERRYIEELGG MNVFFVFDDG SLSTPPLGTI LPGITRDSII ALARQAGRTV REEAYTIEQW RADAASGKLK EAFACGTAAV ISPIGTVRSA SGDFTINGGV AGEVAMGLRK QLVDIQYGRA EDKHGWIRDV A
|
| |