Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2167 |
Symbol | |
ID | 3909947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2458560 |
End bp | 2459810 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884061 |
Product | tyrosinase |
Protein accession | YP_485784 |
Protein GI | 86749288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.649942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0551649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTCC AAATCGATAT TCCCGGTGAA GATGCCCAAG GTCGCGTGTT TCTCGGCTGG ACGCCCGTTC AGGCCAGCGC CCGGTTGTTG CAGGGTCCAG GTGCCGGCTC CGTCGACGTC GAGATCAGCA GCGCCGGCGC GGTCGGCGGG CTGGTGTTCG ATACGGCGCG AACGCACAAC GGCGGCTCGC GCCTGACGCT CGGCCTGCCG GGTGACGGCC GGCCGGTGAC GTTCTTCGTC GCGGGCGAAT TCCTCAAGCC GAGTTCGCGT TACGGCGACG CCGCCATCGC CGTGAAGGAC AAGGCCAGCG GCGCCCCGCT CGCCAGCAAG CCGGTGATGG TGCGCATCCG CAAAAATGCG GTCACGTTGA GCCAGGAGGA GCGCGACGAC TTCCTCGCCG CGCTCGGCAC CCTGAATGCG CGCGGCCAGG GTCCCTATCG TATCGTTCGC GATATGCACG ACGCCGATTC GGATCTCGAA ATCCATCGCA ACGAAGGCTT CCTGCCGTGG CACCGCGCCT ACGTGCTCGG GCTCGAGCGC GCGCTCCAGG CGATCAACCC GGTGGTGACG CTGCCCTATT GGCAGTTCGA CGCGCCCGCG CCTGTGCTCT TCACCATCGA CTACATGGGA CGATCCGACG AATCCGGCCA CGTCGTTTTC CGGCCCGGGC ACTCGCTCGA GCACTGGGTG GCGAAGGATA CGCCCGGGAT CGTCCGCGTG CCGCGATTTG CGTCCGACGC TCCGGCGCTG GTCATCAGCG AAGACGATAC GATCAGGCTC GGCGGGGCGA CAGCCGATTT CGCCCTGTTT CGGCAAATGG AGGGCGGGCC GCACGGTCAG GCGCACAACA GCTTTGCCGC CCCCAGCCCG CTCAGGTACC CCGCCCTCGC CGTCTATGAT CCTTTGTTCT TCCTGCTCCA CTGCAATGTC GACCGGCTGT GGACGAAATG GCAGTGGATC AAGCACCGCA CGGACAGCTC CGACCGGCTC GCCTACAGCG ACGGCACGCG CGCCGGCACC AAGCGCGGCG ACACGATGTG GCCGTGGAAT GGCATCCACG GCAATCCGCG GCCGCCGACT GCACCGGGCG GACCGTTTCC TCCGACCAGT ACCACGCCGG CGCCGGGCCG CACGCCAAGG GTCAGCGACA TGCTCGATGC GATGGCGCTC AAGGCGCCCG ATCCGCTCGG CTTCGCCTAC GACGACGTGC CGTTCCAGCT GCCGCCGACC GTGGTTGCAG GTCATGCCTA G
|
Protein sequence | MQVQIDIPGE DAQGRVFLGW TPVQASARLL QGPGAGSVDV EISSAGAVGG LVFDTARTHN GGSRLTLGLP GDGRPVTFFV AGEFLKPSSR YGDAAIAVKD KASGAPLASK PVMVRIRKNA VTLSQEERDD FLAALGTLNA RGQGPYRIVR DMHDADSDLE IHRNEGFLPW HRAYVLGLER ALQAINPVVT LPYWQFDAPA PVLFTIDYMG RSDESGHVVF RPGHSLEHWV AKDTPGIVRV PRFASDAPAL VISEDDTIRL GGATADFALF RQMEGGPHGQ AHNSFAAPSP LRYPALAVYD PLFFLLHCNV DRLWTKWQWI KHRTDSSDRL AYSDGTRAGT KRGDTMWPWN GIHGNPRPPT APGGPFPPTS TTPAPGRTPR VSDMLDAMAL KAPDPLGFAY DDVPFQLPPT VVAGHA
|
| |