Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3533 |
Symbol | |
ID | 3911335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4045003 |
End bp | 4045869 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885435 |
Product | dihydropteroate synthase |
Protein accession | YP_487139 |
Protein GI | 86750643 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.139072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.316671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAG CGCAGCAAAC CCCAGCGCCT CCCACCGCGA CCGGCGCCGC CGCGTTGCGG GCGCTGCTGG CGCGCCCGGT GCCGGTGGTG ATGGGCATTC TCAACGTCAC CCCCGATTCG TTCTCCGACG GCGGCGACTT TCTCGGGCCC GAGCGGGCGC GGGCGCATGC GGCGCGGATG ATCGCGGACG GCGCCGACAT CATCGACATC GGCGCCGAGA GCACCCGCCC CTACGGGTCG ACGCCGGTCT CGGCCGACGA GGAGCGGCGG CGGCTGGCGC CGGTGCTGGC GGCGGTCGCA GCACTCGGCG CGCCGGTGTC GATCGACAGC ATGAAGGCCG AGGTGGTGGC CTGGGCGCTC GACCATGGGG CTGCGATCGC CAACGACGTC TGGGGCCTGC AGCGCGATCC GGCGATGGCG GAGGTCGTCG CGGCGCGCGG CGCGCCGGTG ATCGTGATGC ATAATCGCGA CGACGCCGAC CCGTCGATCG ATATCGTCGC TGATATCAAC GCGTTCTTCG AACGCTCGCT GGCGATCGCC GCCCGCGCCG GCATCGCCGA GCACAGCATC GTGCTCGATC CCGGCATCGG CTTCGGCAAG ACGCCGCAGC AGAGCATGAT CGCGCTGGCG CGGCTGGGCG CCTTCGCGCA TTTCGGCCTG CCCGTGCTGG TCGGCGCCTC GCGCAAGCGT TTCATCAGCA CGGTGGCGCC GTCGGAACCG AAGCAGCGGC TCGGCGGTTC GATCGCCGCG CATCTCATCG CGATGGAGAA CGGCGCGCGA ATCATCCGCG CCCACGACGT CGCCGACACC GCGCAGGCGC TCAAAGTCGC GCACGCGATC AGGACCAGCA GCGAGGACAG ACGATGA
|
Protein sequence | MSAAQQTPAP PTATGAAALR ALLARPVPVV MGILNVTPDS FSDGGDFLGP ERARAHAARM IADGADIIDI GAESTRPYGS TPVSADEERR RLAPVLAAVA ALGAPVSIDS MKAEVVAWAL DHGAAIANDV WGLQRDPAMA EVVAARGAPV IVMHNRDDAD PSIDIVADIN AFFERSLAIA ARAGIAEHSI VLDPGIGFGK TPQQSMIALA RLGAFAHFGL PVLVGASRKR FISTVAPSEP KQRLGGSIAA HLIAMENGAR IIRAHDVADT AQALKVAHAI RTSSEDRR
|
| |