Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3969 |
Symbol | |
ID | 3911776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4533532 |
End bp | 4534743 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885873 |
Product | 5-aminolevulinate synthase |
Protein accession | YP_487573 |
Protein GI | 86751077 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes |
TIGRFAM ID | [TIGR00858] 8-amino-7-oxononanoate synthase [TIGR01821] 5-aminolevulinic acid synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0108645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.403503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTACG AAGCCTATTT CCGCCGTCAG CTCGAAGGCC TGCATCGTGA GGGCCGGTAT CGGGTGTTCG CCGATCTGGA ACGCCATGCC GGCGCCTATC CCCGCGCGAC GCATCACCGG CCGGACGGCA CCGGCGACGT GACGGTGTGG TGCTCCAACG ATTACCTCGG CATGGGCCAG CACCCGGCGG TGCTGAAGGC GATGCACGAG GCGCTGGACA GCTGCGGCGC CGGCGCCGGC GGCACCCGCA ACATCGCGGG AACGAATCAC TATCACGTGC TGCTCGAGCA GGAGCTGGCG GCGCTGCACG GCAAGGAATC CGCGCTGCTG TTCACCTCCG GCTACGTCTC CAACTGGGCG TCGCTGTCGA CGCTGGCGTC GCGCATGCCC GGCTGCGTGA TCCTGTCCGA CGAGCTCAAC CACGCCTCGA TGATCGAGGG CATCCGCCAC AGCCGCAGCG AAACCCGAAT CTTCGCGCAC AACGACCCGC GCGACCTCGA GCGCAAGCTT GCCGATCTCG ATCCGCATGC GCCCAAGTTG GTCGCCTTCG AGTCGGTGTA TTCGATGGAT GGCGATATCG CTCCGATCGC CGAGATCTGC GACGTCGCCG ATGCGGCCAA CGCCATGACC TATCTCGATG AAGTCCATGG TGTCGGGCTG TACGGCCCGA ACGGCGGCGG CATTGCGGAT CGCGAGGGCC TCAGCCATCG CCTCACCATC ATCGAGGGCA CCCTGGCCAA AGCGTTCGGC GTGGTCGGCG GCTACATTGC CGGCTCCGCG GCGGTGTGCG ATTTCGTCCG CAGCTTCGCT TCCGGCTTCA TCTTCAGCAC CTCGCCGCCG CCCGCAGTGG CCGCCGGCGC GCTGGCGAGC ATCCGGCATC TGCGCGCCTC TTCCATCGAG CGCGAACGCC ATCAGGACCG GGTGGCGCGA CTGCGCGCCC GGCTCGATCA GGCCGGCGTG GCCCACATGC CGAACCCCAG CCATATCGTG CCGGTGATGG TCGGCGACGC AGCGCTGTGC AAGCAGATCA GTGACGAGCT GATCAACCGC TACGGCATCT ATGTTCAGCC GATCAACTAT CCGACCGTCC CGCGTGGCAC CGAGCGGCTG CGGATCACGC CGTCGCCGCA GCACTCCGAC GCGGACATCG AGCATCTGGT CCAGGCGCTC AGCGAAATCT GGGCTCGCGT CGGCCTCGCC AAGGCGGCCT GA
|
Protein sequence | MNYEAYFRRQ LEGLHREGRY RVFADLERHA GAYPRATHHR PDGTGDVTVW CSNDYLGMGQ HPAVLKAMHE ALDSCGAGAG GTRNIAGTNH YHVLLEQELA ALHGKESALL FTSGYVSNWA SLSTLASRMP GCVILSDELN HASMIEGIRH SRSETRIFAH NDPRDLERKL ADLDPHAPKL VAFESVYSMD GDIAPIAEIC DVADAANAMT YLDEVHGVGL YGPNGGGIAD REGLSHRLTI IEGTLAKAFG VVGGYIAGSA AVCDFVRSFA SGFIFSTSPP PAVAAGALAS IRHLRASSIE RERHQDRVAR LRARLDQAGV AHMPNPSHIV PVMVGDAALC KQISDELINR YGIYVQPINY PTVPRGTERL RITPSPQHSD ADIEHLVQAL SEIWARVGLA KAA
|
| |