Gene RPB_3969 details


Gene Information

Locus tag: RPB_3969
Symbol:
ID: 3911776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4533532
End bp: 4534743
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 67%
IMG OID: 637885873
Product: 5-aminolevulinate synthase
Protein accession: YP_487573
Protein GI: 86751077
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
            [TIGR01821] 5-aminolevulinic acid synthase
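
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4533532
end_bp = 4534743
gene_length_bp = 1212
protein_length_aa = 403


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```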


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0108645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.403503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTACG AAGCCTATTT CCGCCGTCAG CTCGAAGGCC TGCATCGTGA GGGCCGGTAT 
CGGGTGTTCG CCGATCTGGA ACGCCATGCC GGCGCCTATC CCCGCGCGAC GCATCACCGG
CCGGACGGCA CCGGCGACGT GACGGTGTGG TGCTCCAACG ATTACCTCGG CATGGGCCAG
CACCCGGCGG TGCTGAAGGC GATGCACGAG GCGCTGGACA GCTGCGGCGC CGGCGCCGGC
GGCACCCGCA ACATCGCGGG AACGAATCAC TATCACGTGC TGCTCGAGCA GGAGCTGGCG
GCGCTGCACG GCAAGGAATC CGCGCTGCTG TTCACCTCCG GCTACGTCTC CAACTGGGCG
TCGCTGTCGA CGCTGGCGTC GCGCATGCCC GGCTGCGTGA TCCTGTCCGA CGAGCTCAAC
CACGCCTCGA TGATCGAGGG CATCCGCCAC AGCCGCAGCG AAACCCGAAT CTTCGCGCAC
AACGACCCGC GCGACCTCGA GCGCAAGCTT GCCGATCTCG ATCCGCATGC GCCCAAGTTG
GTCGCCTTCG AGTCGGTGTA TTCGATGGAT GGCGATATCG CTCCGATCGC CGAGATCTGC
GACGTCGCCG ATGCGGCCAA CGCCATGACC TATCTCGATG AAGTCCATGG TGTCGGGCTG
TACGGCCCGA ACGGCGGCGG CATTGCGGAT CGCGAGGGCC TCAGCCATCG CCTCACCATC
ATCGAGGGCA CCCTGGCCAA AGCGTTCGGC GTGGTCGGCG GCTACATTGC CGGCTCCGCG
GCGGTGTGCG ATTTCGTCCG CAGCTTCGCT TCCGGCTTCA TCTTCAGCAC CTCGCCGCCG
CCCGCAGTGG CCGCCGGCGC GCTGGCGAGC ATCCGGCATC TGCGCGCCTC TTCCATCGAG
CGCGAACGCC ATCAGGACCG GGTGGCGCGA CTGCGCGCCC GGCTCGATCA GGCCGGCGTG
GCCCACATGC CGAACCCCAG CCATATCGTG CCGGTGATGG TCGGCGACGC AGCGCTGTGC
AAGCAGATCA GTGACGAGCT GATCAACCGC TACGGCATCT ATGTTCAGCC GATCAACTAT
CCGACCGTCC CGCGTGGCAC CGAGCGGCTG CGGATCACGC CGTCGCCGCA GCACTCCGAC
GCGGACATCG AGCATCTGGT CCAGGCGCTC AGCGAAATCT GGGCTCGCGT CGGCCTCGCC
AAGGCGGCCT GA
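
The gene sequence is printed in blocks of 10 bases separated by spaces. A minimal sketch of how such text could be joined into one contiguous string and its GC fraction computed is given below. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line alone rather than the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAACTACG AAGCCTATTT CCGCCGTCAG CTCGAAGGCC TGCATCGTGA GGGCCGGTAT"

seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of the first line: {gc_fraction(seq):.2f}")
```

Passing the full multi-line sequence to `join_blocks` works the same way, since line breaks are treated as whitespace.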
 
Protein sequence
MNYEAYFRRQ LEGLHREGRY RVFADLERHA GAYPRATHHR PDGTGDVTVW CSNDYLGMGQ 
HPAVLKAMHE ALDSCGAGAG GTRNIAGTNH YHVLLEQELA ALHGKESALL FTSGYVSNWA
SLSTLASRMP GCVILSDELN HASMIEGIRH SRSETRIFAH NDPRDLERKL ADLDPHAPKL
VAFESVYSMD GDIAPIAEIC DVADAANAMT YLDEVHGVGL YGPNGGGIAD REGLSHRLTI
IEGTLAKAFG VVGGYIAGSA AVCDFVRSFA SGFIFSTSPP PAVAAGALAS IRHLRASSIE
RERHQDRVAR LRARLDQAGV AHMPNPSHIV PVMVGDAALC KQISDELINR YGIYVQPINY
PTVPRGTERL RITPSPQHSD ADIEHLVQAL SEIWARVGLA KAA
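
As an illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first seven codons of the gene. It builds the codon table from the compact TCAG-ordered amino acid string. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so that string is used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("ATGAACTACG AAGCCTATTT C"))  # MNYEAYF
```

The output matches the first seven residues of the protein sequence listed above.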