Gene RPD_3724 details

Gene Information

Locus tag: RPD_3724
Symbol:
ID: 4024240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4159005
End bp: 4160216
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 66%
IMG OID: 637963928
Product: 5-aminolevulinate synthase
Protein accession: YP_570846
Protein GI: 91978187
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase; [TIGR01821] 5-aminolevulinic acid synthase
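
The gene length and protein length in this record follow from the start and end coordinates. The sketch below is a minimal check of that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4159005, 4160216)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both values agree with the Gene Length and Protein Length fields above.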


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00666631
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00153498
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATTACG AAGCCTATTT CCGCCGTCAA CTTGACGGCC TCCATCGTGA AGGCCGGTAT 
CGGGTTTTCG CTGATCTGGA ACGTCATGCC GGCTCGTTCC CGCGCGCCAC GCACCACCGG
CCTGAGGGCG CCGGCGATGT GACGGTGTGG TGCTCCAACG ATTATCTCGG CATGGGCCAG
CACCCGGCGG TGCTGACGGC CATGCACGAG GCGCTGGACA GCTGCGGCGC CGGCGCCGGC
GGCACCCGCA ACATCGCGGG TACCAACCAC TATCACGTGC TGCTGGAGCA GGAGCTCGCC
GCTCTGCATG GCAAGGAATC CGCCCTGCTG TTCACCTCCG GTTACGTTTC GAACTGGGCG
TCGTTGTCGA CGCTGGCGTC GCGGATGCCC GGCTGCGTGA TCCTGTCGGA CGAACTCAAT
CATGCTTCGA TGATCGAGGG CATTCGTCAC AGCCGCAGCG AGACCCGAAT TTTCGCGCAC
AATGATCCGC GCGACCTCGA GCGCAAGCTC GCCGATCTCG ACCCGCACGC GCCGAAGCTG
GTCGCCTTCG AGTCGGTGTA TTCGATGGAC GGCGACATCG CGCCGATCGC CGAAATCTGC
GACGTCGCCG ACGCGCATAA CGCGATGACG TATCTCGATG AAGTCCACGG CGTCGGCCTG
TACGGCCCGA ATGGCGGTGG CATTGCCGAT CGCGAGGGCA TCAGCCATCG CCTCACCATC
ATCGAAGGCA CGCTGGCGAA AGCGTTCGGC GTGGTCGGCG GCTACATCGC CGGCTCCTCG
GCCGTCTGCG ATTTCGTCCG CAGCTTCGCC TCGGGCTTCA TCTTCAGCAC CTCGCCGCCT
CCGGCGGTCG CTGCAGGCGC GCTGGCCAGC ATCCGCCATC TGCGCGCCAG TTCGGCGGAG
CGCGAGCGTC ATCAGGATCG GGTGGCGCGG CTGCGCGCCA GGCTCGATCA GGCCGGCGTG
GCCCACATGC CGAACCCGAG CCATATCGTT CCGGTCATGG TCGGCGATGC CGCGCTGTGC
AAGCAGATCA GCGACGAGTT GATCAGCCGG TACGGCATTT ATGTGCAGCC GATCAACTAT
CCGACCGTGC CGCGCGGCAC CGAGCGCCTT CGGATCACGC CGTCACCGCA GCACACCGAT
GCCGACATCG AGCATCTGGT GCAGGCGCTC AGTGAAATCT GGACCCGCGT CGGTCTCGCC
AAGGCGGCCT GA
 
Protein sequence
MNYEAYFRRQ LDGLHREGRY RVFADLERHA GSFPRATHHR PEGAGDVTVW CSNDYLGMGQ 
HPAVLTAMHE ALDSCGAGAG GTRNIAGTNH YHVLLEQELA ALHGKESALL FTSGYVSNWA
SLSTLASRMP GCVILSDELN HASMIEGIRH SRSETRIFAH NDPRDLERKL ADLDPHAPKL
VAFESVYSMD GDIAPIAEIC DVADAHNAMT YLDEVHGVGL YGPNGGGIAD REGISHRLTI
IEGTLAKAFG VVGGYIAGSS AVCDFVRSFA SGFIFSTSPP PAVAAGALAS IRHLRASSAE
RERHQDRVAR LRARLDQAGV AHMPNPSHIV PVMVGDAALC KQISDELISR YGIYVQPINY
PTVPRGTERL RITPSPQHTD ADIEHLVQAL SEIWTRVGLA KAA
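
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It strips the 10-character grouping, computes GC content, and translates with the codon assignments of translation table 11, whose amino acid assignments match the standard code. The helper names are hypothetical. Alternative start codons, which table 11 reads as methionine at the first position, are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon order and amino acid string as laid out in the NCBI genetic code
# tables (bases in T, C, A, G order; the first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as it appears in the record.
first_row = "ATGAATTACG AAGCCTATTT CCGCCGTCAA"
print(translate(clean_sequence(first_row)))  # MNYEAYFRRQ
```

Passing the full gene sequence through `clean_sequence` and `translate` should give a string that can be compared with the cleaned protein sequence, and `gc_percent` can be compared with the GC content field.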