Gene RPD_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1764 
Symbol 
ID4022246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1979859 
End bp1981025 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content69% 
IMG OID637961958 
Productthreonine aldolase 
Protein accessionYP_568901 
Protein GI91976242 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.382651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.287208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGCG TGGTGGCCGG CACAAGTCGA TCGATCGCAG AGCGCTCTGC AAATCGCACC 
CATTCCGCTT CACGCCGGCG CTGTGCTAAT CTGCCCCTCA TGGCCTATAT CCCTGCTCTG
CCGGATTCCA ACGCGCCGCC GGTGCGCATC AATCTCCTCT CCGACACCCA GACGCGACCG
AGCGCAGCGA TGCGGGAGGC GATGGCGCGC GCGGAGGTCG GCGACGAACA AATGGGCGAC
GATCCGACCG TCAATGCGCT GAACGAGCGC GTCGCGGCGC TGCTCGGCAA GCAAGCCGCG
GTGTTCCTGC CGTCGGGCAC GATGTGCAAC GTCACCGCGA CGCTCAGCAC CTGTCGTCCC
GGCGACGAGA TCATCGCGCA TCGCACCGCG CATATTCTGT CGCGCGAAGG CGGCGCCCAT
GCGGCGATCG GCGGCTTCCA GATCACGCCG CTCGACGGCG ACGACGGCCA GTTCTCGCTC
GATGCGTTTC GCGCCGCGCT GCATCCGCGC TCGCGCTACC AGCCGCCGCA GACGATGGTC
AGCGTCGAGC AGACCGCCAA TATCGGCGGC GGCACGATCT GGCCGCAGGC GACGCTCGAT
GCGATCGCCG CGGCAGGCAA GGCGGCCGGA CTCGTCACCC ATATGGACGG CGCGCGGCTG
ATGAACGCCG TGGTCGCCAC CGGCATCACC GCCCGCGACA TGGCCGCGGG CTGGGATTCG
GTGTGGATCG ATTTCAGCAA AGGGCTCGGC GCTCCGGTGG GCGCGTCGCT CGCCGGCTCA
CGCGACTTCA TCGACGAGGT GTGGCGCTGG AAGCAGCGGC TCGGCGGCTC GATGCGGCAG
GCCGGCGTGA TCGCCGCCGC CTGCAACTAC GCGCTAGACC ATCACGTCGA ACGGCTCGCG
GAGGATCACG CCCATGCGCG GGCGCTGGCC GCGGGACTGG CGCAGATCGC GGGCGTCGAG
GTGCAGGAGC CGCAGACCAA TCTGGTGTTC TTCAGCCCCG AAGGCGCCGG CCTCGGCGGC
GAGGCCATGG TCGCGGCGCT GCGCCAGCGC GGCGTGCTGC TGGCGACGAT GGACGGCCGG
ATCCGGGCTT GCACCCATCT CGACGTCAGC CGCGACATGA TCGACGAGAC GATCGGTCTG
GTGCGCGAGA TTCTGCGTCG TCAATAA
 
Protein sequence
MRRVVAGTSR SIAERSANRT HSASRRRCAN LPLMAYIPAL PDSNAPPVRI NLLSDTQTRP 
SAAMREAMAR AEVGDEQMGD DPTVNALNER VAALLGKQAA VFLPSGTMCN VTATLSTCRP
GDEIIAHRTA HILSREGGAH AAIGGFQITP LDGDDGQFSL DAFRAALHPR SRYQPPQTMV
SVEQTANIGG GTIWPQATLD AIAAAGKAAG LVTHMDGARL MNAVVATGIT ARDMAAGWDS
VWIDFSKGLG APVGASLAGS RDFIDEVWRW KQRLGGSMRQ AGVIAAACNY ALDHHVERLA
EDHAHARALA AGLAQIAGVE VQEPQTNLVF FSPEGAGLGG EAMVAALRQR GVLLATMDGR
IRACTHLDVS RDMIDETIGL VREILRRQ