Gene RPD_4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4029 
Symbol 
ID4024546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4477435 
End bp4478415 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content65% 
IMG OID637964232 
Producthomoserine kinase 
Protein accessionYP_571149 
Protein GI91978490 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR00938] homoserine kinase, Neisseria type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCT ATACCGACGT CGCCGCCGAC GAACTCGCGG ACTTTCTCAA GGCCTACGAG 
ATCGGCGACC TGCTATCCTA CAAAGGCATC GCCGAGGGCG TCGAGAACTC CAACTTCCTG
CTGCATACCA CGCGCGGCCA CTTCATCCTG ACGCTGTACG AAAAGCGCGT CGCCGCCGAC
GACCTGCCGT ATTTTCTGTC GCTGATGGCG CATCTCGCCG CGCGCGGCGT CAGTTGCCCG
CAGCCGGCGA CGAATCGTGC GGGCGAAGTC TGCGGGACGT TGTCCGGCCG CCCGGCGGTG
ATCATCAATT TCCTCGAAGG CGTCTGGCCG CGCCGGCCTA ATCTGGCCCA CTGCGCAGGC
GTCGGCGAGG CGATGGCGAA GATGCACCGC GCAGGCCTGG ACTATCCCTC CTACCGCTCT
AATCCGCTGT CGGTGACAGG CTGGCGGCCG CTGTTCAACA TCGCGGCCTC GCGGGCCGAC
GAGATCCAGC CCGGCCTGCG CGATTTCATC GCCGCCGAAC TCGATTATCT CGAAGGCAAC
TGGCCCGATC AATTGCCGAC CGGCGTGATC CACGCCGATC TGTTTCCGGA CAATGTGTTC
TTCATCGGCG ACAAGCTGTC GGGGCTGATC GACTTTCCGT TCTCCTGCAA CGATATCCTC
GCCTACGACG TGGCGATCTG CCTGAACGCC TGGTGCTTCG AGCCGGATCT TTCGTTCAAC
GTCACCAAGG CCCGGGCGCT GCTCAACGCC TATCAGCGTG AACGCGCGTT GAGCGAGGCC
GAGCAGGCGG CGCTGCCGTT GCTGGCGCGC GGCGCGGCGA TGCGCTTCCT GCTGACGCGG
CTGGTCGATT TCCTCGACGT GCCGGCGGGC GCGCTGGTCC GCCCGAAGGA TCCGCTGGAA
TACGTCCGCA AGCTGCGCTT CCAGCAGAAC GTCGCCGGCA TTCGCGACTA CGGCGTCGAA
GCGGCGGGAG CAGTGGCGTG A
 
Protein sequence
MAVYTDVAAD ELADFLKAYE IGDLLSYKGI AEGVENSNFL LHTTRGHFIL TLYEKRVAAD 
DLPYFLSLMA HLAARGVSCP QPATNRAGEV CGTLSGRPAV IINFLEGVWP RRPNLAHCAG
VGEAMAKMHR AGLDYPSYRS NPLSVTGWRP LFNIAASRAD EIQPGLRDFI AAELDYLEGN
WPDQLPTGVI HADLFPDNVF FIGDKLSGLI DFPFSCNDIL AYDVAICLNA WCFEPDLSFN
VTKARALLNA YQRERALSEA EQAALPLLAR GAAMRFLLTR LVDFLDVPAG ALVRPKDPLE
YVRKLRFQQN VAGIRDYGVE AAGAVA