Gene RPB_1341 details

Gene Information

Locus tag: RPB_1341
Symbol:
ID: 3907849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1527685
End bp: 1528665
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 65%
IMG OID: 637883235
Product: homoserine kinase
Protein accession: YP_484962
Protein GI: 86748466
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR00938] homoserine kinase, Neisseria type
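
The length fields in this record agree with the coordinates if the start and end positions are read as inclusive, 1-based coordinates and the stop codon is counted in the gene length but not in the protein length. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1527685, 1528665)
print(length, protein_length(length))  # 981 326
```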


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.528429
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.431849
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGGTCT ACACCGACGT CGCCGCCGAC GACCTCGCGG ATTTCCTCAA ATCCTATGAG 
ATCGGCGATC TGTTGTCCTA CAAGGGCATC GCCGAGGGCG TCGAGAATAC CAATTTCCTG
CTGCACACCA CGCGCGGCAG CTTCATTCTC ACGCTGTACG AGAAGCGCGT CGCCTCCGAG
GATCTGCCGT ATTTCCTGGC GCTGATGGCG CATCTGGCCG CGCGCGGCGT CAGTTGCCCG
CAGCCCGAAA AGACCCGCGA CGGCGAGATC TGCGGCGCGT TGTCCGGCCG CCCGGCGGTG
ATCATCAATT TCCTCGAAGG CGTCTGGCCG CGCCGTCCCA ACGCGGTGCA TTGCGCCGGC
GTCGGCGAGG CGCTGGCCAA GATGCACCTC GCCGGCCTGG ATTTTCCGCA GCATCGCGCC
AATCCGCTGT CGGTGTCGGG CTGGCGGCCG CTGTTCGACC TCGCCGCCGC GCGCGCCGAC
GAGATCCAGC CAGGCTTGCG CGATTTCATC GCCGCCGAGC TCGATCACCT CGAAGGCCGC
TGGCCGCGGC ATCTGCCGAC TGGCGTGATC CATGCCGATC TGTTTCCGGA CAACGTTTTC
TTCATCGGCG ACACGCTGTC GGGACTGATC GACTTCCCGT TCTCCTGCAA CGACATCCTC
GCCTACGACG TGGCGATCTG CCTGAATGCC TGGTGCTTCG AGCCGGACCA CGCCTTCAAC
GTCACCAAGG CGCGGGCGCT GCTGAATGCG TATCAACGCG GCCGCGCCTT GAGCGAGGCC
GAGCAGACGG CGCTGCCGCT GCTGGCGCGC GGCGCGGCGA TGCGCTTCCT GCTGACCCGG
CTGGTCGATG TTCTCGACGT GCCGGAAGGC GCGCTGGTCA AGCCGAAGGA TCCGCTGGAA
TATTTCCGCA AGCTGCGCTT CCAGCAAAAT GTCGCCAGCA TTCGCGATTA TGGTGTCGAA
GCTGCGGGAG CGGTGGCGTG A
 
Protein sequence
MAVYTDVAAD DLADFLKSYE IGDLLSYKGI AEGVENTNFL LHTTRGSFIL TLYEKRVASE 
DLPYFLALMA HLAARGVSCP QPEKTRDGEI CGALSGRPAV IINFLEGVWP RRPNAVHCAG
VGEALAKMHL AGLDFPQHRA NPLSVSGWRP LFDLAAARAD EIQPGLRDFI AAELDHLEGR
WPRHLPTGVI HADLFPDNVF FIGDTLSGLI DFPFSCNDIL AYDVAICLNA WCFEPDHAFN
VTKARALLNA YQRGRALSEA EQTALPLLAR GAAMRFLLTR LVDVLDVPEG ALVKPKDPLE
YFRKLRFQQN VASIRDYGVE AAGAVA
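
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of translation table 11, which match the standard genetic code; it does not model the alternative start codons that table 11 allows, which is not needed here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the 10-base blocks separated by spaces are stripped before processing.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string
# (the same assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence from this record.
print(translate("ATGGCGGTCT ACACCGACGT"))  # MAVYTD
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and GC content fields in this record.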