Gene RSc2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc2233 
SymbolthrB 
ID1221078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp2422915 
End bp2423919 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content68% 
IMG OID637238632 
Producthomoserine kinase 
Protein accessionNP_520354 
Protein GI17546952 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR00938] homoserine kinase, Neisseria type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTTT TCACCCCGGT CACCAACGCC GAGATCGCCC TCTGGCTGGA GCAATACGAC 
GTGGGCACGG TCCGCGCGCT GCGCGGCATT CCCTCGGGGA TCGAAAACAC CAACTTCTTC
CTGACCACGG AGAAGGACGG CGCCACGCAC GAGTACGTCG TCACGCTGTT CGAGCGGCTG
ACCAGCGAGC AACTGCCGTT CTACCTGTAC CTGATGCAGC ATCTGGCGCA GCACGGCATC
TGCGTGCCGG CGCCGATTCC CGGCCGCGAC GGCGCGATCC TGCGCCCGCT CAAGGGCAAG
CCGGCGACCA TCGTGACGCG CCTGCCCGGA CGCTCGAACC TGGCGCCCAC GACGAGCGAA
TGCGCCATCG TCGGCGACAT GCTGGCGCGC ATGCACCTGG CCGGCCGCGA CTACCCGCGG
CACCAGCCCA ACCTGCGCAG CCTGCCGTGG TGGAACGAAG TGGTGCCCGA CATCCAGCCC
TTCGTGCAGG GCGCCACGCG CGAGCTGCTG GTCGCCGAGC TGGCCCACCA GCAGCGCTTC
TTCGGCAGCG CCGACTATGC CGCCCTGCCC GAGGGCCCGT GCCACTGCGA CCTGTTCCGC
GACAACGTGC TGTTCGAGCC GGCCACTGAC AGCCAGCCCG AGCGCCTGGG CGGGTTCTTC
GATTTCTATT TCGCCGGCGT CGACAAATGG CTGTTCGACG TGGCCGTGAC CGTCAACGAC
TGGTGCGTCG ACCTCGCCAC GGGTGCGCTC GATGCCGAAC GGATGCGCGC CATGCTGCGC
GCCTATCACG CGGTGCGGCC TTTCACCGAC GCGGAGGCCC GTCACTGGCG GGACATGCTG
CGCGCCGCGG CCTATCGCTT CTGGGTATCG CGCCTGTGGG ACTTCCACCT GCCGCGCGAC
GCCGAACTGC TGCAGCCGCA TGATCCGACC CACTTCGAGC GCGTGCTGCG CGAACGGGTG
CGCGCCGAGG GGCTGACATT GGATATTCCC GAACCATGCA ACTGA
 
Protein sequence
MAVFTPVTNA EIALWLEQYD VGTVRALRGI PSGIENTNFF LTTEKDGATH EYVVTLFERL 
TSEQLPFYLY LMQHLAQHGI CVPAPIPGRD GAILRPLKGK PATIVTRLPG RSNLAPTTSE
CAIVGDMLAR MHLAGRDYPR HQPNLRSLPW WNEVVPDIQP FVQGATRELL VAELAHQQRF
FGSADYAALP EGPCHCDLFR DNVLFEPATD SQPERLGGFF DFYFAGVDKW LFDVAVTVND
WCVDLATGAL DAERMRAMLR AYHAVRPFTD AEARHWRDML RAAAYRFWVS RLWDFHLPRD
AELLQPHDPT HFERVLRERV RAEGLTLDIP EPCN