Gene Mpe_A1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1146 
Symbol 
ID4785721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1225404 
End bp1226378 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content70% 
IMG OID640089709 
Producthomoserine kinase 
Protein accessionYP_001020342 
Protein GI124266338 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR00938] homoserine kinase, Neisseria type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.16142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.462949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCT TCACCGAAGT CGGCTTCGAC GAAGCCGCGG CCCTCGCGCA CAAGGTCGGC 
CTCGGCCCGC TGAAGGCGCT CAAGCCCATC AAGGCCGGGA TCGAGAACAC CAACTATTTC
CTCACCGCCG AGCGCGGCGA ATACGTGCTG ACGCTGTTCG AACGGCTGAG CGCCGAGCAG
CTGCCGTTCT ATCTGCACCT GATGAAGCAC CTGGCCGGAC GCGACATCCT GGTCCCGATG
CCGCAGTCCG ATGCCCACGG CGAGATCCTC CATGCGCTGT GCGGCAAGCC GGCCGCCATC
GTGGAGCGCC TGCGCGGCGG TCATGTGCTG GCCCCCAGCG CGGCGCATTG CCAGCAGGTC
GGCGCCATGC TGGCCCGCAT GCACGAGGCC GGCCGAGACT ACCCGTACCA CCAGCCCAAC
CTGCGAGGGC TGGCGTGGTG GGACGAGACG GTGCCGCAGA TCCTGACCTT CCTCGCACCG
CCGCAGCGCG CGCTGCTGGA GGACGAGCTG GCGTTCCAGC ACAGCGTCAG CGCGTCGGCC
TCCGACGCCG CCCTGCCGCG CGGACCGATC CATGCCGATC TGTTCCGCGA CAACGTGATG
TTCGACGAGG ACGCCGAGGG CCGCTCCCAG CTCACCGGCT TCTTCGACTT CTACTTCGCC
GGCGTCGACC GGCTCGCCTT CGACATCGCC GTCTGCCTGA ACGACTGGTG CATCGACCTC
GCCAGCGGCC GGCTGCTGGA AGACCGGGCC GCGGCCTTCG TGACGGCCTA CGCCGAGGTG
CGCCAACTCA CCGGCGACGA ACTGCGCCTG CTGCCCGCGC TGCTGCGCGC CGCGGCGCTG
CGCTTCTGGA TCTCGCGGCT GTGGGACGTG CACCTCCCGC GCGAGGCCAG CATGCTGGTG
CCGCACGACC CCACCCACTT CGAGCGCGTG CTGCGCGAGC GCATCGCGAC GCCCTGGCAT
CCCGCGAGCC GCTGA
 
Protein sequence
MAVFTEVGFD EAAALAHKVG LGPLKALKPI KAGIENTNYF LTAERGEYVL TLFERLSAEQ 
LPFYLHLMKH LAGRDILVPM PQSDAHGEIL HALCGKPAAI VERLRGGHVL APSAAHCQQV
GAMLARMHEA GRDYPYHQPN LRGLAWWDET VPQILTFLAP PQRALLEDEL AFQHSVSASA
SDAALPRGPI HADLFRDNVM FDEDAEGRSQ LTGFFDFYFA GVDRLAFDIA VCLNDWCIDL
ASGRLLEDRA AAFVTAYAEV RQLTGDELRL LPALLRAAAL RFWISRLWDV HLPREASMLV
PHDPTHFERV LRERIATPWH PASR