Gene Pmen_2012 details


Gene Information

Locus tag: Pmen_2012
Symbol:
ID: 5109903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas mendocina ymp
Kingdom: Bacteria
Replicon accession: NC_009439
Strand:
Start bp: 2222617
End bp: 2223645
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 71%
IMG OID: 640503253
Product: homoserine dehydrogenase
Protein accession: YP_001187505
Protein GI: 146307040
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
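
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2222617
END_BP = 2223645


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Number of amino acids: codon count minus one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1029 342
```

Both results match the listed Gene Length (1029 bp) and Protein Length (342 aa).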


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.148212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.080267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACGC TCAAGGTGGC CATCGCCGGC TTCGGCGGCG TCGGTCGCGC CACGGCCGAG 
CTGCTGCTGC AGCGCCGCGA CCGCTATCGC CAGGTCTACG GCGTGGACGT GCGCCTGGTC
GCGGTGTGTG GCTCGCGCGC CGGCCTGGCC GACGCCAACG GCCTGCAAGC CGGGCAGCTG
GCCGCGCTGC AGCCCGGTCT GACGGGGCCC GCGTTCATCG CGGCGAGCGG CGCGTCGGTG
CTCATCGAAG CCGGCCCCAG CGATTTTCGC AGCGGCGGGC CGGGCCTTGC CTACCTGCGC
GAGGCGCTGG CGGCCGGGCA GGACTGCATC GTCATAAGCA AAGGCGCACT GGTGCACAGC
GGCCCGCAGC TGCGCGAACT CGCGCGCACC TCGGGGGCCA TGCTGAAACT CAGCGGCGCC
GCGGCGGCGG CCCTGCCCAC GCTGGATCTG CTCCAGCACA GCCTGGCCGG CTGCCAGGTG
CTCGCCGTCG AGGGCATCCT CAACGCCACC ACCAACTACC TGCTCGATGC CATGCGCACC
CAGGGGCTCG GCTTCGACGC GGCGCTGCAT GAGGCGCAGG CCGGCGGCTT CGCCGAAGCG
GACACGCGCA ACGACACCGA AGGCTGGGAT ACCGCCTGCA AACTCCTGCT GCTGGCCAAC
TTCGGCCTGG GCGCCGATCT GACCATGGAA GATGTCGACG TCGAGGGCAT CCACGCGGTG
ACGGCGCAGC GTATCGACAG CTGGGCGAAG CAGGGGCTGG TGCCCAGGCT GGTCGGCCGT
CTCGAGCGGG TGAACGGCAC GCTGCGCGCC AGCGTCGGCA TCAAGACCTA CCCGCTGTCC
GACCCTTTCG CTCAGGTCAA TGGCAAGAAC AAGGCGATCC GCATCAGCAG CGATGCCATG
GGCGAAACCC TCGCCATCGG CTGCGGCGTC GAACCGCTCG CCACCGCGGC GGCCGCGCTC
AAGGACCTCG AACATATTCT CCAGGCCAGG GCCGCGCGCC CGGCCCTCTC TACGGACCAC
CCAGCATGA
 
Protein sequence
MQTLKVAIAG FGGVGRATAE LLLQRRDRYR QVYGVDVRLV AVCGSRAGLA DANGLQAGQL 
AALQPGLTGP AFIAASGASV LIEAGPSDFR SGGPGLAYLR EALAAGQDCI VISKGALVHS
GPQLRELART SGAMLKLSGA AAAALPTLDL LQHSLAGCQV LAVEGILNAT TNYLLDAMRT
QGLGFDAALH EAQAGGFAEA DTRNDTEGWD TACKLLLLAN FGLGADLTME DVDVEGIHAV
TAQRIDSWAK QGLVPRLVGR LERVNGTLRA SVGIKTYPLS DPFAQVNGKN KAIRISSDAM
GETLAIGCGV EPLATAAAAL KDLEHILQAR AARPALSTDH PA
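
The sequences above are displayed in blocks of ten characters, so they need to be cleaned before any check against the summary fields (length, GC content, translation table 11). The following is a minimal sketch of such checks, not the method used to produce this record. All function names are hypothetical. The start-codon test assumes an ATG start, as this gene has; translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq):
    """True if seq starts with ATG, has whole codons only, and ends in a stop codon.

    TAA, TAG and TGA are the stop codons of translation table 11.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Illustration on the first two displayed blocks only, not the full gene.
fragment = clean_sequence("ATGCAAACGC TCAAGGTGGC")
print(len(fragment))  # 20
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the listed gene length, and `gc_percent` with the listed GC content after rounding.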