Gene Mkms_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3970 
Symbol 
ID4611909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4182948 
End bp4184285 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content69% 
IMG OID639793653 
Producthomoserine dehydrogenase 
Protein accessionYP_939952 
Protein GI119870000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.781726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGTG ACAAATCGGT AGGCGACACA CCGGTGGGCG TAGCCGTCCT GGGCCTCGGC 
AATGTGGGCA GTGAAGTGGT CCACATCATC GAGCAGAGCG CGACCGACCT GGCGGCGCGT
GTCGGCGCCC CGCTCGTACT GCGCGGGGTT GGGGTGCGCC GGGTGGCCGG CGACCGCGGG
GTGCCGGTCG ACATGCTCAC CGACAACATC GAAGAACTGG TCTCGCGCGA GGACGTCGAC
ATCGTCGTCG AGGTGATGGG TCCAGTCGAA CCGGCCCGCA AGGCGATCCT CTCCGCGCTC
GAACAGGGCA AGTCGGTGGT GACCGCGAAC AAGGCGCTGA TGGCCCAGTC GACCGGTGAG
CTGGCGCAGG CCGCCGAGGC CGCCCGGGTC GACCTGTACT TCGAGGCCGC GGTGGCCGGT
GCCATCCCGG TGATCCGCCC GCTGACCCAG TCGCTGGCCG GCGACACGGT GCTGCGGGTG
GCCGGGATCG TCAACGGCAC AACGAATTAC ATCCTGTCGG CGATGAACGA CACCGGCGCC
GACTACGACA GTGCGCTGGC CGATGCCAGT GCGCTCGGCT ACGCGGAGGC GGATCCGACC
GCCGACGTGG AGGGTTTCGA CGCCGCGGCC AAGGCCGCGA TCCTGGCGTC CATCGCGTTC
CACACGCGGG TGACGGCCGA CGACGTCTAC CGCGAAGGCA TCACCAAGGT CACCGCCGAG
GACTTCGAAT CCGCCCGCTC GCTCGGGTGC ACCATCAAGT TGCTCGCCAT CTGTGAGCGG
CTGACCACCG ACGACGGCCA ACAGCGCGTC TCGGCCCGGG TCTACCCGGC GCTGGTTCCG
CTCGACCACC CGCTGGCATC GGTCAACGGC GCGTTCAACG CCGTGGTGGT CGAGGCCGAG
GCGGCCGGCC GCTTGATGTT CTACGGCCAG GGCGCCGGTG GCGCGCCGAC CGCATCGGCG
GTCATGGGCG ATCTCGTGAT GGCCGCGCGC AACCGCGTCC AGGGCGGGCG CGGACCGCGG
GAGTCGAAGT ACGCCAAGCT GCCGGTGTCG CCGATCGGGT TCATCCCCAC GCGGTACTAC
GTCAACATGA ACGTCGCCGA CCGTCCTGGC GTGTTGTCCA CGGTCGCAGC CGAATTCGCC
AGGCATGAGG TCAGCATCGC CGAGGTGCGC CAGGAGGGTG TGGTCGACGA GGGCGGACAG
CCCTGCGGTG CGCGCATCGT CGTCGTCACC CACCGTGCGA CCGATGCGGC GTTGTCCGAA
ACCGTCTCGG CCCTGGCCGA ACTCGACGTC GTGCAGAGCG TCAACAGCGT GCTGCGCATG
GAAGGAACAA GCGAATGA
 
Protein sequence
MNSDKSVGDT PVGVAVLGLG NVGSEVVHII EQSATDLAAR VGAPLVLRGV GVRRVAGDRG 
VPVDMLTDNI EELVSREDVD IVVEVMGPVE PARKAILSAL EQGKSVVTAN KALMAQSTGE
LAQAAEAARV DLYFEAAVAG AIPVIRPLTQ SLAGDTVLRV AGIVNGTTNY ILSAMNDTGA
DYDSALADAS ALGYAEADPT ADVEGFDAAA KAAILASIAF HTRVTADDVY REGITKVTAE
DFESARSLGC TIKLLAICER LTTDDGQQRV SARVYPALVP LDHPLASVNG AFNAVVVEAE
AAGRLMFYGQ GAGGAPTASA VMGDLVMAAR NRVQGGRGPR ESKYAKLPVS PIGFIPTRYY
VNMNVADRPG VLSTVAAEFA RHEVSIAEVR QEGVVDEGGQ PCGARIVVVT HRATDAALSE
TVSALAELDV VQSVNSVLRM EGTSE