Gene Mkms_3143 details


Gene Information

Locus tag: Mkms_3143
Symbol:
ID: 4610978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3289961
End bp: 3291241
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 639792814
Product: threonine dehydratase
Protein accession: YP_939127
Protein GI: 119869175
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR02079] threonine dehydratase
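
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3289961
end_bp = 3291241
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so residues = codons - 1.
computed_protein_length = computed_length // 3 - 1

print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed 1281 bp and 426 aa.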


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.449239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.1577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCTG AACTGAGCCA GACCCCTCGT ACGTCGCCGA TCACCGCGGC TGACATCGAC 
GAGGCGGCCC AGCGGATCGT CGACGTCGTC GTGCGCACAC CGCTGCAGTT TTCGGAGCGA
CTGTCGGAGG TCACCGGCGC GCAGGTCTAC CTCAAGCGCG AAGACCTGCA GGCGGTGCGG
TCCTACAAGC TGCGCGGGGC GTTCAACCTG ATCTCGCAGC TCACGGAGGA GGAGATCGCC
GCGGGCGTGG TCTGCTCGTC CGCAGGCAAC CACGCCCAGG GGTTCGCCAT GGCGTGCCGG
ACGATGGGCA TCAAGGGCCG CGTCTACATC CCGGCCAAGA CTCCGAAGCA GAAGCGCGAC
CGCATCCGCT ACCACGGCCG CGAGTTCATC GAGCTGATCG CCGTCGGGAC CACCTACGAT
CTGGCGGCCG CGGCGGCGAT CGACGACGTC GCGCGCACCG GCGCCACCCT CGTCCCGCCC
TATGACGACG TGCGCACCAT GGCCGGGCAG GGCACCATCG CCGCCGAGAT CCTCGACGAC
CTCGACGCCG AACCGGATCT GGTGATCGTC CCGGTGGGCG GCGGCGGCTG TATCGCAGGC
ATCACCACCT ACCTTGCTGA ACGCACGTCC GGCACCGCGG TGCTGGGTGT GGAACCGGCC
GGGGCGGCGT CGATGATCGC GGCGCTGAGC GCCGGTGAAC CGGTCACCCT CGACCACGTC
GACCAGTTCG TCGACGGCGC GGCGGTCAAC CGCGCCGGAC GCCTGCCGTT CGCCGCGCTG
CAGGCCGCGG GCGACATGGT GTCGCTGACC ACCGTCGACG AGGGTGCGGT CTGCTCCGCG
ATGCTCGACC TGTACCAGAA CGAGGGCATC ATCGCCGAAC CCGCGGGCGC GCTGTCGGTC
GCCGGGCTGC TCGAGAACAA CGTCGAACCG GGCTCGACGG TCGTCTGCCT GATCTCCGGC
GGGAACAACG ACGTATCGCG CTACGGCGAG ATCCTCGAGC GCTCGTTGGT GCACCTCGGG
CTCAAGCACT ACTTCCTGGT CGACTTCCCG CAGGAGCCCG GTGCGCTGCG CCGGTTCCTC
GACGAGGTGC TCGGCCCGAA CGACGACATC ACGTTGTTCG AGTACGTCAA GCGCAACAAC
CGGGAGACGG GGGAGGCGCT CGTCGGGATC GAACTCGGCT CGGCCGCCGA CTTCGACGGT
CTGCTCACCC GGATGCGCTC CTCCGACATG CACGTCGAGG CGCTGGAACC GGATTCGCCG
GCCTACCGCT ACCTGCTCTG A
 
Protein sequence
MSAELSQTPR TSPITAADID EAAQRIVDVV VRTPLQFSER LSEVTGAQVY LKREDLQAVR 
SYKLRGAFNL ISQLTEEEIA AGVVCSSAGN HAQGFAMACR TMGIKGRVYI PAKTPKQKRD
RIRYHGREFI ELIAVGTTYD LAAAAAIDDV ARTGATLVPP YDDVRTMAGQ GTIAAEILDD
LDAEPDLVIV PVGGGGCIAG ITTYLAERTS GTAVLGVEPA GAASMIAALS AGEPVTLDHV
DQFVDGAAVN RAGRLPFAAL QAAGDMVSLT TVDEGAVCSA MLDLYQNEGI IAEPAGALSV
AGLLENNVEP GSTVVCLISG GNNDVSRYGE ILERSLVHLG LKHYFLVDFP QEPGALRRFL
DEVLGPNDDI TLFEYVKRNN RETGEALVGI ELGSAADFDG LLTRMRSSDM HVEALEPDSP
AYRYLL
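
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example and not the tool that produced this record. It assumes that translation table 11 assigns the same residues to codons as the standard code, and that an initiating GTG (as in this gene) is read as methionine. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the start-codon set is a deliberately reduced subset.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the start codons allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "GTGTCCGCTG AACTGAGCCA GACCCCTCGT ACGTCGCCGA TCACCGCGGC TGACATCGAC"
print(translate(first_line))  # MSAELSQTPRTSPITAADID
```

The output for the first 60 bases matches the first 20 residues of the protein sequence listed above.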