Gene Mkms_2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2884 
Symbol 
ID4611073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3022132 
End bp3024978 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content70% 
IMG OID639792549 
Productglycine dehydrogenase 
Protein accessionYP_938868 
Protein GI119868916 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.198061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGACT CTCATCAGCC CCGTTTCGCC GACCGACACA TCGGTCCGGA CTCCGATGCC 
GTCGCGGTCA TGCTCGACAC CATCGGCGTG GCGACCCTCG ACGACCTCGC CGCCAAGGCG
CTGCCCGCGA ACATCCTCGA CGCCCTGTCG GCCGACGGGG TCGCGCCCGG CCTCGAATGC
CTGCCGGCCC CTGCGTCCGA GACCGAGGCG CTCGCGGAAC TCCGGGCGCT CGCGGAGTCG
AACACCGTCG CGGTGTCGAT GATCGGGCAG GGCTACTTCG ACACCCTGAC CCCGCCGGTG
CTGCGCCGCA ACATCCTGGA GAACCCGGCC TGGTACACCG CCTACACGCC GTACCAGCCC
GAGATCAGCC AGGGTCGGCT CGAGGCGCTG CTCAACTTCC AGACCATGGT GTCCGATCTG
ACCGGCCTCG AGGTCGCCAA CGCGTCGATG CTCGACGAGG CCACCGCCGC GGCCGAGGCC
ATGACGCTGA TGCAGCGCGC CGGCCGCAGC AAGTCGAACC GGTTGGCAGT CGACGCCGAT
CTGTACCGCC AGACCGCCGC GGTGCTCGCC ACCCGGGCCC GGCCCCTCGG GATCGAGATC
GTCACCGCCG ACCTGCGCCA CGGGCTGCCC GAGGGCGACT TCTTCGGCGT CATCGTCCAA
CTTCCCGGTG CGAGCGGAGC ACTCGTCGAC TGGGCGCCGC TGGTGGCCGA CGCGCACGAC
CGTGGTGCCC TCGTGGCAGT CGGCGCCGAT CTGCTGGCAC TGACGCTGGT CACCCCGCCG
GGTGAGTTCG GCGCCGATGT CTCCTTCGGC AGCACGCAGC GATTCGGTGT GCCGATGGGG
TTCGGCGGCC CGCATGCCGG TTATCTCGCG GTGCACACCA AACACGCCCG GCAGCTGCCC
GGCCGCCTCG TCGGGGTGTC CGTGGATGCG GACGGATCGC CGGCGTACCG ATTGGCGCTG
CAGACCCGCG AACAGCACAT CCGCCGGGAC AAGGCGACCA GCAACATCTG CACCGCCCAG
GTGCTGCTGG CGGTCATCGC GGCGATGTAC GCGAGCTATC ACGGCGCCGA AGGTCTGACC
TCGATCGCCC GCCGCGTGCA CGGGCACGCC CGCGCGGTCG CCACCGGACT CGCCGAGGCC
GGGGTCGACG TGGTGCACGA CGCATTCTTC GACACCGTGC TGGTCAAGGT GCCGGGCCGG
GCCGCCGAGA TCCGCGACGC CGCGAAGTCG CACGGCATCA ACATCTGGCT GGTCGACGCC
GATCACGTAT CGGTGTCCTG CGACGAGGCG ACCACCGCCG ACCACGTCGC CGCGGTGCTC
TCGGCGTTCG GCGCCACCCG CGGCGGCAAG CCCTTCGCCG GACCCGAGAT CGCCACGCGC
ACATCGGCAT TCCTCACCCA TCCGGCGTTC ACCCGGTACC GCACCGAAAC CGAGATGATG
CGCTACCTGC GGTCGTTGGC CGATAAGGAC ATCGCACTGG ACCGCAGCAT GATCCCGCTC
GGTTCCTGCA CCATGAAGCT CAACGCCGCC GCGGAGATGG AGCCGATCAC CTGGCCGGAG
TTCGCCAGGC AACACCCGTT CGGGCCCGAG AGCGATGCGC CGGGCCTGCG CAGGCTGATC
GCCGATCTGC AGACCTGGCT GACCGGTATC ACCGGCTACG ACGAGATCTC GCTGCAGCCC
AACGCCGGAT CCCAGGGCGA GTACGCCGGC CTGCTGGCGA TCAAGGCCTT CCACGACGCC
AACGGCGCAC CCGAACGCGA CGTGTGCCTG ATCCCGTCGA GCGCCCACGG CACCAACGCG
GCCTCGGCGG CGTTGGCCGG GATGCGGGTC GTTGTGGTGG GGTGCCGGCC GAACGGTGAC
GTCGACCTGG ACGACCTGCG CGCCAAGGTC ACCGAGAACG CCGAGCGCCT GGCCGCACTG
ATGATCACCT ATCCGTCGAC GCACGGTGTG TACGAGCACG ACGTCGCCGA GATCTGCGCG
GCCGTGCACG ATGCGGGCGG TCAGGTCTAT GTAGACGGCG CGAACCTCAA TGCGTTGGTC
GGACTGGCCC GGCCGGGCCG ATTCGGCGGC GACGTCAGCC ACCTCAACCT GCACAAGACG
TTCTGCATCC CGCACGGCGG CGGCGGTCCG GGTGTCGGGC CCGTCGCGGT GCGCGCCCAC
CTGGCGCCGT ACCTGCCGGG CCACCCGCTC GCCGACGAAC TCGGTGACGA CTACACCGTC
TCGGCCGCGC CCTACGGGTC GGCGTCGATC CTGCCGATCA CGTGGATGTA CATCCGCATG
ATGGGAGCGC CGGGTCTGCG GGCGGCGTCC CTGACGGCGA TCGCCTCGGC CAACTACGTC
GCCCGCCGGC TCGACGAGTA CTACCCGGTG CTCTACACGG GCGAGAACGG GATGGTGGCC
CACGAGTGCA TCCTGGACCT GCGGACCATC ACCAAACAGA CCGGCGTCAC CGTCGACGAC
GTGGCAAAGC GGTTGGCGGA CTTCGGCTTC CACGCACCGA CGATGAGCTT CCCAGTGGCG
GGGACGCTGA TGGTCGAGCC GACCGAAAGC GAATCGCTGA GCGAGGTCGA CGCGTTCTGT
GAGGCGATGA TCGCCATCCG GGCCGAGATC GACCAGGTCG GGTCGGGCGC CTGGCCGGTG
GACGACAACC CGTTGCGCGG TGCGCCGCAC ACCGCGGAGT CACTGCTGGT GACCGAGTGG
GAGCACCCGT ACACCCGCGA GCAGGCGGCC TACCCGCTGG GCAAGGGCTT CCGGCCCAAG
GTGTGGCCGC CGGTACGGCG GATCGACGGG GCCTACGGCG ATCGCAACCT GATGTGCTCG
TGCCCGCCGG TCGAGGCGTT CGCGTAA
 
Protein sequence
MLDSHQPRFA DRHIGPDSDA VAVMLDTIGV ATLDDLAAKA LPANILDALS ADGVAPGLEC 
LPAPASETEA LAELRALAES NTVAVSMIGQ GYFDTLTPPV LRRNILENPA WYTAYTPYQP
EISQGRLEAL LNFQTMVSDL TGLEVANASM LDEATAAAEA MTLMQRAGRS KSNRLAVDAD
LYRQTAAVLA TRARPLGIEI VTADLRHGLP EGDFFGVIVQ LPGASGALVD WAPLVADAHD
RGALVAVGAD LLALTLVTPP GEFGADVSFG STQRFGVPMG FGGPHAGYLA VHTKHARQLP
GRLVGVSVDA DGSPAYRLAL QTREQHIRRD KATSNICTAQ VLLAVIAAMY ASYHGAEGLT
SIARRVHGHA RAVATGLAEA GVDVVHDAFF DTVLVKVPGR AAEIRDAAKS HGINIWLVDA
DHVSVSCDEA TTADHVAAVL SAFGATRGGK PFAGPEIATR TSAFLTHPAF TRYRTETEMM
RYLRSLADKD IALDRSMIPL GSCTMKLNAA AEMEPITWPE FARQHPFGPE SDAPGLRRLI
ADLQTWLTGI TGYDEISLQP NAGSQGEYAG LLAIKAFHDA NGAPERDVCL IPSSAHGTNA
ASAALAGMRV VVVGCRPNGD VDLDDLRAKV TENAERLAAL MITYPSTHGV YEHDVAEICA
AVHDAGGQVY VDGANLNALV GLARPGRFGG DVSHLNLHKT FCIPHGGGGP GVGPVAVRAH
LAPYLPGHPL ADELGDDYTV SAAPYGSASI LPITWMYIRM MGAPGLRAAS LTAIASANYV
ARRLDEYYPV LYTGENGMVA HECILDLRTI TKQTGVTVDD VAKRLADFGF HAPTMSFPVA
GTLMVEPTES ESLSEVDAFC EAMIAIRAEI DQVGSGAWPV DDNPLRGAPH TAESLLVTEW
EHPYTREQAA YPLGKGFRPK VWPPVRRIDG AYGDRNLMCS CPPVEAFA