Gene Mkms_5077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5077 
Symbol 
ID4612760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5312851 
End bp5314047 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content70% 
IMG OID639794774 
Productcysteine desulfurase family protein 
Protein accessionYP_941056 
Protein GI119871104 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATACG ACGTCGCCCG GGTGCGTGGA TTGCACCCCT CATTGGGCGA CGGTTGGGTG 
CACTTCGACG CCCAGCACGG GATGCTGCTG CCCGACGCGG TGGCCACGAC GGTCTCCACC
GCGTTCCGGG GGTCGATGTC GACCACCGTG GGCCCGCATC CGTCGGCGCG GCGCAGCGCC
GCGGTGCTGC ACGCGGCCCG CCAGGCGGTC GCCGACCTGG TGAACGGCGA TCCCCGCGGT
GTGGTGCTCG GCGCCGACCG CGCGCTGCTG CTGGCCTCGC TGGCCGATGC CGCGTCGTCG
CGGGTGGGTC TGGGCTACGA GGTGGTCGTC ACCCGACTGG ACGACGAGGC GAACATCGCG
CCGTGGCTGC GTGCGGCCAA CCGGTACGGC GCCAAGATCA AGTGGGCCGA GGTCGACATC
GAAACCGGTG AACTGCCCGC CTGGCAGTGG GAGGGGTTGA TCACCGGCCC GACCCGCCTG
GTGGCGATCA CATCGGCGTC CTCGACCATC GGCACGGTCA CCGATCTGCG GGCGGTGACC
AAACTCGTAC ACGAGGTGGG CGGTCTGGTC GTCGTCGACC ACTCCGCGGC GGCCCCGTAC
CGGCTGATCG ACCTCGAGGA GATCGACGCG GACGTCGTGG CGCTCAACGC GGTGGCATGG
GGCGGTCCGC CGATCGGCGC GCTGGTCTTC CGCGATCCGT CGACCATCGA ACAGTTCGGT
TCGGTGTCGC TGGATCCGTA TGCGACCGGG CCGGCCCGCC TGGAGGTCGG GGTGCACCAG
TTCGGCATGC TCGCCGGGGT GGTGGCCAGC ATCGAGTATC TGGCGGGTCT CGACGAGAAC
GCCACCGGCA CCCGGCGCGA GCGGCTGTCG CTGTCGATGC AGTCCGCCAC CTCGTACATG
AGCAGGCTCT TCGACTACCT GTTGATGTCG CTGCGCTCGC TACCGCTGGT GATGGTGATC
GGTCAGCCCG AGGTTCGCAT CCCGACGCTG AGTTTCGCGG TCCGCGACGT CCCGGCCGAG
AAGGTGGTGC AGCGGCTCGC CGACAACGGT GTGCTGGCCA TCGCGAACGC GAACTCCCGG
GTCCTCGACG TCATCGGCGT CGACGACATC GGCGGAGCCG TGACGATCGG GCTTGCGCAC
TACACCACCA CCGCCGAGGT CGACCAGCTG GTGCGCGCAC TGGCGTCGCT GGGCTGA
 
Protein sequence
MAYDVARVRG LHPSLGDGWV HFDAQHGMLL PDAVATTVST AFRGSMSTTV GPHPSARRSA 
AVLHAARQAV ADLVNGDPRG VVLGADRALL LASLADAASS RVGLGYEVVV TRLDDEANIA
PWLRAANRYG AKIKWAEVDI ETGELPAWQW EGLITGPTRL VAITSASSTI GTVTDLRAVT
KLVHEVGGLV VVDHSAAAPY RLIDLEEIDA DVVALNAVAW GGPPIGALVF RDPSTIEQFG
SVSLDPYATG PARLEVGVHQ FGMLAGVVAS IEYLAGLDEN ATGTRRERLS LSMQSATSYM
SRLFDYLLMS LRSLPLVMVI GQPEVRIPTL SFAVRDVPAE KVVQRLADNG VLAIANANSR
VLDVIGVDDI GGAVTIGLAH YTTTAEVDQL VRALASLG