Gene Mmcs_4989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4989 
Symbol 
ID4113818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5275229 
End bp5276425 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content70% 
IMG OID638034147 
Productcysteine desulphurase-like protein 
Protein accessionYP_642149 
Protein GI108801952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATACG ACGTCGCCCG GGTGCGTGGA TTGCACCCCT CATTGGGCGA CGGTTGGGTG 
CACTTCGACG CCCAGCACGG GATGCTGCTG CCCGACGCGG TGGCCACGAC GGTCTCCACC
GCGTTCCGGG GGTCGATGTC GACCACCGTG GGCCCGCATC CGTCGGCGCG GCGCAGCGCC
GCGGTGCTGC ACGCGGCCCG CCAGGCGGTC GCCGACCTGG TGAACGGCGA TCCCCGCGGT
GTGGTGCTCG GCGCCGACCG CGCGCTGCTG CTGGCCTCGC TGGCCGATGC CGCGTCGTCG
CGGGTGGGTC TGGGCTACGA GGTGGTCGTC ACCCGACTGG ACGACGAGGC GAACATCGCG
CCGTGGCTGC GTGCGGCCAA CCGGTACGGC GCCAAGATCA AGTGGGCCGA GGTCGACATC
GAAACCGGTG AACTGCCCGC CTGGCAGTGG GAGGGGTTGA TCACCGGCCC GACCCGCCTG
GTGGCGATCA CATCGGCGTC CTCGACCATC GGCACGGTCA CCGATCTGCG GGCGGTGACC
AAACTCGTAC ACGAGGTGGG CGGTCTGGTC GTCGTCGACC ACTCCGCGGC GGCCCCGTAC
CGGCTGATCG ACCTCGAGGA GATCGACGCG GACGTCGTGG CGCTCAACGC GGTGGCATGG
GGCGGTCCGC CGATCGGCGC GCTGGTCTTC CGCGATCCGT CGACCATCGA ACAGTTCGGT
TCGGTGTCGC TGGATCCGTA TGCGACCGGG CCGGCCCGCC TGGAGGTCGG GGTGCACCAG
TTCGGCATGC TCGCCGGGGT GGTGGCCAGC ATCGAGTATC TGGCGGGTCT CGACGAGAAC
GCCACCGGCA CCCGGCGCGA GCGGCTGTCG CTGTCGATGC AGTCCGCCAC CTCGTACATG
AGCAGGCTCT TCGACTACCT GTTGATGTCG CTGCGCTCGC TACCGCTGGT GATGGTGATC
GGTCAGCCCG AGGTTCGCAT CCCGACGCTG AGTTTCGCGG TCCGCGACGT CCCGGCCGAG
AAGGTGGTGC AGCGGCTCGC CGACAACGGT GTGCTGGCCA TCGCGAACGC GAACTCCCGG
GTCCTCGACG TCATCGGCGT CGACGACATC GGCGGAGCCG TGACGATCGG GCTTGCGCAC
TACACCACCA CCGCCGAGGT CGACCAGCTG GTGCGCGCAC TGGCGTCGCT GGGCTGA
 
Protein sequence
MAYDVARVRG LHPSLGDGWV HFDAQHGMLL PDAVATTVST AFRGSMSTTV GPHPSARRSA 
AVLHAARQAV ADLVNGDPRG VVLGADRALL LASLADAASS RVGLGYEVVV TRLDDEANIA
PWLRAANRYG AKIKWAEVDI ETGELPAWQW EGLITGPTRL VAITSASSTI GTVTDLRAVT
KLVHEVGGLV VVDHSAAAPY RLIDLEEIDA DVVALNAVAW GGPPIGALVF RDPSTIEQFG
SVSLDPYATG PARLEVGVHQ FGMLAGVVAS IEYLAGLDEN ATGTRRERLS LSMQSATSYM
SRLFDYLLMS LRSLPLVMVI GQPEVRIPTL SFAVRDVPAE KVVQRLADNG VLAIANANSR
VLDVIGVDDI GGAVTIGLAH YTTTAEVDQL VRALASLG