Gene Mmcs_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2106 
Symbol 
ID4110939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2255515 
End bp2256855 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID638031227 
Producthypothetical protein 
Protein accessionYP_639270 
Protein GI108799073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.426457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGGAT TGCCTGGCGC GGTCGACCGC CTACTGGCCG CGGTCGCTGA GCTGCAGACC 
GCCTCCATCG ACGAGCTCTC CTACGAGCAG ATCGTGGCCG AACTGGACCG CATCAAAGCA
GCGGTGTGGG CGGTGCCCAG TGTGGAGCAC CGCCTGACCG CGCGGCTGAT GGACGCCGAC
CCCCATGAAC TCGGCGCCAC CTCTATCAAG GAGGTGCTGG CCAACCACCT ACGCATCTCC
CGCAAAGCCG CCGGCGAGCG GCTCACCGAC GCGCGCCAGT TGGGGCCGCG CTACACCCTG
ACCGGGGAGC GGGTGCAGAC CGAGTTGGCC CACACCGCCG CCGCGGTCGC CCGCGGCGAC
ATCGGCACCG CCCATGTGCG CATCATCCAG GACTTCGTCA GGAAACTTCC GGCATGGGTG
TCCTGGGAGC GCCGCGACCA CTACGAACGC GACCTGGTCG GCCACGCCAG CGCACTGCGG
CCCGAGGACC TCCGCAAGGT CGCCGACACC CTGCTGGGGT TCATCGATCA GGACGGCACC
GAACCCGACC ACCACACCCA GCAACGCCGC CGCGAGTTCA CCGTCGGCCG CCAGCAGGCC
GACGGGATGA GCCGGGTCTC GGGCTGGCTC ACCCCCGAAG CCCGCGCGCA CTGGGATGTC
ATCGCCGCCA AGTACGCCGC CCCTGGCACC AATCTGCCCC ACGACGACGC CCACACCGGC
CGCGACGACC GCACTACCGG CCAACGCCAC CACGACGCCC TCACCCGAGC AATGCGGGAC
CATGTGCAGT CGGGCGCCCT CGGCCAGGTC GCCGGCGTTC CCGCCAGCAT CGTCGCGACG
ATGACGCTCA GCGAGCTTGA ACGTGCCGCC GGGTGGGCGC ACACCGGCGG CGGCAACAAG
ATCCCCATCC GCGATCTGAT CCGCATGGCC GCCCACTCCC GGCACTACCT GGCGGTGTTC
GACGACCACA CCGAAGAAAT CCTGTATTTC GGCCGCGCCC GCCGCACGGC GTCGACCGCG
CAACGCCTGG CCCTGTTCGC CCGCGACAGG GGCTGCACCC ACCCGGGCTG CACCGTGCCG
TTCTATTGGA CCGAAGCCCA CCACACCCAC GACTACTCCC GCGGTGGGCG CACCGACATC
GACGACCTCA CCCTGGCCTG CCAACCCGCC AACCTGCTCA TCGAGAAAAC CGGCTGGACC
ACCCACCGGC CCGGCAACGG CCGCACCCAA TGGACCCCAC CCGCCGACCA CGACACCGGC
CAACCCCGCA TCAACAACCA CTTCCACCCC CACCGCTACC TCACCGACAA CGACGACGGT
CAAGACGACG AACCCGAATA A
 
Protein sequence
MDGLPGAVDR LLAAVAELQT ASIDELSYEQ IVAELDRIKA AVWAVPSVEH RLTARLMDAD 
PHELGATSIK EVLANHLRIS RKAAGERLTD ARQLGPRYTL TGERVQTELA HTAAAVARGD
IGTAHVRIIQ DFVRKLPAWV SWERRDHYER DLVGHASALR PEDLRKVADT LLGFIDQDGT
EPDHHTQQRR REFTVGRQQA DGMSRVSGWL TPEARAHWDV IAAKYAAPGT NLPHDDAHTG
RDDRTTGQRH HDALTRAMRD HVQSGALGQV AGVPASIVAT MTLSELERAA GWAHTGGGNK
IPIRDLIRMA AHSRHYLAVF DDHTEEILYF GRARRTASTA QRLALFARDR GCTHPGCTVP
FYWTEAHHTH DYSRGGRTDI DDLTLACQPA NLLIEKTGWT THRPGNGRTQ WTPPADHDTG
QPRINNHFHP HRYLTDNDDG QDDEPE