Gene Mmcs_5550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5550 
Symbol 
ID4114418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp121644 
End bp123587 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content69% 
IMG OID638034705 
Producthypothetical protein 
Protein accessionYP_642706 
Protein GI108802510 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.429574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCA CCGCGGTCAG CACCGCCCCG GTCACCTCCT GGTGGTACGG GCAAGGACGC 
GCGGTCGACG TCGTGCGGCG CCTGGAATCG CTGTACACGC TCTCCCCGCC CCCCACCGCC
CCCGCCGGGC TGGTCACCGC CCGCCGAGAC GAACTGACCG TCACCTTCGT CGCCGGCCTG
AAATCCCCAC TGCCGCACCC ATGGCGGCCC GCCGACTCCA GCAACGAGGC ACGGATCCCC
TGGGAGGCCG TGGATCCGGG AAACGACCCC GACCCGGACC GGCCGGTGTA CCTGGTGGTG
TTCGGCACCA CCGACGATGG CGGACTGATC GGACTGAATC TGGCGGCCTT TCAACGGATC
CGCTTCGACG GTGACACCGC CACCGCCACC GCCCTGGTCA GCCGGTGGGT GCTGGAGCTG
GTGTCCACCC ACCCCGACAT CACCATCGGT GTCACCGCCG ACGTATGGAA CGGCCCCTTC
ACCACGCGGG TCCAGCCCGT GGCCGCGGGC CGGGTCCCGC AGGTCGACGT CCTGGTGTGC
GGACCCGCCC TGACGTACAC CGACCGGTCG CAGATCGTGT CCAGTGCCGC CAGCAAAATT
GTCATCGACC TGGGCAAAGA CGCCGCCGTA GATGCCCGCT GGACCATCAC CTGCGGCCCG
GACCGGCTCG GGCAGATCAG CAGCGAACGA TCGGCCAGGC CGATGACAGC GACGCTGATC
GTGCCCAGCG CCGCCACCGT GGACCGCTGC GCAGCGCTGC TGACCGACAC CTCAGCCCAG
GCCGCGGCCA CTCCGCCCGA TCCCACTTAC AGCGCGGCGG CCCCCGAAGC CCCCATCACC
GAGCTGCCGA CGGCCGACCT CGACGACCTC GACGACCCCG CCACCGACCC TCATCTGCCG
ATCCCGTCCG ACGTCGCGAC ACTGCACGAC GACGGAATCG ACTTCTTCGC AACACAACCC
GCCGCGGCGC CCAACGTCGG ACCCACACCG CAGCTGCAAC CCGCCCACAA CGACCCCACC
GACGCGGACC AGGAACGCGA CTGGCCCACC GACGACCTGG ACGGGTCGAC AGCTGCGGCC
GATGTGGGAC GCGCTGAGCT CAACCCCGCG ATCGGAGAGG GCAGCACACC GGCGGCCGAC
GCCACCGCAG AACCTGAACC GGCAGCCGCG CCGGCAACAG CGGCCCCAAA ATCACTCCCC
AGCGACACCG CCGACGGCGC CGCCCCCGTG GTCGCAACCA TCTGGAACAG GATCCTCGGC
CAAGTCGCCC TCGACCCCCC GCACGCCACC CAGCAGCCGG GCCCGCGAGA GAAACGACTC
AACGAGCTGA CGGTGTTCCT GCAACACAAC CCGTGGGTGA GCGCCACCGA CATCGTGCGC
CACATCTACG GCGGTGTGGC CGCGGACAAG ACGGTGACCC AACAAGTTTC GCTGCTGCGC
GCACGGCTCG GCGCCGTCTT CGCCGGCGGC CCCAAAGCGC TGCCACCCAT GACCGAGGGC
GGCTACCACC TCGACAACGC CGTGCGCTCG GACTGGATGG AGTTCGAGCG CCTTGTCGAG
ATCCTGCCCG AGACCACGCC CACGCCGAAC CTCGTCGCCG CCATGGATCT GGTCACCGGC
CCACCACTGG GGGGCATCGC GCCCAAGGAA TGGACCTGGA CCAAGGATCT GCGTGACGAG
CTGCGTGATC GCGTCGCCGG CGCCGCTGTT GTCCTGGCGC GCCGCCACCA TTCGGCGAAG
GCCTACAGCG CTGCCGTCGA GACCGCTCGC AAGGGCCTGT GGTACGACAA CGCCCGCCAG
GATCTGTGGC AGATCGGTAT GCAAGCGGCC CTGGATGGGC ACGACAAAGA CGCCTACAAG
ACCCTGCGCA CCCAATACCT AGCCGCAGTT CCCGGATCTG AACGGGACCC CGAAGTATTC
GATCTGACGA AACGAGCAGG GTAG
 
Protein sequence
MTSTAVSTAP VTSWWYGQGR AVDVVRRLES LYTLSPPPTA PAGLVTARRD ELTVTFVAGL 
KSPLPHPWRP ADSSNEARIP WEAVDPGNDP DPDRPVYLVV FGTTDDGGLI GLNLAAFQRI
RFDGDTATAT ALVSRWVLEL VSTHPDITIG VTADVWNGPF TTRVQPVAAG RVPQVDVLVC
GPALTYTDRS QIVSSAASKI VIDLGKDAAV DARWTITCGP DRLGQISSER SARPMTATLI
VPSAATVDRC AALLTDTSAQ AAATPPDPTY SAAAPEAPIT ELPTADLDDL DDPATDPHLP
IPSDVATLHD DGIDFFATQP AAAPNVGPTP QLQPAHNDPT DADQERDWPT DDLDGSTAAA
DVGRAELNPA IGEGSTPAAD ATAEPEPAAA PATAAPKSLP SDTADGAAPV VATIWNRILG
QVALDPPHAT QQPGPREKRL NELTVFLQHN PWVSATDIVR HIYGGVAADK TVTQQVSLLR
ARLGAVFAGG PKALPPMTEG GYHLDNAVRS DWMEFERLVE ILPETTPTPN LVAAMDLVTG
PPLGGIAPKE WTWTKDLRDE LRDRVAGAAV VLARRHHSAK AYSAAVETAR KGLWYDNARQ
DLWQIGMQAA LDGHDKDAYK TLRTQYLAAV PGSERDPEVF DLTKRAG