Gene Mmcs_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0037 
Symbol 
ID4108925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp47100 
End bp48971 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content69% 
IMG OID638029163 
Producthypothetical protein 
Protein accessionYP_637215 
Protein GI108797018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCGC ATGCGGTGCC GTCGGCGGCC ACGCACGGCG GCGGCTGGTT TACGAAACTT 
CCTGTGCTAC CGAATGCCGC CGGGGCCAGC GCGATCGCAC TTGCGGCGAC GCCTGCGAAC
AACGCGGGGG CGGCGGCTTT ACTCATTTTG TTTCGCATGA CTAAGGCTTT GCCGCCGTCA
CTGATCTTCA AACCACCCCG ATTGGTGACG CGTTCGACAA TGCCTCGGTT GGCCGGTTGC
GGTTTGGCGC AGGCCCGCCG CGGCTATGCC CGTCCGGAAG ACGGATGCGT TGAGCTGACG
AGGTGCTGGA CTGTGACGAC TATCGAGCAT CAGGATCCGG GGGCTTCGGC AGAGTGGCCG
ACGCAACCGC TTCCTGCTCC GGGACGCGCG CTCAATCCCG CGACCACCCA GGTACCCGGA
TGCATCGTCG GCGGTCGCTA TCGGCTTCTC GTCTGTCACG GCACCTCCGG GGTGCTCGAA
TTCTGGCAGG CCGTCGACCT GGCCGCGGGT CGTGCGGTGG CGGTGACGCT GGTGGATCCG
CGCTCGAGCC TGCCGATCGA ATGGGTCAAC GAAATTCTGT CGCTCACCGT CCGACTGCGC
GGGGTCGACG CACCCGGTAT CGCGACGATC CTGGACGTCC TCCATACCGG GCAATGCGGT
GTCGTCGTGG CGAATTGGGT GGCCGGGGGC AGTCTGCGGG AAGTGGCGGA CACCGACCCT
GCGCCGGAAG CCGCCGCCGC AGCGCTGGAA TCGCTCGCCA TGGCCGCCGA GGCCTCACAT
CGCGCCGGTA TGCACCTGAG TATCGACCAT CCGGCACGCG TCCGGATCAG TTCGGACGCC
CGCGCGGTGC TGGCGTTCCC GGCGACCCTG CCCGACGGCA CACCGCACAC CGACCTGCGC
GGTATCGGTG GTTGCCTCTA CGCGCTGCTC GCGAAATGTT GGCCCGAAGA CGTGTCGGAA
CCCGCGCAGT TGGCCGAGCT GCGACCTGGG ACACCGTATT TGGTGTCGGC GACGGCCTCC
GGTCTGGCGC GGCCCGAACC CGGGATCACG AGTGCCGCGA CAGTTCTGAC GATGCTTCGG
CAGGCCGCGC GCGACGGCCG CGACGACGGT GCCGACATCC GCGTCCTGCC GCCGCTGGAT
CCACCTCCAC CGGGGGTGTA TGCCGCGTTC CGCAACTTCG GACCGGACGA ACAGGCGCAG
GAAGCCCGCA AGAGCGTGCT GCGCGCGACG ATCGGCACCG CCGCGGCGGT CGTCGCGGCG
GGTGTGGTGA TGCTGGGCTC GAGCGTCAAC GAGGTGTTGC AGGCTCCTGA CGAGACCCCG
GCGATGGATG CCGGGCAACT GGGCTTGCAG GACGGCCCGA CGTCGGTCGC ACCGAAGGTC
CCCGAGGAGA CGGTCAAGGC GGCCGCGGCG GATGCGCCGG TCAAGCCGGT GAGGGCGGAG
GTGTTCTCGA CCGACGGCAG GCCGGACAAC CCCGGGGAAG CGGGTCGCGT GATCGACGGC
GACCCAGCCA CCGCGTGGTC GACAGACCGG TACTACGACG CCGATCCCTT CCCCGAGTTC
AAGGAGGGGC TGGGGTTGAT GCTGGAGTTG CCGGCCCCGA CGACGCTCGG ATCGGTCGAG
GTCGATCTCG GGAGCACCGG TACGGTCGTG CAGGTTCGGT CGGCGCCCGG ACAGTCCCCC
GCCGCGCTCG GCGACACCAC GGAGATCAGC GCGCCCACGC CGATGCAGCC GGGCCGCAAC
ACCATTGCGG TGCGCAGCCG TGAACCGGTC AAGCACGTCC TCGTGTGGGT GAGCACCCTG
GGCAGCACCG ACGGCCGCAA TCAAGCCGCG GTCTCCGAGA TCACGCTGCA TCCGCCCGCG
CCGCCGGCGT AG
 
Protein sequence
MAAHAVPSAA THGGGWFTKL PVLPNAAGAS AIALAATPAN NAGAAALLIL FRMTKALPPS 
LIFKPPRLVT RSTMPRLAGC GLAQARRGYA RPEDGCVELT RCWTVTTIEH QDPGASAEWP
TQPLPAPGRA LNPATTQVPG CIVGGRYRLL VCHGTSGVLE FWQAVDLAAG RAVAVTLVDP
RSSLPIEWVN EILSLTVRLR GVDAPGIATI LDVLHTGQCG VVVANWVAGG SLREVADTDP
APEAAAAALE SLAMAAEASH RAGMHLSIDH PARVRISSDA RAVLAFPATL PDGTPHTDLR
GIGGCLYALL AKCWPEDVSE PAQLAELRPG TPYLVSATAS GLARPEPGIT SAATVLTMLR
QAARDGRDDG ADIRVLPPLD PPPPGVYAAF RNFGPDEQAQ EARKSVLRAT IGTAAAVVAA
GVVMLGSSVN EVLQAPDETP AMDAGQLGLQ DGPTSVAPKV PEETVKAAAA DAPVKPVRAE
VFSTDGRPDN PGEAGRVIDG DPATAWSTDR YYDADPFPEF KEGLGLMLEL PAPTTLGSVE
VDLGSTGTVV QVRSAPGQSP AALGDTTEIS APTPMQPGRN TIAVRSREPV KHVLVWVSTL
GSTDGRNQAA VSEITLHPPA PPA