Gene Mkms_0041 details

Gene Information

Locus tag: Mkms_0041
Symbol:
ID: 4615608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 48259
End bp: 50037
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 65%
IMG OID: 639789718
Product: hypothetical protein
Protein accession: YP_936050
Protein GI: 119866098
COG category:
COG ID:
TIGRFAM ID:
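
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 48259
end_bp = 50037
stated_gene_length = 1779      # bp
stated_protein_length = 592    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the final codon is a stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1779 0 592
print(gene_length == stated_gene_length)       # prints: True
print(protein_length == stated_protein_length) # prints: True
```

Under these assumptions the stated gene length and protein length agree with the coordinates.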


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00752533
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCACCG ACTTCGAAAA GTCATACCGC CTGTCAGACA GCCTCTTCCG TGAGGCCAAG 
GTCATCATCG AACGATCACA CGCGGCGGAG ATGATCAACG ATTTCCATGA CCAGCACCGC
GGCGCCGGTG GACAAGACTG CGCCGGAATC GCCTACACCA TCTCCGCCGT CCTCGTCGCC
GCCCTGGGAC TGTTGATGCT TGGGCGAACC CCGACCTACA AGGCCATCCA GCAGGCCATC
GGCGACCTCA CCCCGCGACA ACTCGCCGAA GTCGGTATGG CAGGACAAGA CACCTCCCGC
ATCTTCGCCG ACCGCGCCGA GCAACGCCGC GAGCGGACAC GCTTCGTTGC CTGGCTCAAC
CGCCGCCTCG CACCACTCGA TCCCAGCCCC GACCAACCAG CACGACGAAT CACCAACGCC
GAACACCGCC GCATCATCGC CCACCGCAGC ACCGAACAAC GCGCAGCGAG CGGAGAAGCC
ACCGAACGCC TGCGAACGGT CATGAATCGC ATCGTCTTCG GGTCCATCGT CGATCCGCAG
CCACCCGAGG CCGTCGGCGA CATCGTGGCC GACGAGACGA TCTTCGACCT CGCCGGCCCC
TCAGCTGGGC TGGGGATAAA ACCCGACAAA CAGCGCGGAG CCGCATACTT CGGCAGCTAC
TACTCCCGAG ACAAGCACTC CACCACCGTG TATCAAGATG CCAGCGGCGG CGGAAAGCGT
GGCTTCGGCA TCGGACTGAC CGCCGTGATA CGCGTTGGAC CACCGGACCA GCTGCACGCA
GTCGCGCCCG TCGTCATCGG GATGGACGTC CATCCACCCA CCTCCGGCAG CGTCGATGGA
CTCGACGTCG CTATCACTCA CGCCAAACGC AATGGACTCG ACACGCGGCG TGCGGGACGG
GCACGCCTGC CGTGTCTCAC TACGGATATG GGCTACAACC CCAAGGACGG CTTCGCACGC
CTGATGCTCG ACCACCAGTA CGCCGCCGTC GTGCGGTACC CACAACACTG GACACTCACC
GATACCGCGG CCAATCCACC CGGTGCCACC GCAGACGTCC CGCCCGGGCC CATCCAGTTC
GCCGGCTCGT TCTACTGTCC GGCCGCGGCC GACCTACTCG CCCAACACCA CATGCCCAAA
ACCCGCGACC TGCTGGCCGA CAACGGGTGG GAGGCGCACG ACCGCCGCCT GGCAAGTGTC
CTGCCCTTCC TGATGGGCCT GAACTCCCGG CCGCGACTGA GCCGACCACG TGGACGCCCA
CCCCTGGGAG TCGAACCACG ACTCGACGTG AAGGCGGAGC TGGTCTGCCC TGCCGTGCAG
CTGCGCGTCC AATGCCCACT CAAACCGGCG TCGATGACAC GCGCCGCCTT CGGCGCCCCG
ACAGCAGCGC CGACCTGGCA AGCAAGCGAT CGCCAATGCT GCGCCCAATC GATCGTCACC
GTCACCTTGA CACCCCGTCA GCTCAAAAAG GCCCAATGGG GACCCGTCGG CGCCAGTTGG
GAACACATCC TGTACTTCGA GGCAGCGCGC GCTCGAACCG AGCAGACATT CAGCATCCTC
AAGTCGGCCC ACGTCACCAA GCTCGTCGAC CTCAAGTGGG GTCCCCGACG CGAACCCATG
GTCAAGCTCC TGATGGCGCT CGCCGTAGCC AGCACTAATC ACCGCATCCA AAAGACCTAC
AGGTCACGCC AGGCTCGCGA AGAATCCATC GACGTCCGCC GACGCCAACT CCGCGACCAC
CTCGGACACG AACCCGCCAA GACACCGCCC CTGACCTAA
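
The record lists a GC content of 65% but does not say how it was computed. A minimal sketch, assuming it is the share of G and C bases over the gene sequence, is shown below. The helper name `gc_percent` is hypothetical, and only the first line of the sequence is used here, so the printed value describes that line alone, not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, kept in its 10-base grouping.
first_line = "ATGATCACCG ACTTCGAAAA GTCATACCGC CTGTCAGACA GCCTCTTCCG TGAGGCCAAG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines joined) to the same function gives the gene-wide figure to compare against the stated 65%.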
 
Protein sequence
MITDFEKSYR LSDSLFREAK VIIERSHAAE MINDFHDQHR GAGGQDCAGI AYTISAVLVA 
ALGLLMLGRT PTYKAIQQAI GDLTPRQLAE VGMAGQDTSR IFADRAEQRR ERTRFVAWLN
RRLAPLDPSP DQPARRITNA EHRRIIAHRS TEQRAASGEA TERLRTVMNR IVFGSIVDPQ
PPEAVGDIVA DETIFDLAGP SAGLGIKPDK QRGAAYFGSY YSRDKHSTTV YQDASGGGKR
GFGIGLTAVI RVGPPDQLHA VAPVVIGMDV HPPTSGSVDG LDVAITHAKR NGLDTRRAGR
ARLPCLTTDM GYNPKDGFAR LMLDHQYAAV VRYPQHWTLT DTAANPPGAT ADVPPGPIQF
AGSFYCPAAA DLLAQHHMPK TRDLLADNGW EAHDRRLASV LPFLMGLNSR PRLSRPRGRP
PLGVEPRLDV KAELVCPAVQ LRVQCPLKPA SMTRAAFGAP TAAPTWQASD RQCCAQSIVT
VTLTPRQLKK AQWGPVGASW EHILYFEAAR ARTEQTFSIL KSAHVTKLVD LKWGPRREPM
VKLLMALAVA STNHRIQKTY RSRQAREESI DVRRRQLRDH LGHEPAKTPP LT
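
The protein sequence can be checked against the gene sequence by translating codons. The sketch below assumes the gene is read in frame from its first base. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA to protein, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above. Each preceding line holds
# 60 bases (20 codons), so the last line also starts in frame.
first_line = "ATGATCACCG ACTTCGAAAA GTCATACCGC CTGTCAGACA GCCTCTTCCG TGAGGCCAAG"
last_line = "CTCGGACACG AACCCGCCAA GACACCGCCC CTGACCTAA"

print(translate(first_line))  # prints: MITDFEKSYRLSDSLFREAK
print(translate(last_line))   # prints: LGHEPAKTPPLT
```

Both results match the start and end of the protein sequence listed above, and the final TAA codon is the stop.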