Gene Mkms_5501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5501 
Symbol 
ID4610322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp3679 
End bp6267 
Gene Length2589 bp 
Protein Length862 aa 
Translation table11 
GC content67% 
IMG OID639789166 
Producthypothetical protein 
Protein accessionYP_935501 
Protein GI119854896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.432777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0032369 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGTGC CCGCCCCGCT GCGCGCGATC GGCAACCTGC GGCTGACCCC GCACGGGGTC 
TACGCCGACT ATCTGCTCTC CGGACAGCCG TTCATCTTCC TCTCCGAAGA GTGGCAGGAC
CGGGTGGCTG CCGAGCACGC CGAGCTGTGG CGTGCGCTGC CGTCCGGATC GTCGATCAGT
GGCCTGACGG TGCCGGTGGC CCCGCGCGCC ACCGTCCGCA AAATGCTGTA CAGCCACCCC
GATTTGCGCC CCGGCGCGGC CGTGCCCGAG GGCGTGTCCG AGGCGGCGCG GCCCTGGGTG
CAGCACTGCC GCAGCTGGGA ACCCACCATC GCCGGGCACC GCGCTCGCCG CCGTATCTAC
TGGCTGAGCC TGCCCCTGGA TTACGGGCTG GCCGGGCGCA CCCCCAGCGG CACGTGGCGG
CGCATGGTCG ACGCCGCACT CGGCCGTGAC AAGGACACCG ACTCCTCCAT CGCCTACTAC
CGTGACCTGG CCGCCCAGAT GGTCGCCGCG CTGCCTGCGG TGTTCTTTCC CAAACCGGCC
ACGGTGGAAC AGATCTGGTG GCATTGGAAC TACATCGCCA GCCGCGGCGC CTGGGATGCG
CCGCTGCCGA CACAACCGTT CAATCCCGAC GCCACCCTGC CGGGGTCGGC GTTCACCCCG
GTCCATCTGG ACCCGGGTGC GGCTCAGCTG CGCGACCGGC GCTGGCGGGC AGCCCGCACC
GACGCCGACG TGTTCGTACG TACCTTCCGG GATCGCACCG ACGGTGTTGC GGACTCCTAT
CAGGCGCTGC TTCCCTTAGA CAGCTTCCCG GACAACGGCA TTGCCTGGCC GCGATCGACA
CTGTTCAAGG TTCTCGATGA CCTCACGACA CCGACCACGG TCTTGGACTG GACGATCAAC
ATCACTTTCA CCAGCGCGGA CGTGGCCGTG TCGACCGCAG AGAACGTCAT CGTCAACATT
CGCGACCAGT ACCGCCAGCG CGGCCGCCAC GCATCGAGTT CCGACGAGCT GCTGCGCAAG
CTGGCCTCCG GGCGGGAGCT GGCCTCAGAA CTCAAACGCG GTAGCGCCGA GCGTGGTGTG
AACGCCGCTA TCGTCATCGC CGCGGCCGCC GGTGACCCGG ATACGGTGAA CCGGGCCGTG
GCCGACGTCG CCCGCACCTA CCGCGGCCAG AACATCGGCT CGAAACGGTG GCGCGGCAGC
CAGCCCACAT TGTGGCGGGC GTTCGCCCCG GGCGGGGAGC GCCGCGCCGC CCTCGACGAG
TTCCGCAACC CGACCACCAC CAAGCGGTTT GCGCCGTTCG TCCCGCTGCT GGCGAGCAAG
CTCGGAAACA ACACCGGCGT CCCGTTGGGG ATGAACCTGA CCAGCCCGGG GCTGCGCGAC
GTCGTTCTCC TCGATGTCCT CAACGCGCCG GCCCGCGAGA ATCCGGCGAA TCTGGTGATC
TGCGGCTCCC CGGGCCGCGG CAAATCGCAC GCGACGAAGA ATTTGAGCCG CTCGTGGCTC
AAACTCGGCG CCGGTCTGCA TTTATTCGAT CCCACCGACG CCCGCGAACA CGAAACAGCG
TTGGCCGATT TCGACGATAA AGTCGTCATT GATGTCAGTC GCATGAATTT CAGTCTCGAT
GGATTGCGGG TTTTTCCTTA TAAAGAAGCC GCAGAACGAA CCATCGACCA TTTGCTACCG
CAATTGGGAT TGTCGCCATT GAGCCGGGGC GCTCAGCGGC TGTGGGGGCT GCTGGCCCCG
GAGTCACGCG AGGCCAACGG CATCGGCAGC ACCGCGCAGC TGATCAGATA TCTCCGCGAC
ATGCCCACAG CGCGGCGCAC CGACGCCGAC GAAGATCTGC TCATCGGGTT GGAGGGCCTG
GCCGCCCAAC GCCTGCTGCG GCCACTGTTT GATGAGTCTC TGCCCGTTCC CGACATCGCC
ACCACCCAAT GCGTGATCTG GAATTTCGCC GGACTCAAGC TGCCCACGGT CACCGAGGAA
TACCAGGCCC ACCTGCATCA GCAGACCACC CCGGGCCAGC GCGCCGCCCA AGCGCTCTAC
GGGCTGGGCG CCGAAGTGGC GCAGTCGATC TTTTTCGGCC GCCCCGATCA GCCCGACATG
CTGGTCGTCG AGGAGTGCGC AGCGTGGACC AACTCTCCGG GCGGGCAGAA GTGCGCGAAC
ACGATCATCC GCCAGGGCCG TAAGGCCTGG ACGGGGTTCT GCGGTATCAG CCAGCAGCCG
ATCAAAGACT TCGCCGTGCT GGAGGACGAG TTCATCGATC AGCGACTGTG CTTGGGGTTC
AAGCGATCTG ACATCGCCAA AGCAACCTTG CAGTGGTGTG ACCGCGACCT GGACCGCCAC
CCGGAGCTGC TGGCCAACTA CGTCAACAAC ACCAGCCCCG TGCAGCTGGT CGACCACGGC
GACGATGCGA TCGATGACCG CTACGGAAAG GTGATCCCCG GCCGCGAGGG CGAAGCGTGG
TTCCTCGACG AGTTCGGTGG CTTCGGCAAG GTGGCGCTGT TTGCAGCCCC GACCGCAGCA
CTGGCCGCCC GCTTCGACAC CAACCCCCAC CGAGCTCGGC AGCGCAGCCA GGCCACGCAG
CGATCATGA
 
Protein sequence
MSVPAPLRAI GNLRLTPHGV YADYLLSGQP FIFLSEEWQD RVAAEHAELW RALPSGSSIS 
GLTVPVAPRA TVRKMLYSHP DLRPGAAVPE GVSEAARPWV QHCRSWEPTI AGHRARRRIY
WLSLPLDYGL AGRTPSGTWR RMVDAALGRD KDTDSSIAYY RDLAAQMVAA LPAVFFPKPA
TVEQIWWHWN YIASRGAWDA PLPTQPFNPD ATLPGSAFTP VHLDPGAAQL RDRRWRAART
DADVFVRTFR DRTDGVADSY QALLPLDSFP DNGIAWPRST LFKVLDDLTT PTTVLDWTIN
ITFTSADVAV STAENVIVNI RDQYRQRGRH ASSSDELLRK LASGRELASE LKRGSAERGV
NAAIVIAAAA GDPDTVNRAV ADVARTYRGQ NIGSKRWRGS QPTLWRAFAP GGERRAALDE
FRNPTTTKRF APFVPLLASK LGNNTGVPLG MNLTSPGLRD VVLLDVLNAP ARENPANLVI
CGSPGRGKSH ATKNLSRSWL KLGAGLHLFD PTDAREHETA LADFDDKVVI DVSRMNFSLD
GLRVFPYKEA AERTIDHLLP QLGLSPLSRG AQRLWGLLAP ESREANGIGS TAQLIRYLRD
MPTARRTDAD EDLLIGLEGL AAQRLLRPLF DESLPVPDIA TTQCVIWNFA GLKLPTVTEE
YQAHLHQQTT PGQRAAQALY GLGAEVAQSI FFGRPDQPDM LVVEECAAWT NSPGGQKCAN
TIIRQGRKAW TGFCGISQQP IKDFAVLEDE FIDQRLCLGF KRSDIAKATL QWCDRDLDRH
PELLANYVNN TSPVQLVDHG DDAIDDRYGK VIPGREGEAW FLDEFGGFGK VALFAAPTAA
LAARFDTNPH RARQRSQATQ RS