Gene Mkms_4850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4850 
Symbol 
ID4616265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5078824 
End bp5080164 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content73% 
IMG OID639794541 
Producthypothetical protein 
Protein accessionYP_940830 
Protein GI119870878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGTG GCGGGTACAG TCGGCCCATG ACTGTCCTGC CCCGCGGTCC GCGCGCTCGG 
CGCGGCAATC GCAGGCCGGG CTGGATGCTG CTGACAGTGT TGCTGGTCCT GGCGATCGTG
GCAAGTTCTG CGCTGGTGTT CACCAACCGG GTGGAATTGC TCAAACTCGC TGTCATCCTG
GCGTTGTGGG CGGCGGTGGC CGCGGCGTTC GTCTCGGTCA TCTATCGACG GCAAAGCGAT
CACGACCAGG CCAAGGCGCG CGATCTCAAA CTCGTCTACG ACCTTCAACT CGACCGCGAG
ATCTCGGCGC GCCGCGAATA CGAGTTGACG GTGGAGACGC AGCTGCGCCG CGAGCTGGCG
AACGAACTGC AGGCCCAGGC CGCTGACGAG GTGGCCGCGC TGCGCGCAGA GCTCGCAGCG
CTGCGGACCA ACCTGGAGAT CCTCTTCGAC GCCGACCTGG CCCACCGGCC GGCGCTCGAA
CACGACCGCA CCACCGTGCG CGCCTACAGC GACTGGGCAC CGGACAACGA GACCACCGGC
CGGGTGAGCA GCAGCAGGCT CGACGAGATG CTGCGCGACG CCGACATCGA CGAGAGCGAG
TCGCGCACCG AGGAAAGCCC GATCATCGAC GTGCCCGCCG AACCGCAGCC GCCCGAACCG
GATCTCGTGC CGCCGCCGGG ATTCGGTGGC GCGCACCGCA GGCCCTCGGA AGCCGAACCG
CCCACCGAGG AGCCGCCACG TCGTCGGCGG CGCCGCGCCG AGGAGGCGCC CCCGGTGGCC
GAGGCACCGG CCTGGGCGGC CCCGACTCCC GAGCCGCCCC CAGAGCCAAC CCCAGAGCCC
GAGCCAAAGC CCGAGCCAGA GCCGGAGTCC GACGCCGACC TGGACACCGA TCCGGGCATC
TCCCGGCTGT CGGAGGAGCC GACCTCATGG GATCGGCCCG CCCCGACACC CGAACCCGAG
CCCGAACCCG AACCGGTCGG CGGTTGGCAG CCGGTGCCCG CCGAAGGGCA GTGGATCCCC
GCGGGCGCCC CGGGCAGTCA CTGGACGGCC CCGGTGGCCG AGGAGCCGCC GTCGGAGTAC
GTCGGACGAC GACGCGCACA GGAACCCCAA CCTGCGCCCG AGTCGCGCCA CGGCAAGCAC
TCCGCGCCGG GTGACGGCGA CGAGGACGGG CCGCTCGCGC CGCCGCCCGC ACCCATCCAG
GGGCCGCAGG AGACCGCCGA GGAGGCCCAC GCCCGTCGGC GCCGCAGCCT CGAGGACACC
GGCGGGCAGT CCGTCGCCGA ACTGCTCGCG CGGCTGCAGG CCAACCCCAC CGGCGGCGGT
CGTCGGCGCC GCGAGCCGTG A
 
Protein sequence
MTCGGYSRPM TVLPRGPRAR RGNRRPGWML LTVLLVLAIV ASSALVFTNR VELLKLAVIL 
ALWAAVAAAF VSVIYRRQSD HDQAKARDLK LVYDLQLDRE ISARREYELT VETQLRRELA
NELQAQAADE VAALRAELAA LRTNLEILFD ADLAHRPALE HDRTTVRAYS DWAPDNETTG
RVSSSRLDEM LRDADIDESE SRTEESPIID VPAEPQPPEP DLVPPPGFGG AHRRPSEAEP
PTEEPPRRRR RRAEEAPPVA EAPAWAAPTP EPPPEPTPEP EPKPEPEPES DADLDTDPGI
SRLSEEPTSW DRPAPTPEPE PEPEPVGGWQ PVPAEGQWIP AGAPGSHWTA PVAEEPPSEY
VGRRRAQEPQ PAPESRHGKH SAPGDGDEDG PLAPPPAPIQ GPQETAEEAH ARRRRSLEDT
GGQSVAELLA RLQANPTGGG RRRREP