Gene Mkms_5593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5593 
Symbol 
ID4610243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp103736 
End bp106285 
Gene Length2550 bp 
Protein Length849 aa 
Translation table11 
GC content69% 
IMG OID639789256 
Producthypothetical protein 
Protein accessionYP_935591 
Protein GI119854986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG TCCGCGCGCT ACCCCTGACC ACAGCACCGG CGGCGGGGGA GGCGCTGGAT 
TCCTGGTTAC ACGCCATCGC GGTGCGCCAC AACACGTCAC TCAACGCGCT GTACCACCAC
ATGGGCCTAG ACACCGTTGC CGATCTGCGC ACCCGCGTCA CCGCTGCCAC GGTAAGCGAG
GACGACGCGC AACGGATCGC CGCCGCAACA GGACTCACAC CGACACAGGT GCACGCGATG
ACCTTGGCGC ACTACCTAGC GAACGTGTCC TGGGCACAGG AAGCGACAGC GGATCTGACC
GCAACCACGC CGCGCCACCA CGGCGGGTGG CGGTTCTGTC CGCACTGCCT CGGCGCGAGC
GGGGGAACGT GGCTGCTGCA GTGGCGATTG ATGTGGAGCT TCGCCTGCCT GCAGCATCGA
TGCCTGCTGG CCGAACACTG CCCGCGCTGC GGCCGGCGCC AACGCGCGCC CCAACCCCTC
AACGCCGCCC CACCGCGTCC AGTGCACTGT GCTCACCCCA GCCCCACGGC TGCCGGCAGA
AACCGGCTCC GCTGCGACGC CGACCTGGCC GACACCCCAG TCACCATGCT GGAGGCTGAT
CATCCGACAC TGCTCGCCCA ACAGGTGCTC CTCGACCTGG TCGCCGCCGA GAATGGCCAG
TTCGGGCTCT ACGCGCAGCA TCCGATGCCA GTTCGGGACG TGCTGGCCGA TATCCGCATC
CTTGGCCGCG GCATCCTGTC GGCCACCGCC GGTCGCTACC TCGAGGACCT GCTGCCCGCC
GAGTTGGCCG CACAATATCG ACACCAGCGG CAGGCGGCCA ACGCGCAGAT GTCATCGTCG
GTCCTGACGT GCGCCGCGGC GATCACCGCG GCCGTCACCG TGCTCAGCAG GCCCGACTTC
GGGTCGGCCG CGGCGCCGCT GGCAGCGCTG CCTGAAGAGG TCAGCCGCTA CGTGCTGCAT
CAATGGACCT ATCTCGACCT GCCGGCCGGG CCGTCGGCGA GCCCGGTGCT TCAGGCCCTG
CATTTCACCG CACTCGGTGA CCGCCTGAGC CCCGCTGCGC AGTTGCAGTG CCGCCTCGGC
AGTGCCTTCC CAAGGCTGCA CGCCACCGAC ACGCAGCGGC AGCAGCAGCT GGTTCGGGCG
CTGCCCACAG CATTGTGGAC GGGCTGGGCG TTGCGGCTGA GCCCGCCGGC GTTGGCCCAT
TCCAGCTCGC GGGTGGCGCT GGCTGCCGCA GTGCTGCTGG TCGGCAGCGA CCTGGACATC
GCCGAGGCCA CCGCGCATCT CGGCGGCGAG CTGGCCCGCC TCAACGCCAT CTACCTGCTG
TGGCGGCTCA AAGAATCGCC GCACTGGCCC GCGATCCGCG ACGCCCTCGC TGCCCTATCG
GACTACCTCA GCGAGGAGAG TGCACCCATC GACTACGCGC GGCGACGGCG CCTGGACTAC
CGCGGCCTGC TCGCCCAAGA CGAGTGGGAC CGCATCTGCG GAAGCCTCAG CCACCAAAGA
TGCTCGGCAG CCCACCCACG CAGCTACCTG CAGCACCAGC TCCGCGGCAC CACCAAACCC
CGCACCGGCC TGGACGGCCC CGACACCGAG CTCAGGGAAT TCCCGGGCCG CCTCACCCCG
CAGCTCAGCC GCGCCCTACA CCGTCACGCA ACGAGGTTCC TTGCAACCCA AGGCATTCAG
GACGAGCCGG TGGTGTGGGA ACCCCCGACG AATATCGTCG CCGGCCTATC GTTACCCGGT
GTCGAGCTCG GCGATATCGA GATCGCGGAG CTACACCGAC TAATCCGGGT TGACCGCCGC
ACCATCGCCG CCACCAGCCA GCAGTTAGGA GTCAGCGGCG ACCTCATTCG CTATGCACTC
GAACAGCATC CCGCCCCGGC GCTGCCCCCG CGCCCGCGTG CCCCGCGCAT CGGCGCCCGG
CGGCCCGGCA GGGTGTATCA GCAAGCCGCC GAAGCGCTGC CGCGGGAACG CCTCGTCGCC
CTGTACAGCC AGGAACAACG CAGCCTCGCC GACATCGCCG CCCTGACCGG GTTCAGCAAA
CGCACCCTCT CTCGATTGAT GCACGACTAC ACGATTCCCC TGCGGCCGTC CGGGCCACGG
CCCACTGAGC CGGTCGACCC CGACTGGCTC TACACCGAAT ACATCGTCCG CCAACGCTCG
TGTGCAGACC TCGCGCGAGA ACTCCAAATC CGCTGCGGAG CGGTAGCCGC CCAAGCCGCC
ACGCTCGGAA TGCCGGTACG CACCGTGGCC CGACACACCG ACGCCGAACT CCGCGACAAC
CCAAAGATCC CGGCCATCCT CATCCCGGCC CTCGTCGGCC ACGGCGGCTG GGAACGGCTA
CAACGCTTCG CCGTGATCAC CCAGTTCACC ACGCTCACCG ACGCCGGGAA ACACCTCGGC
AGGGGAGTAG CCGTCACCGG CCACCACATC GCCCGCCTGG AAAAAGACTT CCGCGCACGC
CTGCTCACCC AGAAACCGCT GCGCTGCACC GACTTCGGCA AAGACGTCCT CGCGGCGGTG
CACCGACTCG CCGAACTCGG CGGACCCTGA
 
Protein sequence
MTTVRALPLT TAPAAGEALD SWLHAIAVRH NTSLNALYHH MGLDTVADLR TRVTAATVSE 
DDAQRIAAAT GLTPTQVHAM TLAHYLANVS WAQEATADLT ATTPRHHGGW RFCPHCLGAS
GGTWLLQWRL MWSFACLQHR CLLAEHCPRC GRRQRAPQPL NAAPPRPVHC AHPSPTAAGR
NRLRCDADLA DTPVTMLEAD HPTLLAQQVL LDLVAAENGQ FGLYAQHPMP VRDVLADIRI
LGRGILSATA GRYLEDLLPA ELAAQYRHQR QAANAQMSSS VLTCAAAITA AVTVLSRPDF
GSAAAPLAAL PEEVSRYVLH QWTYLDLPAG PSASPVLQAL HFTALGDRLS PAAQLQCRLG
SAFPRLHATD TQRQQQLVRA LPTALWTGWA LRLSPPALAH SSSRVALAAA VLLVGSDLDI
AEATAHLGGE LARLNAIYLL WRLKESPHWP AIRDALAALS DYLSEESAPI DYARRRRLDY
RGLLAQDEWD RICGSLSHQR CSAAHPRSYL QHQLRGTTKP RTGLDGPDTE LREFPGRLTP
QLSRALHRHA TRFLATQGIQ DEPVVWEPPT NIVAGLSLPG VELGDIEIAE LHRLIRVDRR
TIAATSQQLG VSGDLIRYAL EQHPAPALPP RPRAPRIGAR RPGRVYQQAA EALPRERLVA
LYSQEQRSLA DIAALTGFSK RTLSRLMHDY TIPLRPSGPR PTEPVDPDWL YTEYIVRQRS
CADLARELQI RCGAVAAQAA TLGMPVRTVA RHTDAELRDN PKIPAILIPA LVGHGGWERL
QRFAVITQFT TLTDAGKHLG RGVAVTGHHI ARLEKDFRAR LLTQKPLRCT DFGKDVLAAV
HRLAELGGP