Gene Mkms_1466 details


Gene Information

Locus tag: Mkms_1466
Symbol:
ID: 4614117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1580119
End bp: 1581339
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 63%
IMG OID: 639791142
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_937468
Protein GI: 119867516
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
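
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1580119
END_BP = 1581339


def gene_length(start_bp, end_bp):
    """Gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_content(seq):
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields. The `gc_content` helper can be applied to the gene sequence listed under Sequence to compare against the reported GC content.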


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.333999
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGAAAG CCGCCTGTCC CCCGCCGTAC CGAGTCACGC AGTGGCTGCG GACGGCCGCG 
ATCGCGCTGG CCGTCGTGTT CGTGGCGGCC GCCTGTGCGC CCTCCCCGTC TGCTGCCCCT
CAAGCTGACG TCATCACTGA GAAGGGGACT CCGTTCGCCG ACCTGCTCGT GCCGAAACTG
CAGGCGTCGG TGACCGATGG GGCGATCGGC GTCTCCGTGG ACGCCCCGGT CACGGTCTCT
GCCGGTGACG GCGTTCTTGG CCAGATATCG CTGATTAACG AAGCAGGAGA GCTGGTCGAC
GGGCAGCTCA GCCCAGACGG TGTTTCTTGG GCCTCGGCCG AGCCGCTGGG TTACAACAAG
CAGTACACGT TGCGGGCGGA AGCGCTCGGA CTTGGTGGCG CCACCAGCAC GACAGCGACC
TTCGAGACGC ATTCCCCCGA CAATCTGACG ATGCCCTACG TCCTGCCCAA TGACGGCGAA
ACCGTAGGTA TCGGACAACC CATCGCTATT CGATTCGACG AGAACATCAC CGATCGGCTG
GCCGCACAGC GCGCCATTAC GGTGACCACC ACTCCTGCGG TGGAGGGCGC GTTCTACTGG
CTCAGCAATC GTGAAGTGCG TTGGCGCCCA GCCGAGTACT GGAAGCCCGG CACGACTGTC
GAGGTCGACG TCAACGCCTA CGGTGTCGAC TTCGGTGACG GGCTTTTCGG CCAGGACGAC
GTCACCACGC GGTTCACTGT CGGCGACCAG ATCATCGCGA CCGCTGATGA CGCCACCAAG
ACCATGACCG TGCGACGCAA CGGTGAAGTC GTCAAGACCA TGCCCATCTC CATGGGAAAG
GCCAAGACGC CCACCGACAA CGGCGTGTAC ATCATCGGCG ACCGCTACTC GTTCCTGGTG
ATGGATTCCT CGACCTACGG CGTCCCGGTC AATTCACCGG ACGGTTATCG AACCGAAGTC
GAATGGGCAA CCCAGATGTC CTACAGCGGC ATCTACGTCC ACTCCGCACC CTGGTCGGTG
GGCAGCCAAG GCGTCGCTAA TGTCAGCCAC GGGTGCCTCA ACGTCAATCC CGCCAACGCC
CGCTGGTTTT ACGACAACAC CCGACGCGGC GACATCGTGG AAGTCATCAA CACCACCGGG
CCTACCCTGT CGGGAACCGA CGGCCTCGGC GACTGGAACA TCCCCTGGCA ACAGTGGAAA
GCCGGCAACG CCAACCTGTA A
 
Protein sequence
MPKAACPPPY RVTQWLRTAA IALAVVFVAA ACAPSPSAAP QADVITEKGT PFADLLVPKL 
QASVTDGAIG VSVDAPVTVS AGDGVLGQIS LINEAGELVD GQLSPDGVSW ASAEPLGYNK
QYTLRAEALG LGGATSTTAT FETHSPDNLT MPYVLPNDGE TVGIGQPIAI RFDENITDRL
AAQRAITVTT TPAVEGAFYW LSNREVRWRP AEYWKPGTTV EVDVNAYGVD FGDGLFGQDD
VTTRFTVGDQ IIATADDATK TMTVRRNGEV VKTMPISMGK AKTPTDNGVY IIGDRYSFLV
MDSSTYGVPV NSPDGYRTEV EWATQMSYSG IYVHSAPWSV GSQGVANVSH GCLNVNPANA
RWFYDNTRRG DIVEVINTTG PTLSGTDGLG DWNIPWQQWK AGNANL
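
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it does not handle alternative start codons (not needed here, since the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten and last seven codons of the gene sequence listed above.
FIRST_CODONS = "ATGCCGAAAG CCGCCTGTCC CCCGCCGTAC"
LAST_CODONS = "GCCGGCAACG CCAACCTGTA A"

print(translate(FIRST_CODONS))
print(translate(LAST_CODONS))
```

The two printed fragments can be compared with the start (MPKAACPPPY) and end (AGNANL) of the protein sequence above. Passing the full gene sequence to `translate` would give the complete product under the same assumptions.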