Gene Mkms_5216 details


Gene Information

Locus tag: Mkms_5216
Symbol:
ID: 4612899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5459039
End bp: 5460166
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 68%
IMG OID: 639794913
Product: extracellular solute-binding protein
Protein accession: YP_941195
Protein GI: 119871243
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
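
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(5459039, 5460166)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results agree with the Gene Length and Protein Length fields of this record.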


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.943706
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTT CACGGCTGCT GGCAGCCTGC ACATCCGCGC TTCTGCTCTC CGGTCTGGTG 
GCCTGCGCCC CGCCGGAGAA GGAGTCGGGC GGCGGCGAGA CCGATTCCGG CGTCCAGGTC
GGCGAGGCCA CCTCGGCGGC CGACTTCGGT GGCATGGACA AGCTGGTCGA GGCGGCCAAG
GCCGAAGGCG AACTCAACGT CATCGCGCTG CCCCCGGACT GGGCCAACTA CGGCACGATC
ATCGATACGT TCTCGAAGAA GTACGGCATC AAGGTGAACT CCGCGCAGCC GGACGCCGCC
AGCCAGGACG AGATCAACGC CGCCAACCAG CAGCGCGGTA AGTCCACTGC GCCCGACGTG
TTCGACCTCG GCCAGTCCGT GGCGCTGGCC AACACCGGCA TGTTCGCGCC GTACAAGGTC
GAGACGTTCG ACGACATTCC GGCCGAGTTC AAGGATCCCG ACGGCACCTG GGTCAACGAC
TACGGCGGCT ACATGTCCAT CGGCTACGAC TCCGCCAAGG TGCCGCCGAT CGCCAACGTC
GACGATCTGC TCAAGCCCGA GTACCGCGGC AAGGTCGCGC TCAACGGCGA CCCGACGCAG
GCCGGCGCCG CCTTCTCCGG CGTGATGATG GTGGCGCTGA GCCAGGGCGG TTCCGCCGAC
GACATCGCCC CGGGCGTCGA GTTCTTCCGC AAGCTCAAGG ACGCCGGCAA CTTCCTGCCC
GTCGACCCGA CCCCGGCCAC CATCGAATCC GGTCAGACCC CGGTCGTCAT CGACTGGGAC
TACCTCAACG CCGCGGAGAC CAAGAAGCTG CCGAGCTGGA AGGTGGTCGT GCCGCCCGGG
CAGGCCGTCG CCGGCTACTA CTACCAGGCC ATCAACAAGG ACGCCCCGCA TCCGGCCGCC
GCGCGCCTGT GGCAGGAGTT CCTCTACAGC GACGAGGGGC AGAACCTCTA CCTCGGCGGC
GGGGCACGCC CCGTACGGGC CGAACCGATG CAGCAGGCCG GCACGATCGA CAAGGCGGCG
TGGGATGCCC TGCCGCCGGT GAGCGGTGAG CCCGTCATCG TCTCGGTGGA GCAGAACAAG
AAGGCCACCG ATTACCTGGC CGGCAACTGG GCCAGCGCGA TCGGCTGA
 
Protein sequence
MTTSRLLAAC TSALLLSGLV ACAPPEKESG GGETDSGVQV GEATSAADFG GMDKLVEAAK 
AEGELNVIAL PPDWANYGTI IDTFSKKYGI KVNSAQPDAA SQDEINAANQ QRGKSTAPDV
FDLGQSVALA NTGMFAPYKV ETFDDIPAEF KDPDGTWVND YGGYMSIGYD SAKVPPIANV
DDLLKPEYRG KVALNGDPTQ AGAAFSGVMM VALSQGGSAD DIAPGVEFFR KLKDAGNFLP
VDPTPATIES GQTPVVIDWD YLNAAETKKL PSWKVVVPPG QAVAGYYYQA INKDAPHPAA
ARLWQEFLYS DEGQNLYLGG GARPVRAEPM QQAGTIDKAA WDALPPVSGE PVIVSVEQNK
KATDYLAGNW ASAIG
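
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build this record. The helper names are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation. Table 11 additionally permits alternative start codons, which this sketch does not handle. That does not matter here, because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCACTT CACGGCTGCT GGCAGCCTGC"))  # MTTSRLLAAC
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.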