Gene Mkms_5750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5750 
Symbol 
ID4610259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp259923 
End bp260948 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content65% 
IMG OID639789406 
Productextracellular solute-binding protein 
Protein accessionYP_935741 
Protein GI119855136 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.177697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.666773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCCC GTCGGAGTCG TTGGAGTCGA ATCACCGCAC TGGGGGCCGC CGTCGCGCTG 
GCAGTCGGTC TCACCGCCTG CTCGTCGTCG GAGGAAGAGG ACGGCCTGCT GATCTACAAC
GCGCAGCACG AGTCGCTGAC CAAGGAGTGG ATCGACGCCT TCACCAAGGA AACCGGCATC
AAGGTCACCT ACCGCCAAGG CGGCGACACC GAACTGGGCA ACCAGCTGAT CGCCGAAGGC
GACTCATCGC CCGCCGACGT GTTCCTCACC GAGAACTCCC CCGCCATGGC CGCCGTCGAG
AAGGACGGCC TATTCACCGA CGTCGACCAG GCCACCATCT CCCAGGTGCC ACCGCAATTC
CGCCCCACCA CCAGCAAGTG GACCGGCGTC GCCGCCCGCA CCACCGTGTT CGCCTACGAC
AAGACCAAGC TCACCGAGGC ACAGCTGCCC CGGTCGATCA TGGATCTGGA GAAGCCCGAG
TGGAAGGGCC GCTGGGGCGC CCCGCCGGTC AAGCCCGACT TCCAGGCCAT CGTCGCCGCA
ATGCTCGAAC TCACCGGTGA GCAGGCCACC AGCGCGTGGC TGTCCGCCAT GAAGGCGAAC
GCCGAGATCT ACTCCGACAA CATCGCCACC TTGCGCGCGG TCAACGACGG CCAGGTCGAG
GGCGGGATCA TCTACCACTA CTACTGGTTC CGCGATCAGT CGCAGACCAA GGAGATCTCG
GGCAACACCG CACTGCACTA CTTCCGCAAC CAAGACCCCG GCGCCTTCGT CTCGATCTCC
GGCGGCGGCA TCCTGAACTC CAGCAAGAAG AAGGAAGACG CCCAGAAGTT CCTCACCTAC
GTCACCAGCA AAGCTGGTCA GGAAGTGCTC GAGAACGGAA CCTCGTTCGA ATACCCCGTC
GCCAGCGGCG TGCCTGCCAA CCCCGCGCTG GTGCCGCTGG CCGGCCTGCA AGCACCCGCC
GTCAACCCGT CGAACCTCAA CGCGCAGAAG GTCACCGACC TGATGACGAA GGCGGGCCTG
CTCTAG
 
Protein sequence
MSARRSRWSR ITALGAAVAL AVGLTACSSS EEEDGLLIYN AQHESLTKEW IDAFTKETGI 
KVTYRQGGDT ELGNQLIAEG DSSPADVFLT ENSPAMAAVE KDGLFTDVDQ ATISQVPPQF
RPTTSKWTGV AARTTVFAYD KTKLTEAQLP RSIMDLEKPE WKGRWGAPPV KPDFQAIVAA
MLELTGEQAT SAWLSAMKAN AEIYSDNIAT LRAVNDGQVE GGIIYHYYWF RDQSQTKEIS
GNTALHYFRN QDPGAFVSIS GGGILNSSKK KEDAQKFLTY VTSKAGQEVL ENGTSFEYPV
ASGVPANPAL VPLAGLQAPA VNPSNLNAQK VTDLMTKAGL L