Gene Mkms_5454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5454 
Symbol 
ID4613138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5687402 
End bp5688961 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content68% 
IMG OID639795148 
Producthypothetical protein 
Protein accessionYP_941429 
Protein GI119871477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.157039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0069948 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTATGTTC GAAGTATGTC AGGGCACGAC CTCCAGGCCG CGGTGACCGC GCTGCGTGCG 
GCCTTCGATG AGGTGGCCTC CTGTGATGTC GCCTTGTTGG ACCGCGCCGA ACTCGTTGCG
GCGCTGGATG AACTCGAGGC CCTGGGGTGC CGGCTGCCCA CGATGAGCCA CCGCTTGCTG
GCCCGTCTGC GGTCCGAGGC GACACCGCAG CAGATGGGTG CCAAGTCGTG GAGAGAGGTG
CTGACGGTCC GCTGGCGGAT CTCGGGCAGT GAGGCCAACC GACGACTCAC CGAGGCCGGT
CTGCTGGCGC CGCGCCAGGC ATTGACCGGT CCTTCGCTGC CGCCGGTGTT ACCCGCAACG
GCTGTGGCTC AAGCGCACGG GTTGATCAAC AGCGAACACG TCGAGGTCAT CCGCAGAGCG
GTCGACAAGT TGCCGGGGTT CGTCGACACC GTCACGCGGG AGCAGTTCGA GGTCACCCTG
GTCCGCACCG CGGTCGGTGT CGGCCCCAAG GAACTCAAGG ACACAGCCGA CCTCACGTTG
TTCCTGCTCG ATCAGGACGG TCCCGAGCCC GATGACACCG AACGGGCCCG CAAGCGTGGC
GTGTCGCGAT CGAAACAACG CCCCGACGGG ATGGTCGACC TGTCCGGGCA CCTGACGCCG
GAAGCGTGGG CGGTGTGGGA GGCGATCTTC GCGAAGTACG CGGCGCCGGG CATGTGCAAT
CCCGACGATC CCGAACCCTG CACGTCGGGG ACCCCGTCGC AGGAGCAGAT CGACAACGAC
CACCGCACCC TGGCCCAGCG CCAACACGAC GCGATGGTCG CGATCGGGCG CATCGCCTTG
ATGAGCGGCG AACTCGGCCA ACTGAATGGA CTGCCGGTCT CGATCATCAT CCGCACCACG
CTGGAGGATC TGGAGTCGCG GGCCGGGGTC GGCACCACCG GTGGCGGCAC CGTCGTGCCG
ATCGCCGATG TGATCCGGAT GGCCGGCCAC GCCAACCACT ACCTGGCGGT GTTCGACGGA
GCTACCGGAT CAGCGCTTGA TCTGTTCCGC GCCAAGCGGA CTGCCTCGGC TGCGCAACGC
ATCATGCTGA TCGCGCGCGA TGGCGGATGC ACCAAACCGT GTTGCACTGT CGGCGCCTAC
GGCTGCCAGG TGCATCATGT GGATGCCGAC TGGTCAGACG GCGGCAACAC CAACGTCGAC
GAACTCGGGC TCGCGTGCGG GGCGGACAAC CGCAGCGTCG ACAAAGACGG CGGCTGGTCC
ACCCGCATGA ACGATCAGTG CGAAGTCGAA TGGATCCCGC CGCCACGGCT GGACACCGGC
CAGGCCCGGC TCAACCACTA CCACCGGCCC GAACGCCTTC TCCGGCCACC CGACGACCCG
AGCGTTCCCG GCGATCCCGT TGTATGGGCG GAGCCGGCTG ACGCCAACGG CATCAGCGAC
GCCGAGCCGG CTGACGAGTG CGACGAGACT GTTCCCGCCG AGCCGGACTC GCCGACGCAG
TCCGCCGACA GCGCCGGTGA ACCCGGCGGC CCCGCACCTC CGGAAGGTCG GGCGGCATGA
 
Protein sequence
MYVRSMSGHD LQAAVTALRA AFDEVASCDV ALLDRAELVA ALDELEALGC RLPTMSHRLL 
ARLRSEATPQ QMGAKSWREV LTVRWRISGS EANRRLTEAG LLAPRQALTG PSLPPVLPAT
AVAQAHGLIN SEHVEVIRRA VDKLPGFVDT VTREQFEVTL VRTAVGVGPK ELKDTADLTL
FLLDQDGPEP DDTERARKRG VSRSKQRPDG MVDLSGHLTP EAWAVWEAIF AKYAAPGMCN
PDDPEPCTSG TPSQEQIDND HRTLAQRQHD AMVAIGRIAL MSGELGQLNG LPVSIIIRTT
LEDLESRAGV GTTGGGTVVP IADVIRMAGH ANHYLAVFDG ATGSALDLFR AKRTASAAQR
IMLIARDGGC TKPCCTVGAY GCQVHHVDAD WSDGGNTNVD ELGLACGADN RSVDKDGGWS
TRMNDQCEVE WIPPPRLDTG QARLNHYHRP ERLLRPPDDP SVPGDPVVWA EPADANGISD
AEPADECDET VPAEPDSPTQ SADSAGEPGG PAPPEGRAA