Gene Mkms_2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2789 
Symbol 
ID4615710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2908105 
End bp2909205 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID639792454 
Producthypothetical protein 
Protein accessionYP_938773 
Protein GI119868821 
COG category[R] General function prediction only 
COG ID[COG1537] Predicted RNA-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTCG AACGCTTCCG AACGCTGGCG GAGTCGAAGG GGCCGTACGC ATCGGTGTAC 
TTCGACGATT CGCACAACAC CGAGGATGCC GCTGCTCAGC GGGAACTCAG GTGGCGGGCG
GTCCGAGACG ACCTCGAAGA GCAGGCTGCG CCGGCCGAGT TGATCGAGGC GTTGCAGCCC
GCCGTGCTCG ACGCGCCGCC CGCCGTCGGC CGCAGCGGTC GAGGCCTGAT CGTCGGGGCC
GACGGTGTGC TGCTCAACGA ACACCTGATC CGGCCGATCG AGATTCCCGT TGTCCGCGTG
TCGTCGCTGC CGTACCTCGT ACCGGTGGTG GAGCACGGTG ATCAGCACTC GACCTATGTG
ATCGTCGCCG TCGACCACGC CGGCGCCGAC ATCGCTCTGC ATCGCGACCG ATCGGTCGTC
TCGGACACCG TCGACGGTGG CGGCTACCCC GTCCACAAGG CCGACAGCGC CGAGACGCCC
GGTTACGGTG ACCCGCAGCG GCGCAGCACC GAGGCAGGGC GCAAGAATCT GCGGGCCGTC
GCCGAGCGGT TGGCGACGCT GGTCGACGAG GCCTCCCCCG AGGTGGTATT CGTTGTCGGC
GAAGTGCAGT CACGTTCGGA TCTGGCGCCC ATGCTGGACG AGCGAGTGGC CGATCGGGTG
GTCGAGCTCG ACGTGGGCGC CCGGCACAGC GGTTTCGCAG ACGCCGACCT CCGCCACGCC
GTCGACCAGG AGTTCCTCCG GCGTCGACTC GCGACGATCG AGCATGCGGC AGAACAGTTC
TCGCAGGCGA TCGGTCAGGG TTCGGGGCTG GCCACCCAGG GTCTGCACGG CGTCTGCGCC
GCACTGCGGG CCGGCGCCGT GGAAACCCTC ATCATCGGAG ACATCGGCGA TGCGACCGTC
GTCGCCGGCG ACGACCTGCT GACACTCGCG CCGAACGAGA AGCTGCTGTC GGAGCTGGGC
ACCGCGCCGG CCCAGACGCT GCGGGCCGAC GAGGCGCTTC CCCTTGCCGC CGTACGCACC
GGTGCGGCGC TGGTCCGCAC CGACGAGCGA ATCGACCCGG ACGACGGAAT CGCCGCGGTG
CTGCGTTACG CCTTGGCCTG A
 
Protein sequence
MQFERFRTLA ESKGPYASVY FDDSHNTEDA AAQRELRWRA VRDDLEEQAA PAELIEALQP 
AVLDAPPAVG RSGRGLIVGA DGVLLNEHLI RPIEIPVVRV SSLPYLVPVV EHGDQHSTYV
IVAVDHAGAD IALHRDRSVV SDTVDGGGYP VHKADSAETP GYGDPQRRST EAGRKNLRAV
AERLATLVDE ASPEVVFVVG EVQSRSDLAP MLDERVADRV VELDVGARHS GFADADLRHA
VDQEFLRRRL ATIEHAAEQF SQAIGQGSGL ATQGLHGVCA ALRAGAVETL IIGDIGDATV
VAGDDLLTLA PNEKLLSELG TAPAQTLRAD EALPLAAVRT GAALVRTDER IDPDDGIAAV
LRYALA