Gene Mkms_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2968 
Symbol 
ID4610798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3098269 
End bp3099846 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content69% 
IMG OID639792634 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_938952 
Protein GI119869000 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.278599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGATTC GTACGCGACA CGCCGAGGAC AGCGTATGGC GTCGCACACT CGAGCCCACC 
CAGCGCCCGC CTCGTAGCCT CGAAGAGGTG ACCGGTACGC GTGACGACAC CCCACTGTGG
CTCTTCGCCG ACCAACTCGG CCCGGCCGTC CACGGCGGCG AGCACGCCCA CCGCGACGTG
CTGCTCATCG AGGCCGACCA CGCCCTGCGC AAGCGCCGCT ACCACCGCCA GAAACTGCAC
ATCGTGCTGT CCGCGCTGCG CCACGCCGAC CGCGACCTCG GCGACCGCGC CACCCTCCTC
CGCTCCGAGA CCTACACCGA CGCGCTCGAA CGCTACGGCC GGCCCGTCCT CGTCCACGAG
CCGACGTCCT TCGCCGCCGA GAGGTTCGTC CACCGCCTCA AACAGCGTGG CCTCGTCGCC
GACATCCTGC CCACCCCGAC ATTCGCGTTG CCGCGCAAGG ACTTCGAACA GTGGGCCGGG
AACCGCACCC GGTTCCGCAT GGAGGACTTC TACCGCGACC AACGCCGCCG CTTCGACGTC
CTGATGAGCG GGGCCGATCC CGTCGGCAAC CGGTGGAACT ACGACGAGGA GAACCGCCAC
TCCCCACCGA AGAAGCGGCG CACCCTCGAC GTGCCCGCGC CGTACAAGCC CCGCGAGGAC
GACATCGACG AAGAGGTCCG CCGCGACCTC GACCGGATGG ACCTCGACAC CGTCGGCGCC
GACGGCCCCC GCCTGTTCGC CGTCACACCC GCCGAAGCCA AACGCGCCCT CACCCGCTTC
ATCGAGCACC GCCTGCCGAC CTTCGGCGAC TACGAGGACG CGATGATGGG CGAGGACTGG
GCGATGTCGC ACTCACTGTT GTCGGTGCCG CTCAACCTCG GCGTGCTCCA CCCCCTCGAC
GCCGTGTACG CCGCCGAACA GGCCTACCGC GACGGGACCG CGCCGCTGGC TGCCGTCGAG
GGGTTCATCC GCCAGATCCT CGGCTGGCGC GAGTACATGT GGCATCTCTA CTGGCATTTC
GGCGAGCGGT ACGTCGACAG CAACGAACTC GACGCCAGGA CACCGCTTCC GGACTGGTGG
GCCGACCTCG ACGCCGACGC CGTGACCGCC GAATGCCTGC GCCACGCGCT GATGGGGCTT
CGTGACCGGG GCTGGACGCA CCACATCCAG CGGCTGATGA TCCTCGGCAG CCACGCCCTG
CAGCGCGGAT ACCACCCTCG CGAACTCACC GAGTGGTACG CCACCGCCTA CGTCGACGGC
TTCCGCTGGG TCATGCCCAC CAACGTCGTC GGGATGAGCC AGCACGCCGA CGGTGGCATG
CTCGCCACCA AGCCGTACAC CTCCGGCGGC GCCTACATCA ACAAGATGAG CGACCACTGC
GGCGACTGCG CCTACGACCC GCGTAAACGC CTCGGCGAGG ACGCCTGCCC GTTCACGGCC
GGCTACTGGG CCTTCGTGCA CCGCCACCGC GACCGGCTCG AGCGCAACAT GCGCACCCGC
CGGGCGGTAC AGGGGTTGAA CCGGCTCGGC GACCTCGAGG ACGTCCTCGC CCAGGAGGAC
AAGCGCACAC GGTTCTAG
 
Protein sequence
MRIRTRHAED SVWRRTLEPT QRPPRSLEEV TGTRDDTPLW LFADQLGPAV HGGEHAHRDV 
LLIEADHALR KRRYHRQKLH IVLSALRHAD RDLGDRATLL RSETYTDALE RYGRPVLVHE
PTSFAAERFV HRLKQRGLVA DILPTPTFAL PRKDFEQWAG NRTRFRMEDF YRDQRRRFDV
LMSGADPVGN RWNYDEENRH SPPKKRRTLD VPAPYKPRED DIDEEVRRDL DRMDLDTVGA
DGPRLFAVTP AEAKRALTRF IEHRLPTFGD YEDAMMGEDW AMSHSLLSVP LNLGVLHPLD
AVYAAEQAYR DGTAPLAAVE GFIRQILGWR EYMWHLYWHF GERYVDSNEL DARTPLPDWW
ADLDADAVTA ECLRHALMGL RDRGWTHHIQ RLMILGSHAL QRGYHPRELT EWYATAYVDG
FRWVMPTNVV GMSQHADGGM LATKPYTSGG AYINKMSDHC GDCAYDPRKR LGEDACPFTA
GYWAFVHRHR DRLERNMRTR RAVQGLNRLG DLEDVLAQED KRTRF