Gene Mmcs_2038 details

Gene Information

Locus tag: Mmcs_2038
Symbol:
ID: 4110871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 2186735
End bp: 2188072
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 71%
IMG OID: 638031159
Product: deoxyribodipyrimidine photo-lyase type I
Protein accession: YP_639202
Protein GI: 108799005
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
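
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon that is not counted in the protein length. Both conventions are assumptions here, since the record does not state them. A minimal sketch of the arithmetic, with illustrative function names that are not part of the record:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2186735, 2188072)
print(length)                  # 1338
print(protein_length(length))  # 445
```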


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCACGC TGTTGTGGTT CCGCCGCGAT CTGCGCCTGC ACGATCTGCC CGCGCTGGTC 
GATGCGGCCC AGGGCGACGG TCAGGTGCTC GCCTGCTATG TGCTGGATCC GAGACTGCAC
AGGTCGGCGG GCCCGCGGCG GCTGCAGTAC CTGCACGACG CCCTGCGGGA TCTGCGCGAC
CAGCTCGACG GCCGGCTGCT GGTGACCCGC GGCCGCCCGG AGCAGCGGAT CCCGGCGCTG
GCGAAGAGCA TCGACGCGTC GGCGGTGTAC GTCTCCGGCG ACTTCACCCC GTTCGGACGG
CGACGCGATG ACGCCGTCCG GAAAGCTCTG GGCGAGGTTC CCCTCCAACC GTCCGGTTCG
CCCTATCTGG TGTCGCCGGG CCGGGTCACC AAGGGCGACG GGACGCCGTA CAAGGTGTTC
AGCCCGTTCT TCGACGCCTG GCGCAGACAC GGTTGGCGCG CGCCGGCACA GAGCGGGCCG
GATTCGGCGA CGTGGATCGA CCCGGCGGAT CTGACGGGCC GCGATCTGCG GACCGAGATC
CCCGACGACG GCGCCACCCT GGACATCCCC GCGGGTGAGC GCGCCGCGGC GCAGCACTGG
CGGGCGTTCG TCGCCGACGA ACTCGACGGT TACGCCGACA ACCGCAACCG CCCGGACCTC
GACGTCACCA GCCGGATGTC GGCGCACCTG AAGTTCGGGA CCATCCATCC CCGCACGATG
GTCGACGACC TCGGGCGGGG CAAAGGCGCC CAGGCGTATC TGCGGGAACT GGCGTTCCGC
GACTTCTACG CGGCGGTGCT CCACGAGTGG CCCCGCAGCG TGTGGTGGAA CTGGAACACC
GGGTTCGACG GCATCCGCGT CGACGAGGGT GCGGTGGCCG AGCAGCGCTT CGACGCGTGG
AAGCGCGGGC GCACCGGGTT CCCGATCGTC GACGCCGGGA TGCGTCAACT CGCCGGGATC
GGCTGGATGC ACAACCGGGT CCGGATGATC GTGGCCTCGT TCCTGGTCAA GGACCTGCAC
CTGCCGTGGC AGTGGGGGGC GCGCTGGTTC CTCGAGCAGC TGGTCGACGG CGATATGGCC
AACAACCAGC ACGGGTGGCA GTGGACCGCG GGGTGCGGCA CCGACGCCGC ACCGTTCTTC
CGGGTGTTCA ACCCCTCGAC GCAGGGCGCG AAGTTCGATC CGGACGGCAC GTACGTGCGG
CGGTGGGTGC CCGAACTGAA GGGGGTGGCC GACGTGCACA AGATGGGTGA CGATCGCCCG
GCGGACTACC CCGCACCCAT CGTCGACCAT GCGGCCGAAC GGGCCGAGGC GCTGCGCCGC
TACGCCGAGA TCTCCTAG
 
Protein sequence
MPTLLWFRRD LRLHDLPALV DAAQGDGQVL ACYVLDPRLH RSAGPRRLQY LHDALRDLRD 
QLDGRLLVTR GRPEQRIPAL AKSIDASAVY VSGDFTPFGR RRDDAVRKAL GEVPLQPSGS
PYLVSPGRVT KGDGTPYKVF SPFFDAWRRH GWRAPAQSGP DSATWIDPAD LTGRDLRTEI
PDDGATLDIP AGERAAAQHW RAFVADELDG YADNRNRPDL DVTSRMSAHL KFGTIHPRTM
VDDLGRGKGA QAYLRELAFR DFYAAVLHEW PRSVWWNWNT GFDGIRVDEG AVAEQRFDAW
KRGRTGFPIV DAGMRQLAGI GWMHNRVRMI VASFLVKDLH LPWQWGARWF LEQLVDGDMA
NNQHGWQWTA GCGTDAAPFF RVFNPSTQGA KFDPDGTYVR RWVPELKGVA DVHKMGDDRP
ADYPAPIVDH AAERAEALRR YAEIS
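
The protein sequence can be checked against the gene sequence by translating the latter with the codon assignments of translation table 11. The sketch below is only an illustration and not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Every codon is translated by plain table lookup, which suits this gene because it starts with ATG; an alternative start codon such as GTG or TTG would need special handling that is not shown. Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for 10-base blocks; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_row = "ATGCCCACGC TGTTGTGGTT CCGCCGCGAT CTGCGCCTGC ACGATCTGCC CGCGCTGGTC"
last_row = "TACGCCGAGA TCTCCTAG"
print(translate(first_row))  # MPTLLWFRRDLRLHDLPALV
print(translate(last_row))   # YAEIS
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.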