Gene Mjls_2021 details


Gene Information

Locus tag: Mjls_2021
Symbol:
ID: 4877742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 2128014
End bp: 2129351
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 71%
IMG OID: 640139319
Product: deoxyribodipyrimidine photo-lyase type I
Protein accession: YP_001070299
Protein GI: 126434608
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
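
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (the page does not state this), and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 2128014
END_BP = 2129351
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, ASSUMING its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 2129351 - 2128014 + 1 = 1338 bp
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
# 1338 / 3 = 446 codons, minus one stop codon = 445 aa
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record, so the reported gene length matches the coordinate span and the reported protein length matches the codon count.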


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.128536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACGC TGTTGTGGTT CCGCCGCGAT CTGCGCCTGC ACGATCTGCC CGCGCTGGTC 
GATGCGGCCC AGGGCGACGG TCAGGTGCTC GCCTGCTATG TGCTGGATCC GAGACTGCAC
AGGTCGGCGG GCCCGCGGCG GCTGCAGTAC CTGCACGACG CCCTGCGGGA TCTGCGCGAC
CAGCTCGACG GCCGGCTGCT GGTGACCCGC GGCCGCCCGG AGCAGCGGAT CCCGGCGCTG
GCGAAGAGCA TCGACGCGTC GGCGGTGTAC GTCTCCGGCG ACTTCACCCC GTTCGGACGG
CGACGCGATG ACGCCGTCCG GAAAGCTCTG GGCGAGGTTC CCCTCCAACC GTCCGGTTCG
CCCTATCTGG TGTCGCCGGG CCGGGTCACC AAGGGCGACG GGACGCCGTA CAAGGTGTTC
AGCCCGTTCT TCGACGCCTG GCGCAGACAC GGTTGGCGCG CGCCGGCACA GAGCGGGCCG
GATTCGGCGA CGTGGATCGA CCCGGCGGAT CTGACGGGCC GCGATCTGCG GACCGAGATC
CCCGACGACG GCGCCACCCT GGACATCCCC GCGGGTGAGC GCGCCGCGGC GCAGCACTGG
CGGGCGTTCG TCGCCGACGA ACTCGACGGT TACGCCGACA ACCGCAACCG CCCGGACCTC
GACGTCACCA GCCGGATGTC GGCGCACCTG AAGTTCGGGA CCATCCATCC CCGCACGATG
GTCGACGACC TCGGGCGGGG CAAAGGCGCC CAGGCGTATC TGCGGGAACT GGCGTTCCGC
GACTTCTACG CGGCGGTGCT CCACGAGTGG CCCCGCAGCG TGTGGTGGAA CTGGAACACC
GGGTTCGACG GCATCCGCGT CGACGAGGGT GCGGTGGCCG AGCAGCGCTT CGACGCGTGG
AAGCGCGGGC GCACCGGGTT CCCGATCGTC GACGCCGGGA TGCGTCAACT CGCCGGGATC
GGCTGGATGC ACAACCGGGT CCGGATGATC GTGGCCTCGT TCCTGGTCAA GGACCTGCAC
CTGCCGTGGC AGTGGGGGGC GCGCTGGTTC CTCGAGCAGC TGGTCGACGG CGATATGGCC
AACAACCAGC ACGGGTGGCA GTGGACCGCG GGGTGCGGCA CCGACGCCGC ACCGTTCTTC
CGGGTGTTCA ACCCCTCGAC GCAGGGCGCG AAGTTCGATC CGGACGGCAC GTACGTGCGG
CGGTGGGTGC CCGAACTGAA GGGGGTGGCC GACGTGCACA AGATGGGTGA CGATCGCCCG
GCGGACTACC CCGCACCCAT CGTCGACCAT GCGGCCGAAC GGGCCGAGGC GCTGCGCCGC
TACGCCGAGA TCTCCTAG
 
Protein sequence
MPTLLWFRRD LRLHDLPALV DAAQGDGQVL ACYVLDPRLH RSAGPRRLQY LHDALRDLRD 
QLDGRLLVTR GRPEQRIPAL AKSIDASAVY VSGDFTPFGR RRDDAVRKAL GEVPLQPSGS
PYLVSPGRVT KGDGTPYKVF SPFFDAWRRH GWRAPAQSGP DSATWIDPAD LTGRDLRTEI
PDDGATLDIP AGERAAAQHW RAFVADELDG YADNRNRPDL DVTSRMSAHL KFGTIHPRTM
VDDLGRGKGA QAYLRELAFR DFYAAVLHEW PRSVWWNWNT GFDGIRVDEG AVAEQRFDAW
KRGRTGFPIV DAGMRQLAGI GWMHNRVRMI VASFLVKDLH LPWQWGARWF LEQLVDGDMA
NNQHGWQWTA GCGTDAAPFF RVFNPSTQGA KFDPDGTYVR RWVPELKGVA DVHKMGDDRP
ADYPAPIVDH AAERAEALRR YAEIS
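
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. What follows is a minimal sketch, not the method used by the database: it strips the 10-base grouping, computes a GC fraction, and translates codons until the first stop. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are embedded here to keep the example short.

```python
# Codon table built from the conventional TCAG ordering. Table 11 shares
# these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 of a cleaned sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGCCCACGC TGTTGTGGTT CCGCCGCGAT CTGCGCCTGC ACGATCTGCC CGCGCTGGTC"
LAST_LINE = "TACGCCGAGA TCTCCTAG"

print(translate(clean(FIRST_LINE)))  # MPTLLWFRRDLRLHDLPALV
print(translate(clean(LAST_LINE)))   # YAEIS
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.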