Gene Mjls_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_2939 
Symbol 
ID4881536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3060682 
End bp3062172 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content69% 
IMG OID640140234 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_001071209 
Protein GI126435518 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.119687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGTA CGCGTGACGA CACCCCACTG TGGCTCTTCG CCGACCAACT CGGCCCGGCC 
GTCCACGGCG GCGAGCACGC CCACCGCGAC GTGCTGCTCA TCGAGGCCGA CCACGCCCTG
CGCAAGCGCC GCTACCACCG CCAGAAACTG CACATCGTGC TGTCCGCGCT GCGCCACGCC
GACCGCGACC TCGGCGACCG CGCCACCCTC CTCCGCTCCG AGACCTACAC CGACGCGCTC
GAACGCTACG GCCGGCCCGT CCTCGTCCAC GAGCCGACGT CCTTCGCCGC CGAGAGGTTC
GTCCACCGCC TCAAACAGCG TGGCCTCGTC GCCGACATCC TGCCCACCCC GACATTCGCG
TTGCCGCGCA AGGACTTCGA ACAGTGGGCC GGGAACCGCA CCCGGTTCCG CATGGAGGAC
TTCTACCGCG ACCAACGCCG CCGCTTCGAC GTCCTGATGA GCGGGGCCGA TCCCGTCGGC
AACCGGTGGA ACTACGACGA GGAGAACCGC CACTCCCCAC CGAAGAAGCG GCGCACCCTC
GACGTGCCCG CGCCGTACAA GCCCCGCGAG GACGACATCG ACGAAGAGGT CCGCCGCGAC
CTCGACCGGA TGGACCTCGA CACCGTCGGC GCCGACGGCC CCCGCCTGTT CGCCGTCACA
CCCGCCGAAG CCAAACGCGC CCTCACCCGC TTCATCGAGC ACCGCCTGCC GACCTTCGGC
GACTACGAGG ACGCGATGAT GGGCGAGGAC TGGGCGATGT CGCACTCACT GTTGTCGGTG
CCGCTCAACC TCGGCGTGCT CCACCCCCTC GACGCTGTGT ACGCCGCCGA ACAGGCCTAC
CGCGACGGGA CCGCGCCGCT GGCTGCCGTC GAGGGGTTCA TCCGCCAGAT CCTCGGCTGG
CGCGAGTACA TGTGGCATCT CTACTGGCAT TTCGGCGAGC GGTACGTCGA CAGCAACGAA
CTCGACGCCA GGACACCGCT TCCGGACTGG TGGGCCGACC TCGACGCCGA CGCCGTGACC
GCCGAATGCC TGCGCCACGC GCTGATGGGG CTTCGTGACC GGGGCTGGAC GCACCACATC
CAGCGGCTGA TGATCCTCGG CAGCCACGCC CTGCAGCGCG GATACCACCC TCGCGAACTC
ACCGAGTGGT ACGCCACCGC CTACGTCGAC GGCTTCCGCT GGGTCATGCC CACCAACGTC
GTCGGGATGA GCCAGCACGC CGACGGTGGC ATGCTCGCCA CCAAGCCGTA CACCTCCGGC
GGCGCCTACA TCAACAAGAT GAGCGACCAC TGCGGCGACT GCGCCTACGA CCCGCGTAAA
CGCCTCGGCG AGGACGCCTG CCCGTTCACG GCCGGCTACT GGGCCTTCGT GCACCGCCAC
CGCGACCGGC TCGAGCGCAA CATGCGCACC CGCCGGGCGG TACAGGGGTT GAACCGGCTC
GGCGACCTCG AGGACGTCCT CGCCCAGGAG GACAAGCGCA CACGGTTCTA G
 
Protein sequence
MTGTRDDTPL WLFADQLGPA VHGGEHAHRD VLLIEADHAL RKRRYHRQKL HIVLSALRHA 
DRDLGDRATL LRSETYTDAL ERYGRPVLVH EPTSFAAERF VHRLKQRGLV ADILPTPTFA
LPRKDFEQWA GNRTRFRMED FYRDQRRRFD VLMSGADPVG NRWNYDEENR HSPPKKRRTL
DVPAPYKPRE DDIDEEVRRD LDRMDLDTVG ADGPRLFAVT PAEAKRALTR FIEHRLPTFG
DYEDAMMGED WAMSHSLLSV PLNLGVLHPL DAVYAAEQAY RDGTAPLAAV EGFIRQILGW
REYMWHLYWH FGERYVDSNE LDARTPLPDW WADLDADAVT AECLRHALMG LRDRGWTHHI
QRLMILGSHA LQRGYHPREL TEWYATAYVD GFRWVMPTNV VGMSQHADGG MLATKPYTSG
GAYINKMSDH CGDCAYDPRK RLGEDACPFT AGYWAFVHRH RDRLERNMRT RRAVQGLNRL
GDLEDVLAQE DKRTRF