Gene Information |
Locus tag | Tmz1t_3951 |
Symbol | |
ID | 7873597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4347671 |
End bp | 4349251 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700888 |
Product | Deoxyribodipyrimidine photo-lyase |
Protein accession | YP_002890911 |
Protein GI | 237654597 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
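The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record for Tmz1t_3951.
length = gene_length(4347671, 4349251)
print(length, protein_length(length))
```

Under those assumptions the computed values agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.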
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTACG GCCTGGTCTG GTTCAAGCGC GACCTGCGCC TCGCCGATCA CGCCGCCTTG GCCACGGCCG CGCGGCGCGG GCCGGTGCTG TGCGTGCTGA TCGTCGAGCC GTCGCTGTGG GCGCAGCCCG ACGCCGCGCG CCAGCACTAC GAGTTCATGC TCGAAAGCGC ACGCGAGCTG CACGCCGGGC TGGCCCGCGT CGGCGGTCGC CTGCACCTGC TGGTGGGCGA GGCCGTCGCG GTGCTCGATC GCCTGCACGC CGCAGCGCCC TTCGACACCC TGCATTCGCA CGAGGAGACC GGCAACGCCG CGAGCTACGC GCGCGACCGC GCGGTGGCGC GCTGGTGCCG GGCGCGCGGC GTGCGCTGGC ACGAGCCGGC GCAGTTCGGC GTGGTGCGCC GGCTCGACGA CCGCGACCGC TGGCAGGCAG CGTGGGAGGC GCAGGTCGCC GCGCCGCAGG TCGAGCTGCC CGAGCCCTCG CGGCTGCGCT TCGTTGCGTT GCCCGCTGCC CTGCAGCCGG GCGCCGGCGC GACCTGGCCG GATCGCGGCG CGATCGCCGC CGTGCGTGCG CCGGCGGCCG TGGCCCTCGG GCTCGACGCC TTCGAGCCGC CCCGGCGCCA GCGTGGCGGG CGGCACGCGG CGCTGGAGGT GCTGCACGAC TTCCTCGACG CGCGCAGCGG GCAATACCGC GGCGGCATCT CCTCGCCGCT GAAGGCACCC ACCGCGTGCT CGCGGCTGTC GCCCTACCTG GCCTGGGGCT GCCTGAGCCT GCGCGAACTG GTGCAGGCCA CCCGCGCGCG CGTCGCCGCG CTGCCCGAGG GCGACCGCCG CCGCGCCGGC CTGGCGGCCT TCCTCAGCCG CCTGTACTGG CACTGCCACT TCATCCAGAA GCTGGAGAGC GAGCCGACGC TGGAGTTCCG CAACCTGCAC CGCGGCTACG ACGGCCTGCG CGAGCCGGAA TGGAACCAGG CGCATTTCGA CGCGCTGGTG GGCGGGCGCA CCGGCTGGCC ACTGGTCGAC GCCTGCGTGG CGATGCTGCG CGCGACCGGC TGGCTCAACT TCCGCATGCG CGCGATGCTG GTGTCGGTGG CGGCCTACCC GCTCTGGCTG CACTGGCGCG AGGTCGGCCT GTGGCTGGCG CGCGCCTTCC TCGACTACGA GCCCGGCATC CACTGGAGCC AGCTGCAGAT GCAGTCCGGC ACCACCGGCA TCAACACCAC CCGGGTGTAC AACCCGATCA AGCAGGCGCG CGACCACGAC CCGCAGGGCG TGTTCGTGCG GCGCTGGCTG CCGGCACTGC GGCGGGTGCC GGACACCTGG CTGTTCGAGC CCTGGCGCAT GCCGGAGTCG GTACAGGCGC GCTGCGGCGT GCGCGTCGGC GAGGACATCG CGTTGCCGGT GGTCGATCTG GAGAGCGCCA CGCGCGCCGC CAAGACGCGC ATCCACGCAC TGCGCGCCCA GCCCGAGGTG CGCGCGGCGA AGGCGGCCAT CGTCGAGCGC CACGGCTCGC GCAAGCCGCC GCAGGGGCGG CGCAAGACGG CGGCGGGGTC GGCGTCGGGA CAGCTGGACC TGGGGTTTTG A
|
Protein sequence | MSYGLVWFKR DLRLADHAAL ATAARRGPVL CVLIVEPSLW AQPDAARQHY EFMLESAREL HAGLARVGGR LHLLVGEAVA VLDRLHAAAP FDTLHSHEET GNAASYARDR AVARWCRARG VRWHEPAQFG VVRRLDDRDR WQAAWEAQVA APQVELPEPS RLRFVALPAA LQPGAGATWP DRGAIAAVRA PAAVALGLDA FEPPRRQRGG RHAALEVLHD FLDARSGQYR GGISSPLKAP TACSRLSPYL AWGCLSLREL VQATRARVAA LPEGDRRRAG LAAFLSRLYW HCHFIQKLES EPTLEFRNLH RGYDGLREPE WNQAHFDALV GGRTGWPLVD ACVAMLRATG WLNFRMRAML VSVAAYPLWL HWREVGLWLA RAFLDYEPGI HWSQLQMQSG TTGINTTRVY NPIKQARDHD PQGVFVRRWL PALRRVPDTW LFEPWRMPES VQARCGVRVG EDIALPVVDL ESATRAAKTR IHALRAQPEV RAAKAAIVER HGSRKPPQGR RKTAAGSASG QLDLGF
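Both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two display blocks of the gene sequence in this record.
prefix = "GTGAGCTACG GCCTGGTCTG"
print(ungroup(prefix))      # GTGAGCTACGGCCTGGTCTG
print(gc_fraction(prefix))  # 0.65
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field. The gene begins with GTG, which translation table 11 accepts as an initiation codon, which is consistent with the protein sequence beginning with M.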