Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3953 |
Symbol | |
ID | 7873599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4349483 |
End bp | 4351030 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700890 |
Product | cryptochrome, DASH family |
Protein accession | YP_002890913 |
Protein GI | 237654599 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | [TIGR02765] cryptochrome, DASH family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCGG ACGTGGTGAT CTTCTGGTTT CGCAACGATC TGCGCGTGGG TGACAACCTC GCGCTGGTGG AGGCCTGTGC GTCCGCCGGG CGCCTGCTTC CGGTTTACTG CCACGACCCC TCCGCCGATG CACCGACGCG CTGGGGCTTC GTGCGCCGCG GGCCGCATCG GCGCGCCTTC CTGGAGGCGG CGCTGGGCGA CCTGGATGCG GCCCTGCGGG CGCGCGGCAG CGCGCTGCTG CAGCTGAACG GCGCGCCGCG TGCGGTGCTG CCGGCGCTCG CGGAAGCCGT GGGGGCGGCC GTGGTGGTGT GCGAGGCGAT CGCGGCACCC GAGGAGGAGG ACGAGCTCGC CGCGCTACGC GCCGCCGGGC TGAAGGTGAA GGCGATATGG CAGTCCACCC TGCTCGACCC CGCGGCGCTG CCTTTCGCGG CGGCGCGCCT GCCCAAGGTG TTCACCGCCT TTCGCCAGGC AGTGGAGTCC GCCGGCCTGC AGCCGCCCGC GCCCCTGCCG GCGCCGCGGA CCCTGCCGCC GTGGCCACCG CTCGATCGAC CGACCCGCAG GAGCGACGCA AGTCGCGATC TCCGCGCCGG GCGCTCGCCG GGGGTGGCCG GTCGCGATCC TTGCGCCGGG CGCTCGCCGG GGGTGGAGCG TGCGGAACCG GGGAATCGCG ACTTGCGTCG CTCCTACGGG GGCTCCTGCG GGGCCGCCGA TGCGCTCACC GTGGAAATCG GCGAACAGGC CTCGTTTCCG TGGTGGCAGC CGGCATTCGC CGGGGGCGAG CGTGCGGCGC TCGCGCATCT GGCACGCTAT TTCGCCGGCG ATCGGCCGCG GCACTACAAG GCGACGCGCA ACGGGCTGAG CGGCGTCGAC TTCTCCAGCA AGTTCTCGCC CTGGCTGGCG CAGGGCGCCT TGTCGGCGCG GGTGGCCTTG GCCGCGCTGC GCCGCCACGA GATCGAGCAG GGGCCGAGCG ACGGAAGCTA CTGGCTGTGG TTCGAGCTGC TGTGGCGCGA CTACTTCCGC CTCCTGCATC TGCAGCAAGG CCGCCGTCTG TACCGGGCGC GCGGGCTGAA CGAAGGCGCA GCGCAGCCCG CGCACGACGC CGCTGCGTTC GCGGCCTGGC GCGTGGGCGG CACCGGCCAT CCCTTCATCG ACGCCGGCAT GCGCGAGCTC GCGGCCACCG GCTGGCTGTC GAACCGCATG CGCCAGATCG TCGCCAGCTA CCTGATCCAC GACCTCGGCT GTGACTGGCG TGCCGGCGCG GCGTGGTTCG AGGCGCAGCT GGTCGATTAC GACGTGTATA GCAACCAGGG CAACTGGCTC TACATCGCCG GCCGCGGCAC CGATCCGCGT GGCGGGCGGC GCTTCGATCC GGACCGGCAG GCGGCGATGT ACGACGCGGA CGGCGCCTAT CGGGCGCTGT GGGCCGAGCC CGAGCCGCGC ACGCGGGCCA GCGCCGCTGG CGGCAGCGCG CCGAGCAGGG CCCCGGCCGA CACCACCACG ATGCCGACGA TGGTGGTCGT GCCGAGGGCC TCGTCCAGGA TCAGATAG
|
Protein sequence | MLSDVVIFWF RNDLRVGDNL ALVEACASAG RLLPVYCHDP SADAPTRWGF VRRGPHRRAF LEAALGDLDA ALRARGSALL QLNGAPRAVL PALAEAVGAA VVVCEAIAAP EEEDELAALR AAGLKVKAIW QSTLLDPAAL PFAAARLPKV FTAFRQAVES AGLQPPAPLP APRTLPPWPP LDRPTRRSDA SRDLRAGRSP GVAGRDPCAG RSPGVERAEP GNRDLRRSYG GSCGAADALT VEIGEQASFP WWQPAFAGGE RAALAHLARY FAGDRPRHYK ATRNGLSGVD FSSKFSPWLA QGALSARVAL AALRRHEIEQ GPSDGSYWLW FELLWRDYFR LLHLQQGRRL YRARGLNEGA AQPAHDAAAF AAWRVGGTGH PFIDAGMREL AATGWLSNRM RQIVASYLIH DLGCDWRAGA AWFEAQLVDY DVYSNQGNWL YIAGRGTDPR GGRRFDPDRQ AAMYDADGAY RALWAEPEPR TRASAAGGSA PSRAPADTTT MPTMVVVPRA SSRIR
|
| |