Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0531 |
Symbol | |
ID | 7085145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 597246 |
End bp | 598781 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697558 |
Product | deoxyribodipyrimidine photolyase-related protein |
Protein accession | YP_002354200 |
Protein GI | 217968966 |
COG category | [R] General function prediction only |
COG ID | [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.548466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTC TCCGCCTGAT CCTCGGCGAC CAGCTCCACG CCGCCCACTC CTGGTTCCGC GCCCCGCGCG CCGACGCGCT GTACCTCATG ATGGAGGTGC GCAGCGAGAC CGACTACGTC CGCCACCATG CGCAGAAGGT GCTGGCGATC TTCGCCGCGA TGCGCGCCTT CGCCGCCGCG CTGCAGGCCG CCGGTCATCG CGTGCGCTAC CTGAAGATCG GCGACGCCGA CAACCGCCAG TCCTTCGCCG CCAACCTGGC GCAGGTGGCG GCCGAGTGCG GCGCGACCCG CTTCGAGCGC ATGGAGGCCG ACGAGTGGCG GGTGGAGCGC CTGCTGGACG AGGCCGCGCA GACGCTCGGC CTGCCGCAGG CGGTGGTCGG CAGCGAGCAC TTCCTCGCCG AGCGTGCCGA ACTCGCGCAG CGCTTTGCCG CCAAGGTGCC GCGCATGGAG TTCTTCTACC GCGACCTGCG CCGCCGCCAC CGCATCCTGG TCGACGCCGA CGACGTCCCC GTGGGCGGGG TGTGGAACTT CGACGCGCAG AACCGCGAGA AATGGCCGGG CGACCCGCCC GCGCCGGCAT GGCCCTGGCG CGGCCACGAC CTCTGCGCGC TGTGGGACGA GATCGTCGCG GCGGGCGTGC GAACCATCGG TGCGCCCCAT GCCGACCAGC TGCGCTGGCC GATCACCCGG CGCGAGGCGC GTGCCGGGCT GGCGCACTTC ATCGAGCACG CCCTGCCCTG GTTCGGCCCC TACCAGGACG CGATGAGCAC GCGCTCGACC ACGCTGTTCC ACTCCGGGCT GTCGTTCGCG CTCAACGTCA AGCTGCTGCA CCCGCGCGAG GTCATCGACG CCGCGCTCGC CGCCTGGCAG GCGGGCAAGG TGGAGCTGGC GAGCTGCGAG GGCTTCGTGC GCCAGATCCT GGGCTGGAGG GAGTTCGTGC GCGGCGTGTA TTGGGCGCGC ATGCCGGGCT ATGCGCAGGC CAATGCGCTC GACGCCCGGC GCCCGCTGCC GGAGTGGTAC TGGAGCGGCG ACACGAAGAT GGCCTGCCTG CGCCACGCCA TCGCGCAATC GCTCGACACC GCCTACGCCC ACCACATCCA GCGCCTGATG ATCACCGGCA ACTTCGCGCT GCTGGCGGGC TGCGACCCGG ACGCGGTGGA CGCCTGGTAC CTGGGCATCT ACATCGACGC CTTCGAGTGG GTGGAGATGC CCAACACCCG CGGCATGAGC CAGTTCGCCG ACGGCGGCGT GATCGCGAGC AAGCCCTACG CCGGTGCGGC GAGCTACATC GGCAAGCAGT CGGACTATTG CAAGGGCTGC GCCTACGACC CCAGGCGCCG CCACGGCGCC AGCGGCAAGC CGGCCTGCCC CTTCAACAGC CTGTACTGGG ACTTCCTGCT GCGCCACGAG GCGCGCTTCG CACGCAACCC GCGCATGGCG ATGCCGTACA AGGCGTGGGC GAAGATGGAC GCGGGCGAGC GCGCCGCCAC GCTGGCGCAG GCCGCGCACT GGCTGGCGCG GCTGGACGAA CTCTGA
|
Protein sequence | MTTLRLILGD QLHAAHSWFR APRADALYLM MEVRSETDYV RHHAQKVLAI FAAMRAFAAA LQAAGHRVRY LKIGDADNRQ SFAANLAQVA AECGATRFER MEADEWRVER LLDEAAQTLG LPQAVVGSEH FLAERAELAQ RFAAKVPRME FFYRDLRRRH RILVDADDVP VGGVWNFDAQ NREKWPGDPP APAWPWRGHD LCALWDEIVA AGVRTIGAPH ADQLRWPITR REARAGLAHF IEHALPWFGP YQDAMSTRST TLFHSGLSFA LNVKLLHPRE VIDAALAAWQ AGKVELASCE GFVRQILGWR EFVRGVYWAR MPGYAQANAL DARRPLPEWY WSGDTKMACL RHAIAQSLDT AYAHHIQRLM ITGNFALLAG CDPDAVDAWY LGIYIDAFEW VEMPNTRGMS QFADGGVIAS KPYAGAASYI GKQSDYCKGC AYDPRRRHGA SGKPACPFNS LYWDFLLRHE ARFARNPRMA MPYKAWAKMD AGERAATLAQ AAHWLARLDE L
|
| |