Gene Information |
Locus tag | Tmz1t_2794 |
Symbol | |
ID | 7873203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3024988 |
End bp | 3025980 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643699716 |
Product | integrase family protein |
Protein accession | YP_002889771 |
Protein GI | 237653457 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
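The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based, inclusive coordinates (the convention under which Start bp and End bp reproduce the listed Gene Length) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3024988, 3025980)
print(length, protein_length_aa(length))  # prints: 993 330
```

The two printed values match the Gene Length and Protein Length fields of the record.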
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCATCAA TCAGGCAACG CGGGAACAGG TGGCAATGTC GCGTCACACG GCACGGCTTC CCGCCCGAAA CCAAGTCCTT CGCAACGAAG GCGGACGCGG AAACATGGGC ACGATCCATC GAAGTCGAGA TGGACAAGGG CGTCCATCAG AACCGTGCAT CAGTCGAGCG AACAACGCTT GCAGACATCC TGCTGCGGTA CGCCGAAGAG GTCACGCCCT GCAAGAAGGG GGCGAAGGAT GAAGCCATCC GGCTGAACGC CCTGCGGGCA AACAAGCTCG CCAAGCATTC ACTGGCAAAC CTCAGCGCCG CAGCGGTGGC GAAGTTCCGC GACGAGCGTT TGAAGACCGT ATCGGCGGGA ACCGTGCTGC GTGACCTCGC CCTGATTTCG TCGGTCCTGA ACCATGCGCG CAGGGAATGG GGCTTCCCGG TTGAAAACGC CGTTCAGGCG ATCCGTAAGC CCCGCCAGCC TCAAGGGCGG GAACGCGTGC TATCGCACGA CGAAGAAGCC CGCCTGCTGG CCGCATCGGC CCCTATCGGG CGTCGTAGTC CCTGGCTCCA GCCGATCATC ATCCTTGCAC TGGAAACGGC CATGCGGCGC GGCGAACTGC TTGCGCTGCG GTGGGAGCAT GTCAGCCTCG ACAAGCGCAC CGCCCTACTT CCCGACACCA AAAACGGCAC ACGACGCCTT GTGCCGCTTT CACCCCGCGC AATCGACACC CTCAAGCACA TGCCGCGCGC CATTGATGGG CGCGTGTTCC CAATCTCGGA ACCAGCCCTG CACCTGCGGT TCAAGCTGGC GTGCGACCGA GCGGGAATCG ACGGGCTCCA CTTCCACGAC TTGCGACACA CGGCAACCAC CAGACTTGCC GAGAAGCTGA CCAACCTTGC GGAGTTGTCC GCCGTGACCG GACACAAATC GCTCCAGATG CTGAAGCGGT ACTACCACCC CAACGCCGAA GCACTCGCGG AAAAGCTGGC GCGACACGGT TAA
Protein sequence | MASIRQRGNR WQCRVTRHGF PPETKSFATK ADAETWARSI EVEMDKGVHQ NRASVERTTL ADILLRYAEE VTPCKKGAKD EAIRLNALRA NKLAKHSLAN LSAAAVAKFR DERLKTVSAG TVLRDLALIS SVLNHARREW GFPVENAVQA IRKPRQPQGR ERVLSHDEEA RLLAASAPIG RRSPWLQPII ILALETAMRR GELLALRWEH VSLDKRTALL PDTKNGTRRL VPLSPRAIDT LKHMPRAIDG RVFPISEPAL HLRFKLACDR AGIDGLHFHD LRHTATTRLA EKLTNLAELS AVTGHKSLQM LKRYYHPNAE ALAEKLARHG
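Both sequences are displayed in space-separated blocks of ten characters. The following is a minimal sketch for removing that display formatting and computing GC content from the cleaned string. The helper names are hypothetical, and the 20-base example is only the first two blocks of the gene sequence, so its GC value is not expected to equal the whole-gene figure listed in the record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace used for 10-character display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two display blocks of the gene sequence in this record.
prefix = strip_blocks("ATGGCATCAA TCAGGCAACG")
print(len(prefix), gc_percent(prefix))  # prints: 20 50.0
```

Applying `strip_blocks` to the full Gene sequence and Protein sequence fields before any length or composition check avoids counting the separator spaces as sequence characters.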