Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1070 |
Symbol | |
ID | 7084054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1167994 |
End bp | 1169484 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643698088 |
Product | Integrase catalytic region |
Protein accession | YP_002354728 |
Protein GI | 217969494 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.47791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTGC TGGGGAAGGT CAGAAGGCTG TACTACCGGG AGGGGCTCAC GCTCTCGGAG ATCGAGCGTC GCACGGGGCT GACGCGCAAG ACGATCCGCA AGTGGCTGAA GGCGCCCGAG GGGGTTGAGC CGAAGTATCG ACGCAAGGTC GTCGACACCA AGATCGCGCC GTACGCGGCG CAGTTGGTCA AGATGTTGGA GACCGATGCG CGGCGACCGC AACGGGACCG ACGCACGGCG CTGAAGCTGT TTGGTCAGTT GCAGGTGCAG GGGTTCGACG GCGATTACAG CCGCGTCACC GAGTTCATCC GGCGCTGGAG GGCCGAAGGC GGTGCCTCGG TCACGAAGGC CTTCGTTCCG CTGCAGTTCG AGCCGGGAGA GGCGCACCAG TTCGACTGGA GCGAGGAGCA TCTGGTCATC GGTGGGGTGT GGCGCAAGGT GCTCGTGGCG CACCTGAAGC TGTGCTACAG CCGTGCCTTT GTCGTGCAGG CGTATCCGAC GCAGAGCCAC GAGATGCTGT TCGATGCGCA CGCGCGGGCG TTTGCTGCGC TCGGGGGCGT GGCCCGGCGC GGGATCTACG ACAACATGAA GACGGCCGTC GACCGGGTGA AGAAAGGCAA GGCGCGGGTG GTCAATACAC GCTTTGCGGC GATGGCCTCG CACTACCTGT TCGAAGCGGA CTTCTGCAAT GTTGCCAGCG GTTGGGAGAA AGGCGTCGTC GAGAAGAACG TGCAGGACAG CCGACGGCGG ATCTGGCAAG AGGCGGCCGA GACGCGCTTT GGTTCCTTCA CCGAGCTGAA CCTGTGGTTG CTGGCGAAGT GTCGCAGCCT GTGGCAGGAG CTCAAACATC CGGAGTACGA CCTCAGCCTG GCGGAGATGC TCGAGCAGGA GCAGCCCAGC CTGATGCCGA TGATCACGCC CTTCGACGGC TACGTGGAGA CGCTGGGCAA GGTCTCGAGC ACCTGCCTGG TTACGTTCGA GCGCTGCCGC TATTCAGCGC CGTGTGAGTT GGTGGGTCAG ATGGTCGGCA TCCGGGTGTA TCCCGAGCAG ATCGAGCTCG TCGCGCACGA CACCGTCGTG GCCCGTCACG TGCGCAGCTT TGCGCGCAAC GAAGCGCGCT ACGACTGGCA GCACTATGTG TCGCTGATCG AGCGCAAGCC GGGCGCGCTC AGGAATGGCG CGCCCTTCGC CGACATGCCT GCGCCATTGC AAGCGCTGCG CGCGTTGCTG CTCAAGCGCG AAGGGGGAGA TCGGGTCATG GCGAAGGTGC TCGCCGCCGT GCCTCAGCAT GGCCTGGGGG CGGTGATCGT GGCGGTGGAG TTGGCGCTCG AAGCGGGCCG CCCCAGCCCC GAGCACATCG AGAACGTGCT CAACCGCCTG AAGTCAGCCC CTGCCACGCC GACCGTCGAC ACGGTACTGA CCCTCAGCGA AACACCGGTG GCAGACCCGC AGCGTTACGA CCGGCTGCGT GCGGAGGTGA GCCATGCGTG A
|
Protein sequence | MNLLGKVRRL YYREGLTLSE IERRTGLTRK TIRKWLKAPE GVEPKYRRKV VDTKIAPYAA QLVKMLETDA RRPQRDRRTA LKLFGQLQVQ GFDGDYSRVT EFIRRWRAEG GASVTKAFVP LQFEPGEAHQ FDWSEEHLVI GGVWRKVLVA HLKLCYSRAF VVQAYPTQSH EMLFDAHARA FAALGGVARR GIYDNMKTAV DRVKKGKARV VNTRFAAMAS HYLFEADFCN VASGWEKGVV EKNVQDSRRR IWQEAAETRF GSFTELNLWL LAKCRSLWQE LKHPEYDLSL AEMLEQEQPS LMPMITPFDG YVETLGKVSS TCLVTFERCR YSAPCELVGQ MVGIRVYPEQ IELVAHDTVV ARHVRSFARN EARYDWQHYV SLIERKPGAL RNGAPFADMP APLQALRALL LKREGGDRVM AKVLAAVPQH GLGAVIVAVE LALEAGRPSP EHIENVLNRL KSAPATPTVD TVLTLSETPV ADPQRYDRLR AEVSHA
|
| |