Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3173 |
Symbol | |
ID | 7874314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3446529 |
End bp | 3448019 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700102 |
Product | Integrase catalytic region |
Protein accession | YP_002890146 |
Protein GI | 237653832 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTGC TGGGGAAGGT CAGAAGGCTG TACTACCGGG AGGGGCTCAC GCTCTCGGAG ATCGAGCGTC GCACGGGGCT GACGCGCAAG ACGATCCGCA AGTGGCTGAA GGCGCCCGAG GGGGTTGAGC CGAAGTATCG ACGCAAGGTC GTCGACACCA AGATCGCGCC GTACGCGGCG CAGTTGGTCA AGATGTTGGA GACCGATGCG CGGCGACCGC AACGGGACCG ACGCACGGCG CTGAAGCTGT TTGGTCAGTT GCAGGTGCAG GGGTTCGACG GCGATTACAG CCGCGTCACC GAGTTCATCC GGCGCTGGAG GGCCGAAGGC GGTGCCTCGG TCACGAAGGC CTTCGTTCCG CTGCAGTTCG AGCCGGGAGA GGCGCACCAG TTCGACTGGA GCGAGGAGCA TCTGGTCATC GGTGGGGTGT GGCGCAAGGT GCTCGTGGCG CACCTGAAGC TGTGCTACAG CCGTGCCTTT GTCGTGCAGG CGTATCCGAC GCAGAGCCAC GAGATGCTGT TCGATGCGCA CGCGCGGGCG TTTGCTGCGC TCGGGGGCGT GGCCCGGCGC GGGATCTACG ACAACATGAA GACGGCCGTC GACCGGGTGA AGAAAGGCAA GGCGCGGGTG GTCAATACAC GCTTTGCGGC GATGGCCTCG CACTACCTGT TCGAAGCGGA CTTCTGCAAT GTTGCCAGCG GTTGGGAGAA AGGCGTCGTC GAGAAGAACG TGCAGGACAG CCGACGGCGG ATCTGGCAAG AGGCGGCCGA GACGCGCTTT GGTTCCTTCA CCGAGCTGAA CCTGTGGTTG CTGGCGAAGT GTCGCAGCCT GTGGCAGGAG CTCAAACATC CGGAGTACGA CCTCAGCCTG GCGGAGATGC TCGAGCAGGA GCAGCCCAGC CTGATGCCGA TGATCACGCC CTTCGACGGC TACGTGGAGA CGCTGGGCAA GGTCTCGAGC ACCTGCCTGG TTACGTTCGA GCGCTGCCGC TATTCAGCGC CGTGTGAGTT GGTGGGTCAG ATGGTCGGCA TCCGGGTGTA TCCCGAGCAG ATCGAGCTCG TCGCGCACGA CACCGTCGTG GCCCGTCACG TGCGCAGCTT CACGCGCAAC GAAGCGCGCT ACGACTGGCA GCACTATCTG CCGCTGATCG AGCGCAAGCC CGGTGCGCTC AGGAACGGCG CACCCTTCGC CGACATGCCC GCGCCACTGC AGACGCTGCG CGCACTGCTG CTCAAGCGCG AAGGCGGAGA TCGGGTCATG GCGAAGGTGC TCGCCGCCGT GCCTCAGCAT GGCCTGGGGG CGGTCATCGT GGCGGTGGAG TTGGCGCTCG AAGCGGGCCG CCCCAGCCCC GAGCACATCG AGAACGTGCT CAATCGCCTG AAGTCGGCCC CTGCCACGCC GACCGTCGAC ACGGTACTGA CCCTGAGCGA AACACCGGTG GCGGACCCGC AGCGTTACGA CCGGCTGCGT GCGGAGGTGA GCCATGCGTG A
|
Protein sequence | MNLLGKVRRL YYREGLTLSE IERRTGLTRK TIRKWLKAPE GVEPKYRRKV VDTKIAPYAA QLVKMLETDA RRPQRDRRTA LKLFGQLQVQ GFDGDYSRVT EFIRRWRAEG GASVTKAFVP LQFEPGEAHQ FDWSEEHLVI GGVWRKVLVA HLKLCYSRAF VVQAYPTQSH EMLFDAHARA FAALGGVARR GIYDNMKTAV DRVKKGKARV VNTRFAAMAS HYLFEADFCN VASGWEKGVV EKNVQDSRRR IWQEAAETRF GSFTELNLWL LAKCRSLWQE LKHPEYDLSL AEMLEQEQPS LMPMITPFDG YVETLGKVSS TCLVTFERCR YSAPCELVGQ MVGIRVYPEQ IELVAHDTVV ARHVRSFTRN EARYDWQHYL PLIERKPGAL RNGAPFADMP APLQTLRALL LKREGGDRVM AKVLAAVPQH GLGAVIVAVE LALEAGRPSP EHIENVLNRL KSAPATPTVD TVLTLSETPV ADPQRYDRLR AEVSHA
|
| |