Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1604 |
Symbol | |
ID | 7084814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1797692 |
End bp | 1799731 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698624 |
Product | hypothetical protein |
Protein accession | YP_002355255 |
Protein GI | 217970021 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4389] Site-specific recombinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0288386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAC TGCTCGAACG CTTCGGCCAG CCGGACCAGG ATCCGGTCCG GCTGTGGTCC GCACTGGTCG ACAAGCTGCG CCCCGCGCGC CCGACCGACA CCGACCAGGC CACCGGGAAC CTGCGCGAAC TCGGCAACCT GCTCGCGCGC CGGCCCGATC TGCTGGCGAA CCTGCGCACG GCGGTGCTGG AGCTCTTCGC CGAGCGCAAG CAGGTGACGA TGTACGTCTC CTCCGGCCTC CTGCCCTCGA CCGGGTTCTT CTCCGAGACC TCGCGCAGGT TCGGCAGCCG CATGCTGCCC GAGCTGCTCG ACACCACCCA CCTCAAGGAC CTCCTCTCCG CGGTCTTCCA CCGCGTGGAT GACGAAGTCT GGGTGCGCGC CATCGCGGAC GAGGCCTGGC AGGACTTCCT CCGCCTGCTG GTCGGACACC AGACCCCGAT GTTCGAGGAA GACGCCAGCC CGCTGCCGAG CGCGGTGGGC GAGATCCTCG AGTCGCTGCG CGTGCTCTCC TTCCACGTCT CGGCGATCGG GCTCGACCGC GAGCTGGTGC GCATCGATCC CCACCTCGAG GAGCACGAAT CCCCCTTCCT CGCGCAGAAC GCCGAGCTGC TCGCCTACAT CGGCCACTAC AAGGCGTGGT GGACCACCCC GGGCGCCCTG ATCGCCGACG ACAAGCACCT CACCGTGATG CTGCACCAGT GCGACGAGGT GCTGCAGCGG GTGCGCAAGC GCGCGATGCG GGTGGGCACC AGCCTGACGC TGACCTTCAA GCTCGAGCGC CTGCGCCAGC ACCTGGAGCG CATCCACGAG CTCATCGCGC TGCTCGGCGA GCTGCGCATC CGGCGCGTGG TCGAGGACGC CGCGCCCCGC ATCGTGCGCT TGTTCAAGAC CCTGGTGCGC GCCGAGTGCC GCAAGAACAT CCTCTCCGAC TACTGGGGCC AGAACGTCGA GCTGCTGTCG CTGCGCATGA CCGAGAGCGC GAGCCGCACC GGCGAGCACT ACATCACCAG TTCGCGCAGC GAGTATTTCG GGCTGTTCGC ATCGGCCGCG CTCGGCGGGC TGATCATCGC CTTCATGGCT GGCAACAAGG TCGTGCTCGG CAGCCAGGGC ATGGCCCCGC TCAACGAGCT GCTGTCCTTC TGCCTCAACT ACGGACTGGG CTTCGCGCTG ATCCACATGC TAGGCGGCAC GGTCGCCACC AAGCAGCCGG CGATGACCGC AAACGCGATC GCCGCCTCGA TCGGCGAGGC CAGGGGCAAG ACGCGCGACC TGGAGGCGCT CGCCGACCTG ATCGTGCGCA CGATCCGCAG CCAGATCGGT GCCATCCTCG GCAACATCGG GGTGGCGATC CCGGTGGCGA TCGGCGTGGG CGTGCTGATC CACTTCGCCA CCGGCAGCCA TTTCATCAGC CCGGAGAAGG CGCACTCATT GCTGGCCGAG ATCCATCCAC TCGGCGGCGC GCTGTTCTTC GCCGCGATCG CGGGCGTGTG CCTGTTCATG TCGGGGCTGA TCGCCGCCTA TTACGACAAC CTCTCGGCCT ACAACCGCAT CCCGCAGCGG CTCACCCAGC TGCGCGGCCT GCGCCGCCTG CTCGGCGAGC GCCGCCTGCG CAAGCTCGCG GACTACGTCG AGCAGAACAT CGGTGCCCTC GTCGGCAACT TCTTCTTCGG TTTCCTGCTC GGCGGCGCCA CCGGCCTGGG AGTACTGTTC GGCCTGCCGA TCGACATCCG CCACATCGCC TTCTCGTCGG CCTACCTCGG CTACGCCGCG GCGGCGCTCG ACTTCTCGAT CCCGCTGACG ACCGCGGCGA TCGCCTTCGC CGGCGTGCTG CTGATCGGCC TCACCAACCT GACGGTGAGC TTCGCGCTGA CGCTGAGCGT GGCGATGCGC GCACGCCGCA TCACCTTCGC GCAGAGCCGC TCGCTCGGCG GCCTGCTGCT GCGCCGCCTG CTGCGCACGC CCCATGCCTT CCTGTTCCCG CCGCGCACAG ACCAAGGCGT GGATCAAAGC GCGCTTGGTC CGACGCCGCC GAACGGCTAG
|
Protein sequence | MEKLLERFGQ PDQDPVRLWS ALVDKLRPAR PTDTDQATGN LRELGNLLAR RPDLLANLRT AVLELFAERK QVTMYVSSGL LPSTGFFSET SRRFGSRMLP ELLDTTHLKD LLSAVFHRVD DEVWVRAIAD EAWQDFLRLL VGHQTPMFEE DASPLPSAVG EILESLRVLS FHVSAIGLDR ELVRIDPHLE EHESPFLAQN AELLAYIGHY KAWWTTPGAL IADDKHLTVM LHQCDEVLQR VRKRAMRVGT SLTLTFKLER LRQHLERIHE LIALLGELRI RRVVEDAAPR IVRLFKTLVR AECRKNILSD YWGQNVELLS LRMTESASRT GEHYITSSRS EYFGLFASAA LGGLIIAFMA GNKVVLGSQG MAPLNELLSF CLNYGLGFAL IHMLGGTVAT KQPAMTANAI AASIGEARGK TRDLEALADL IVRTIRSQIG AILGNIGVAI PVAIGVGVLI HFATGSHFIS PEKAHSLLAE IHPLGGALFF AAIAGVCLFM SGLIAAYYDN LSAYNRIPQR LTQLRGLRRL LGERRLRKLA DYVEQNIGAL VGNFFFGFLL GGATGLGVLF GLPIDIRHIA FSSAYLGYAA AALDFSIPLT TAAIAFAGVL LIGLTNLTVS FALTLSVAMR ARRITFAQSR SLGGLLLRRL LRTPHAFLFP PRTDQGVDQS ALGPTPPNG
|
| |