Gene Information |
Locus tag | Tmz1t_0317 |
Symbol | |
ID | 7085618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 358262 |
End bp | 359752 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697355 |
Product | transposase, IS21 family |
Protein accession | YP_002354003 |
Protein GI | 217968769 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
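The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(358262, 359752)
print(length_bp)                  # 1491, matches "Gene Length"
print(protein_length(length_bp))  # 496, matches "Protein Length"
```

Because the gene lies on the minus strand, the listed gene sequence would be the reverse complement of the replicon between these two positions; the length calculation is unaffected by strand.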
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0223423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTGT CCCCTGAACA CGAAGCGCAG ATCCTGCGCT ACTACCACGC CGAGCGCTGG CGCATCGGCA CCATCGCCGT GCAGCTCGGG CTGCACCGCG ACACCGTCGC GCGCGTGCTC GCCCAGGCCG GCCTGCCCCG GCACGGCCCC GTGCAGCGCG CCTCGGCGAT CGACCCCTAT CTACCCTTCC TCCACGAGAC GCTCGCGCAG TTTCCGCGCC TTACGGCCGC GCGGCTCTAC GACATGGTGC GCGCACGCGG TTACCCCGGG CGCCCCGATC ACTTCCGCCA CCTCATCGCC CGCCACCGCC CGCGCCCGAG CGCCGAGGCC TACCTGCGGC TACGCACCCT GCCCGGCGAG CAGGCCCAGG TCGACTGGGG GCACTTCGGG CACCTGACGA TCGGGCGCGC ACGCCGTCCG CTGATGGCCT TCGTCATGGT GCTGTCGTGG TCGCGCCAGA TCTATCTGCG CTTCTTCCTC GATGCGCGCA TGGAGAACTT CCTGCGCGGT CACGTGGGCG CCTTCGCGCA CTGGGGGGCG GTGCCGCGCA TCGCCCTCTA CGACAATCTG AAGAGCGCGG TGCTCGAGCG CTGTGGCAAC GCGATCCGCT TCCATCCCAC CTTGCTCGCG CTCGCTGGCC ATTACCGCTT CGAGCCGCGC CCAGTGGCGG TGGCGCGTGG CAACGAGAAG GGACGCGTCG AGCGCGCGAT CCGCTACGTG CGCGAGGCCT TCTTCGCCGG ACGCCCCTTC GCCGACCTGG ATGATCTGAA CGCGCAGGCC CAGGCCTGGT GCGAGGGCGC CGCCGGCGCG CGGCGCTGCC CCGAGGACGC CTCGATGACG GTGACCGAGG CCTTCGCGGC CGAGCGCGAG CGCTTGCTCG CGCTGCCCGA GGCGCCGTTC CCGACCGATG AGCTGCGCGC GGTATCGGCG GGCAAGACTC CGTACGTGCG CTTCGATCTG AACGACTACT CGATCCCCCA CACCCATGTG CAGCGCCCCC TCACCGTGTG CGCCGACCCG CTGCGGGTGC GCATCCTCGA CGGCGAGGAC GTCATCGCCA CCCACGCGCG CAGCTACGAC CGCCGCCAGC AGATCGAGTG TGCCGCGCAC CTCGAGGCGC TCGTCGCGCA CAAGCACGCG GCCCGCGCCC ACCGCGCCAC CGACCGCCTG ACGGCGGCCG TGCCCACCTG CCAGGCGCTG CTCGCCCAGG CCGCCGAGCG CGGCGAGCCG CTCGGGCGCA CCACGCGCGC GCTCACCGAC CTGCTCGACC GCTACGGCGC GGGCGAACTG GCCGTTGCCG TCGACGAGGC GCTCGCGCGC GGCGTGCCGC ATCCCAACGC GGTGCGCCTG GCGCTCGAGC GCCGGCGCGA GGCGCCCCCG CCGCTGGGTG TGCCGCTGCC CGCGCATCTG AAGACGCGCG ACGTCACCGT GCGCGCCCAC CCCTTGGCCG GCTACGACCG CCTGCTGGAG GACGACCATG ACGACGCCTG A
|
Protein sequence | MALSPEHEAQ ILRYYHAERW RIGTIAVQLG LHRDTVARVL AQAGLPRHGP VQRASAIDPY LPFLHETLAQ FPRLTAARLY DMVRARGYPG RPDHFRHLIA RHRPRPSAEA YLRLRTLPGE QAQVDWGHFG HLTIGRARRP LMAFVMVLSW SRQIYLRFFL DARMENFLRG HVGAFAHWGA VPRIALYDNL KSAVLERCGN AIRFHPTLLA LAGHYRFEPR PVAVARGNEK GRVERAIRYV REAFFAGRPF ADLDDLNAQA QAWCEGAAGA RRCPEDASMT VTEAFAAERE RLLALPEAPF PTDELRAVSA GKTPYVRFDL NDYSIPHTHV QRPLTVCADP LRVRILDGED VIATHARSYD RRQQIECAAH LEALVAHKHA ARAHRATDRL TAAVPTCQAL LAQAAERGEP LGRTTRALTD LLDRYGAGEL AVAVDEALAR GVPHPNAVRL ALERRREAPP PLGVPLPAHL KTRDVTVRAH PLAGYDRLLE DDHDDA
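The sequence fields are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the database's own procedure: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as starts).

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGGCCCTGT CCCCTGAACA CGAAGCGCAG")
print(translate(fragment))    # MALSPEHEAQ, the first block of the protein sequence
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Running `gc_fraction` and `translate` on the full cleaned gene sequence would allow a comparison with the GC content and protein sequence fields of the record.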
|
| |