Gene Information |
Locus tag | Tmz1t_0484 |
Symbol | |
ID | 7084995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 543732 |
End bp | 545093 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697514 |
Product | transposase IS4 family protein |
Protein accession | YP_002354156 |
Protein GI | 217968922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
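The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 543732
end_bp = 545093


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len, has_stop=True):
    """Residue count for an unspliced CDS.

    Assumes the CDS length counts the terminal stop codon,
    which does not encode an amino acid.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(start_bp, end_bp)
print(length)                  # 1362, matching "Gene Length"
print(protein_length(length))  # 453, matching "Protein Length"
```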
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGCT TCGAAGTCAA GCAATCCGCC AAGCTCAATC TGACGTCCTA CTCGGGCCTG GCGTTGATCG GGCAGTGCTG CCAGGCGGCA CAGGTCGAGG CGGTGATCGA CCCGAGGCTG CCGGTGTCGC AGGGTATGCG GAGCTCGGAC CTGGTCAAGT CGGTGGTGGG GCTGCTGAGC CTGGGCAAAA GCGACTTCGA GGCGATCGAG CCGTTTCGCG GCGACCGCTT CTTCAAGGAA GCGCTCGGGC TCGCCAAGGT GCCCGGCAGC GTGTGGATGC GCCAACGGCT CGATGCCCGC GCGGCCGAGC TGCGCGAGCT GACCGACGAG CTGAGCCTGC GCCTGCTCGA GCGCACCGAG GCGCCGATCA CGGCGCACAA GGGCTACGTC TGCGCCGATC TGGACACCTT CGTGATGGAC AACTCCGACA CCAAGAAGGA GGCGGTCAGC CGCACTTACC AGGGCGTCGA TGGCTACACG CCGATCGCGC TGTACCTGGG CAACGAGGGC TGGAACCTCG GCCTGGAGCT GCGCGCGGGG TCGCACCACT CGGCGCTGGA GACCGAGTAT TTCTTCGAGC GCGCGTTCCC GCGCCTGCGC CGGGTGTGTG CGGCCGATGC GAAGCTGCTG TGGCGGGCCG ACAGCGGCTT CGACAGCGCC CGGCTGCTGT TCGCGCTGGC CGACGAGCGC GATCGCTGGG CAGCGCTGGG GCGTTCGTTC GACTACCTCA CCAAGTGGAA TCCGCGCCGT CAGGACAAGA CCGCCTGGGT GGACCGGGCC GAGGCCGCCG GCGTCTTCGA GGAAGTGCGC GCGGGCAAGC GGGTGGGGCT GCTGGACCTG AAGATCGACC GTGCCTGGAA GAAGGCCAAG CGCACGCTGC GTCTGGTGGT GCGGGTGACC GAGCGCACGA TCGACAAGAA GGGCCAGCAC CTGCTGACCC CCGAGATCGA GATCGAAGGC TGGTGGACCA GCCTCGAAGT GGCGATGGCT GACGTGATCG AGCTCTACAA GCACCACGGC ACGCACGAGC AGTTCCACTC CGAGATCAAG ACCGACCTGG ACCTCGAGCG CCTGCCCTCG GGCAAGTTCG ACACCAACGA CGCGGTCATG CATCTGGCCG CGTTCGCCTA CAACTGCCTG CGCCTGATCG GCCAACTCGG GCTGACCGGC GAGCTCTCGC CGATCCGTCA CCCGGCCAAG CGCCGACGCA TCAAGACAGT GCTGCAGGAG GTGATGTACC GTGCGGCGAA GTTCGTCGAA CACGCCCGCC GCCTGGTGCT GGACTTCGGA CGCGGCGTCG CCGCGCATGT GAAGGTGTTC ACCACGGTGC AGGCGCGACT GTGCGCGGTG GCTTCGCCGT GA
|
Protein sequence | MPRFEVKQSA KLNLTSYSGL ALIGQCCQAA QVEAVIDPRL PVSQGMRSSD LVKSVVGLLS LGKSDFEAIE PFRGDRFFKE ALGLAKVPGS VWMRQRLDAR AAELRELTDE LSLRLLERTE APITAHKGYV CADLDTFVMD NSDTKKEAVS RTYQGVDGYT PIALYLGNEG WNLGLELRAG SHHSALETEY FFERAFPRLR RVCAADAKLL WRADSGFDSA RLLFALADER DRWAALGRSF DYLTKWNPRR QDKTAWVDRA EAAGVFEEVR AGKRVGLLDL KIDRAWKKAK RTLRLVVRVT ERTIDKKGQH LLTPEIEIEG WWTSLEVAMA DVIELYKHHG THEQFHSEIK TDLDLERLPS GKFDTNDAVM HLAAFAYNCL RLIGQLGLTG ELSPIRHPAK RRRIKTVLQE VMYRAAKFVE HARRLVLDFG RGVAAHVKVF TTVQARLCAV ASP
|
| |
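The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they relate, the Python below strips that spacing, computes GC content, and translates the nucleotides. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and it relies on the fact that translation table 11 uses the same amino acid assignments as the standard code, differing only in permitted start codons. The function names are hypothetical, and only the first 30 nucleotides of the record are used as a check.

```python
from itertools import product

# Standard amino acid assignments in NCBI "TCAG" codon order.
# Translation table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Any trailing partial codon is ignored.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGCCGCGCT TCGAAGTCAA GCAATCCGCC"
print(translate(fragment))   # MPRFEVKQSA, the first block of the protein sequence
print(gc_content(fragment))  # 0.6 for this fragment only
```

Running `translate` and `gc_content` on the full gene sequence would allow a comparison with the listed protein sequence and the reported 68% GC content.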