Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3892 |
Symbol | |
ID | 7873541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4288511 |
End bp | 4290196 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700832 |
Product | transposase IS4 family protein |
Protein accession | YP_002890855 |
Protein GI | 237654541 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCTGA GGCAGAGCAA ACAGAAGCGG GCAGACGGGC GGGCCATCAC CTACCTGCAA CTGGCCGAGA ACGTGTGGGA TGCCTCGAAG GGGCGCTCAC AGGCCCAGAT CGTGTTCAAC TGCGGCCGTG CCGACGACCC GGAGGTGATC GAGCGGCTGC GCCGTCTGGC CAAGAGCATC CTGCGCCGAT GCTCGCCCGA GGCCATCGTC GCGGACGATC CAGGCTGGCG GTTGGTGTGC GCCTGGCCCT ACGGCGATGT GTACGCGCTT CAGGCGGTGT GGCGGCGGCT CGGGATCGAC GCCATCGTGC GTGCGCAGGC CAAGGGGCGG CGCTTTGGCT TCGAGATGGA GCGGGCCTTG TTCGCGCTGG TGGCCAATCG CGCTTGTGCG CCGGCCTCTA AACTCTACTG TCACGAGCAG TGGCTCAAGG AAGACGTGCA TATCGAAGGC ACGCAGGCCC TGGCGCTGCA CCAGCTCTAC CGCGCGATGG ACTTCCTCGA GGCGAACAAG GCTGCCATCG AGCAGGCGAT CTTCTTCCAG GTCGCCGACC TGTTGAGTCT GGACGTGGAG ATCGTCTTCT ACGACACCAC CTCATTGCAC TTCGAGATCG ACAACGAGGA TGAGGGCGAG CCCGATGGCC AGATGCGCGG CAGCGTTGCG GCCGGCGCCA AGCGCTACGT GGCGCCCAGA AAGCGCGGCT ACAGCAAGAA CGGCCGGGGT GATGCGCCGC AGATCGTCGT CGGGCTGGCG GTCACCCGCG ACGGCTTCCC GGTGCGCCAC TGGGTGTTTC CGGGCAACAC CGTCGATGTC ACCACGGTGG CCCAGGTCAA GGAAGATCTG AAGGGCTGGC AGCTCACCCG CTGCCTGTTC GTCGGGGATG CCGGGATGGT CTCGCAGGCG AACTTCCAGG CGCTCGCCAA GGGCGGTGGC AAGTACCTGA TGGCGATGCC GATGCGCCGT GGCGACGAGG TCACCGAAAC GGTGCTCTCG CGCCCGGGCC GCTACCGCAA GATCGCCGAC AACCTCGAAG TCAAGGAAGT CATCGTCGGC GACGGCGAGC GCCGCCGCCG CTACGCGGTG TGCTTCAACC CGCAGGAGGC ACATCGCCAG CGCACCTATC GCGCCGAGCG CATCCGTGAG CTCGAGGCCG AACTCGCCTG CCTGGCCGAT CAGGACGAGG GCGGGCACAG CAAGCGCGTG TGTGCGCTAC GTAGCAGCGC CCGCTACGGG CGGTTCTTGA AAGAGACCAA GCGCGGCCTG GCGATCGACC GCCAGGCCAT CGCCGAACTC GAACGCTTCG ACGGCAAGTT CGTCGTCCAC AGCAACGACG ACACCCTCAC GGCCGAGGAC ATGGCGCTCG GCTACAAGCA GCAGCAGCGC GTCGAAGAGG CCTGGCGCAC GATGAAGGGC GGCCTGCGCA TGCGCCCAGT GTTCCACTGG GCACCGCACC GCATCCACGC CCACATTGCG ATCACGGTGC TCGCCTTGCT GCTGGAGCGC GTGATCGAGC ATGCCTGCCA GGACACCTGG CGCAACATCC GCGACGACCT CAAGCGCATT CAGCTTGCGC AATTGTCCAG CCCCAACGGC ACCGCCTGGC AGGTCACCGA GCCTACGACG GAAGCGGCCA ACCGACTGAA AGCATTGAAG ATCAAGCCGC CGCCGGCCAT CCTCAAGCTC GACTGA
|
Protein sequence | MYLRQSKQKR ADGRAITYLQ LAENVWDASK GRSQAQIVFN CGRADDPEVI ERLRRLAKSI LRRCSPEAIV ADDPGWRLVC AWPYGDVYAL QAVWRRLGID AIVRAQAKGR RFGFEMERAL FALVANRACA PASKLYCHEQ WLKEDVHIEG TQALALHQLY RAMDFLEANK AAIEQAIFFQ VADLLSLDVE IVFYDTTSLH FEIDNEDEGE PDGQMRGSVA AGAKRYVAPR KRGYSKNGRG DAPQIVVGLA VTRDGFPVRH WVFPGNTVDV TTVAQVKEDL KGWQLTRCLF VGDAGMVSQA NFQALAKGGG KYLMAMPMRR GDEVTETVLS RPGRYRKIAD NLEVKEVIVG DGERRRRYAV CFNPQEAHRQ RTYRAERIRE LEAELACLAD QDEGGHSKRV CALRSSARYG RFLKETKRGL AIDRQAIAEL ERFDGKFVVH SNDDTLTAED MALGYKQQQR VEEAWRTMKG GLRMRPVFHW APHRIHAHIA ITVLALLLER VIEHACQDTW RNIRDDLKRI QLAQLSSPNG TAWQVTEPTT EAANRLKALK IKPPPAILKL D
|
| |