Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0348 |
Symbol | |
ID | 7085649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 392860 |
End bp | 394560 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643697381 |
Product | transposase IS4 family protein |
Protein accession | YP_002354029 |
Protein GI | 217968795 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0178461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTGA AGCTCACCAC CTCCGGAGGT CGCCGCTACG TCCAACTCGT CGAGTCCTAT CGCGACGAGG CCGGGCGAGT GAAGAAGCGC ACCGTCGCCA CGCTCGGGCG TGCCGAGCAG GTCGATGGTT CGCTTGACGC GGTGATCAAC GGGCTGCTGA AGATCACCGG CCGCGAGCCG ATGGGTGCGA AGCCGCCGGC GCCGACGGTG TCGTTCGAAT CCGCGCGGGC ACTCGGTGAC GTGTGGGCGC TGACCGAGCT GTGGAACTCG CTGGGCTTCT CGGGGCTGCG TCGGGTGTTT AGCCGCACCC GCCACACCAC GGACGTGGAG GCCCTGATTC GCCTGATGGT GCTCAACCGT CTGTGCGACC CCGAATCCAA GCTCGGCGTG CTGCGCTGGG TGCACACGGT GGCGCTGCCC GACTTCAGGC CGAAGGCGGT GACGCACCAG CAGTTGCTGC GCAGCCTCGA TGCGCTCATG GATCACCAGG ACGAGGTCGA TGCGGTGGTC GCCGGGCTGC TGCGGCCACT GATCGATCAG GACTTGTCGG TGGTGTTCTA CGACCTCACC ACGATTCGCA GCGAAGGGCT CAGCCAGATG ACGGGCGATG TGCGCCAGTT CGGCATGGCC AAGGAGGGGC TGATCGCCCG TCAGTTCATG CTCGGCGTGG TGCAGACCGC CGAAGGGCTG CCGATCTACC ATGAGGTGTT CGATGGCAAC GCGGCCGAAA CCAGGACCTT GCTGCCCACG CTCACCAAAG TGCTCGAGCG CTTCCCTGCG GTGCAGCGCC TGGTGCTGGT CGCCGACCGG GGTCTGCTCA GCCTGGATAA CCTCGAGGCC CTGAAGTCCG TGCGTCTGGC CAGCGGCAAG CCGCTCGAAT TCATCGTCGC GGTGCCGGGT CGGCGTTACA ACGAGTTCAT CGACCTGCTC GAACCCTTCC ACGAACAGCA ATGCGTCGGC GCGACCCAGG AAGTCATCTC GGAGCGCGCC TGGAACGCGC TGCGGCTGGT GGTCGCGCAC GATCCGCTCG CCGCCGCCGA CAAGACGCAG CAGCGCAACG CGCGCATCGA TGCGCTGTTG CGTCAGGCCG AGCAATGGAC GGGCAAGCTC ACCGACCAGG ACGAAGGCGT CAAGTATCGC GGTCGCAAGC TCTCGGACAG CGGCGCGAAG GCGCGCTTCT ACCATGTGGT GAGCGAAGCG CACCTGTCGC GCATCATCAA GGTGGATCTG GCCGAGGAGC TCTTCAGCTA CGACATCGAC GACAAGGCCC GGCGCCTGGC CGAGATGATG GACGGCAAGC TGCTGCTGGT CACCAACGCC GAGGGGCTCT CCGCGCAGAA CGTGATTCAG CGCTACAAGT CGCTCGCCGA CATCGAGCGC GGCTTCAAGG TGCTCAAGTC CGAGATCGAG ATCGGCCCCG TGTATCACCG CCTGCCCGAG CGGATCCGCG CGCATGCGTC GATCTGCTTC ATGGCGCTGA TCCTGCATCG GGTCATGCGT CGCCGGCTCA AGGCCGCCGA CGCGGGCTAC ACGCCCGAGC GGGCGCTCGA ACAACTGCAG CGCATCCAGC ATCACCGCGT GCGCCTGAAC GGCGGCGAGC CGGTCGCCGG GGTGTCGACG ATCAGCACGG AGCAGAACGA GGTGCTTCAT GCCTTAGGAA TAGGAAAACC GACGGCGCCG GAGCAGCTGG CGCTGTTGTA G
|
Protein sequence | MHVKLTTSGG RRYVQLVESY RDEAGRVKKR TVATLGRAEQ VDGSLDAVIN GLLKITGREP MGAKPPAPTV SFESARALGD VWALTELWNS LGFSGLRRVF SRTRHTTDVE ALIRLMVLNR LCDPESKLGV LRWVHTVALP DFRPKAVTHQ QLLRSLDALM DHQDEVDAVV AGLLRPLIDQ DLSVVFYDLT TIRSEGLSQM TGDVRQFGMA KEGLIARQFM LGVVQTAEGL PIYHEVFDGN AAETRTLLPT LTKVLERFPA VQRLVLVADR GLLSLDNLEA LKSVRLASGK PLEFIVAVPG RRYNEFIDLL EPFHEQQCVG ATQEVISERA WNALRLVVAH DPLAAADKTQ QRNARIDALL RQAEQWTGKL TDQDEGVKYR GRKLSDSGAK ARFYHVVSEA HLSRIIKVDL AEELFSYDID DKARRLAEMM DGKLLLVTNA EGLSAQNVIQ RYKSLADIER GFKVLKSEIE IGPVYHRLPE RIRAHASICF MALILHRVMR RRLKAADAGY TPERALEQLQ RIQHHRVRLN GGEPVAGVST ISTEQNEVLH ALGIGKPTAP EQLALL
|
| |