Gene Information |
Locus tag | Tmz1t_3781 |
Symbol | |
ID | 7874024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4166572 |
End bp | 4167690 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700724 |
Product | transposase IS4 family protein |
Protein accession | YP_002890748 |
Protein GI | 237654434 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5433] Transposase |
TIGRFAM ID | |
|
|
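The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon. Both assumptions agree with the values in this record but are not stated by it.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4166572, 4167690)
print(length, protein_length(length))  # prints "1119 372"
```

Both results match the Gene Length and Protein Length fields of the record.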
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTGTC TGGACGAAAT CGATGATCCG CGCAAGCCGA GCAACGGCAC CTTGCACGAC TTCCGGGAGA TCCTGGTGAT CCTGATCGCC GCCGTGCTCT CGGACTGCGA CACGGTCGAG GACATCACTT TCTGGGCACG CACCAAGCAG GCGTGGCTGC GTCGCTTCCT CGTGCTCAAG AACGGCATCC CGTCCGAGGA GACCTTCCTT CGGATCCTGC GTGCGCTCGA TCCGAAGCAG TTCGAGAACA TGTTCCGGCG CTGGGTGGGC GGCGTGGTCG GTGCGCTCAG CGACGATGCG GGCCTCGCCG GCACGATCGC AATTGACGGC AAGACCGTGC GCGGCTCGGG CAGCGGCGGC GAGAGCGCGA TCCACATGGT CAGCGCCTTC GCCACCGAGT TGGGACTGGT GCTCGGCCAG GAGAAGGTCG CCGCCAAGAG CAACGAGATC ACCGCGATTC CGGAGTTGCT GGAGGCGCTC TCGATCAAGG GGCTGCTGGT CACGATCGAC GCCATGGGCT GCCAGAAAAG CATTGCCAAA CAGATCGTTG CGAAGAAGGG CGACTACCTG CTGATGGTCA AGGGCAACCA GCCCAAGCTG CTCGAAGCGA TCGAGACCGC CTTCATCGAT CAGCACGGCG TCGAGTCGGT CGACCGCAGT TCGCGGGTCG AGCGCGGCCA CGGCCGCACC GTCGGGCAGA TCGCCTCGGT GCTCTCGGCC AAGGGCATCG TCGATCCGGC CGACTGGCCC AAGTGCGTGA CGATCGGGCG CATCGACTCG ATGCGGGTGG TCGGCGACAA GCAATCCGAT CTCGAGCGGC GTTACTACAT CAGTTCGCGC GCACTGAGCG CCGAGCAACT GGCCGCAGCG GTACGTGCGC ATTGGGGTGT GGAGAACCGG CTTCATTGGA TCCTCGATGT CAGCTTCAGC GAGGACGCCA GCACGGTGGC CAAGGACAAC GCGCCGCAGA ACCTGTCGCT GCTGCGCAAG ATCGCGCTCA CGATCATCCG CGCCGACAAG ACCGACACGC GCAAGAGCAG CCTTCGGCTC AAGCGCAAGG GGGCGGCGTG GGATGACGGG GTGCGGGAGC GCATGCTGGG GATCCGGGCG ATATGCTAG
|
Protein sequence | MSCLDEIDDP RKPSNGTLHD FREILVILIA AVLSDCDTVE DITFWARTKQ AWLRRFLVLK NGIPSEETFL RILRALDPKQ FENMFRRWVG GVVGALSDDA GLAGTIAIDG KTVRGSGSGG ESAIHMVSAF ATELGLVLGQ EKVAAKSNEI TAIPELLEAL SIKGLLVTID AMGCQKSIAK QIVAKKGDYL LMVKGNQPKL LEAIETAFID QHGVESVDRS SRVERGHGRT VGQIASVLSA KGIVDPADWP KCVTIGRIDS MRVVGDKQSD LERRYYISSR ALSAEQLAAA VRAHWGVENR LHWILDVSFS EDASTVAKDN APQNLSLLRK IALTIIRADK TDTRKSSLRL KRKGAAWDDG VRERMLGIRA IC
|
| |
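The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship, not the database's own pipeline. The helper names are hypothetical. It assumes the gene sequence is shown on the coding strand (it begins with ATG, so no reverse complement is applied) and that only the sense-codon assignments of table 11 are needed, which are the same as in the standard code. Alternative start codons, which table 11 also permits, are not handled.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence codon by codon ('*' marks a stop)."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First eight codons and last three codons of the gene sequence in this record.
print(translate("ATGAGTTGTC TGGACGAAAT CGAT"))  # prints "MSCLDEID"
print(translate("ATATGCTAG"))                   # prints "IC*"
```

The two fragments reproduce the first eight and last two residues of the listed protein sequence, followed by the stop codon. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.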