Gene Information |
Locus tag | Tmz1t_3805 |
Symbol | |
ID | 7874047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4198696 |
End bp | 4199814 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643700747 |
Product | transposase IS4 family protein |
Protein accession | YP_002890771 |
Protein GI | 237654457 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5433] Transposase |
TIGRFAM ID | |
|
|
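The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4198696
END_BP = 4199814


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1119, matching "Gene Length"
print(protein_length(length_bp))  # 372, matching "Protein Length"
```

The strand field does not affect either length; it only determines which strand of the replicon carries the coding sequence.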
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATC TGGACGAAAT CGATGATCCG CGCAAGCCGA GCAACGGCAC CTTGCACGAC TTCCGGGAGA TCCTGGTGAT CCTGATCGCC GCCGTGCTCT CGGACTGCGA CACGGTCGAG GACATCACTT TCTGGGCACG CACCAAGCAG GCGTGGCTGC GTCGCTTCCT CGTGCTCAAG AACGGCATCC CGTCCGAGGA GACCTTCCTT CGGATCCTGC GTGCGCTCGA TCCGAAGCAG TTCGAGAACA TGTTCCGGCG CTGGGTGGGC GGCGTGGTCG GTGCGCTCAG CGACGATGCG GGCCTCGCCG GCACGATCGC AATTGACGGC AAGACCGTGC GCGGCTCGGG CAGCGGCGGC GAGAGCGCGA TCCACATGGT CAGCGCCTTC GCCACCGAGT TGGGACTGGT GCTCGGCCAG GAGAAGGTCG CCGCCAAGAG CAACGAGATC ACCGCGATTC CGGAGTTGCT GGAGGCGCTC TCGATCAAGG GGCTGCTGGT CACGATCGAC GCCATGGGCT GCCAGAAAAG CATTGCCAAA CAGATCGTTG CGAAGAAAGG CGACTACCTG CTGATGGTCA AGGGCAACCA GCCCAAGCTG CTCGAAGCGA TCGAGACCGC CTTCATCGAT CAGCACGGCG TCGCCTCGGT CGACCGCAGT TCGCTGGTCG AGCGCGGCCA CGGCCGCACC GTCGGGCAGA TCGCCTCGGT GCTCTCGGCC AAGGGCATCG TCGATCTGGC CGACTGGCCC AAGTGCGTGA CGATCGGGCG CATCGACTCG ATGCGGGTGG TCGGCGACAA GCAATCCGAT CTCGAGCGGC GTTACTACAT CAGTTCGCGC GCACTGAGCG CCGAGCAACT GGCGGCAGCG GTACGTGCGC ATTGGGGTGT GGAGAACCGG CTTCATTGGA TCCTCGATGT CAGCTTCAGC GAGGACGCCA GCACGGTGGC CAAGGACAAC GCGCCGCAGA ACCTTTCGCT GCTGCGCAAG ATCGCGCTCA ACATCATCCG TGCCGACAAG ACCGACACGC GCAAGAGCAG CCTTCGGCTC AAGCGCAAGG GGGCGGCGTG GGATGACGGG GTGCGGGAGC GCATGCTGGG GATCCGGGCG ATATGCTAG
|
Protein sequence | MSYLDEIDDP RKPSNGTLHD FREILVILIA AVLSDCDTVE DITFWARTKQ AWLRRFLVLK NGIPSEETFL RILRALDPKQ FENMFRRWVG GVVGALSDDA GLAGTIAIDG KTVRGSGSGG ESAIHMVSAF ATELGLVLGQ EKVAAKSNEI TAIPELLEAL SIKGLLVTID AMGCQKSIAK QIVAKKGDYL LMVKGNQPKL LEAIETAFID QHGVASVDRS SLVERGHGRT VGQIASVLSA KGIVDLADWP KCVTIGRIDS MRVVGDKQSD LERRYYISSR ALSAEQLAAA VRAHWGVENR LHWILDVSFS EDASTVAKDN APQNLSLLRK IALNIIRADK TDTRKSSLRL KRKGAAWDDG VRERMLGIRA IC
|
| |
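The sequences above are displayed in space-separated blocks of 10 residues. A minimal sketch of how such a string might be cleaned and sanity-checked follows. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11. Running `gc_percent` on the full gene sequence gives a value that can be compared with the listed GC content; only the short fragments below are checked here.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean(dna)[-3:] in STOP_CODONS


first_block = "ATGAGCTATC"        # first block of the gene sequence above
last_blocks = "GATCCGGGCG ATATGCTAG"  # final two blocks of the gene sequence above

print(clean(first_block).startswith("ATG"))  # True: begins with a start codon
print(gc_percent(first_block))               # 40.0 for this 10-base fragment only
print(ends_with_stop(last_blocks))           # True: ends with TAG
```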