Gene Information |
Locus tag | Tmz1t_0031 |
Symbol | |
ID | 7083414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 37900 |
End bp | 39534 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697081 |
Product | TIR protein |
Protein accession | YP_002353730 |
Protein GI | 217968496 |
COG category | |
COG ID | |
TIGRFAM ID | |
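The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 37900
end_bp = 39534


def gene_length(start, end):
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields of this record.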
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCG CAGCGAAGAT CGCCCCGAAG GTGTTCGTGT CCTATGCCCG GCAGGACTGC AGCACGCTGG CCGAGGAACT GGTCACGGCG CTTGAGTTGC TTCAGTTCGA GGGCTACCTG GACCGCAGCG ACATCGCGGC GGGGGAGGAC TGGGAGCATC GGCTGGATGC CCTGATCCGC CAGGCGGATA CGGTGGTCTT TGTCCTCTCG CCACGGTCGG TGCAGTCCGA GCGTTGTGCA TGGGAGGTGC AGCGCGCGTT GGCACTGTCC AAGCGCATCA TCCCGGTGGT GGGCATGGCG GTCGATGACG CCTCGGTCCC GGCGCCGCTG CAACGGCTCA ACTACATCCA TTTCACTGCT GGCCACTCAT TCGCGCGCTC CCTCGGTCAA CTGGCCGACG CGCTGCGGCT GGACATCGGC TGGATACGTG AACACACGCG ATTGGGTGAG CTCGCTCTTC GATGGAATGA ACGGCAGCAG CCTGATGCCT TGCTGCTGCG CGGCGACGAG TTGTCTGCCG GGCAGGCGTG GATGGCGAAC TGGGCGCCGG AGTTTCCGCC CGTCACCGAG TTGCAGCGCA GCTTCATCGC CGCGAGCGCG GACGCGCAGA CCCGGCGTGA GAGTCTGGAA CGGCAGCAGA ACGAGGCGAT CGCCAAAGCC AACGCCGAAC GCGCCGAGGC CCTGACCCGC CGCGAGGAGG CGTTGTTCTC GTTGAAGCGC CGCACCTTGC TGGGCGGCGT GCTCGCGGTG GCGCTCTCTC TCGGGCTCGG CGGCATGGCC TGGTGGTCGC TGCAGCTTCG CCGCCGCGCC GAGGAGGCCG AACGGACAGC CATCGACGAG CTTGTTCGCC GCGAGGCCAT GCGCACGGAC ATCAGCGGAC AGATCGTGGC CTACGCCACA TCGCCCGGCC AGTGGGCGAT GGACAGTGGC GTGGATGGCC ACTCGCCCTA CACCGGCACC TTGCTGCGGG AACTGCAGTC GCCCGACATC TCGTTGTGGG TGGCGCTATC CAGGACGACG ACCCAGGTCG CGAAAGCCAC CAACGGCAGT CAGCGGCCGT TCATTTCGTC CGACATGAAC GGTGACGTGT TCCTCGGCCA CCCCTCACCC ACGCGTCGTC TGCGCGCGCT CGTCATTGGC GCAGGACGGT TCCAAATGGC GACCGACCTA TCCTTTGAGG GGGCGTACAA GGATGCCGAT GCCTGGGGCG CGTTTCTGGC CGGGCGCGGA TTCAGCGTGC AAACGCTGCG CGATCCGACG CGGGCGTCCG TGCTGGCGTC GATCGAGGCA CTTCGCGTTT CCGCTCTAGA CGAGGCGGAC GCGTCGATTC GACGCGTGGG CATTGCTCTG CAGCCGGATG GCACGCAGGC CCAGCCGTCA CTCCCCGTGC CTCGTCGAAC ACGGCCCGAC GCCGAGCCGG CGCACGATGC TCTGATCGTC TTCTTTTACG CCGGCTACGG CTTCCGCGCG GGCGCCGAGC GTTTTCTCGC CGTGTCGGAC ACGGCATTCG ACACCGCGAA AACCGGGCTG GTGACGGAGC CCGGTGCGAC GGCGGTGTCG GTGGACGATC TCGAAAAAGT GCTGCGCGAA GCGGCCGCCG CTTCGGTGGT GATCCTGGAC ACGAACTTTA TCTAG
Protein sequence | MDIAAKIAPK VFVSYARQDC STLAEELVTA LELLQFEGYL DRSDIAAGED WEHRLDALIR QADTVVFVLS PRSVQSERCA WEVQRALALS KRIIPVVGMA VDDASVPAPL QRLNYIHFTA GHSFARSLGQ LADALRLDIG WIREHTRLGE LALRWNERQQ PDALLLRGDE LSAGQAWMAN WAPEFPPVTE LQRSFIAASA DAQTRRESLE RQQNEAIAKA NAERAEALTR REEALFSLKR RTLLGGVLAV ALSLGLGGMA WWSLQLRRRA EEAERTAIDE LVRREAMRTD ISGQIVAYAT SPGQWAMDSG VDGHSPYTGT LLRELQSPDI SLWVALSRTT TQVAKATNGS QRPFISSDMN GDVFLGHPSP TRRLRALVIG AGRFQMATDL SFEGAYKDAD AWGAFLAGRG FSVQTLRDPT RASVLASIEA LRVSALDEAD ASIRRVGIAL QPDGTQAQPS LPVPRRTRPD AEPAHDALIV FFYAGYGFRA GAERFLAVSD TAFDTAKTGL VTEPGATAVS VDDLEKVLRE AAAASVVILD TNFI
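Both sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the sample uses only the first two display blocks of the gene sequence for illustration.

```python
def clean_sequence(grouped):
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence above (illustration only).
sample = clean_sequence("ATGGACATCG CAGCGAAGAT")
print(len(sample), gc_percent(sample))
```

Passing the full Gene sequence field through the same two functions is one way to check it against the Gene Length and GC content values listed in this record.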