Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3179 |
Symbol | |
ID | 7874319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3455330 |
End bp | 3457057 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643700107 |
Product | protein of unknown function DUF262 |
Protein accession | YP_002890151 |
Protein GI | 237653837 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAGC AAAACATCCC CATCGGCACG CTGGTAGATA TGTATAAGCG CGGCGAGCTG CGCCTGCCTG AAATCCAGCG TCACTACGTC TGGCAAGCCA CCCGCGTCCG CGACCTGCTG GACTCGCTCT ACCGTGGCTA CCCCAGCGGC TCCATCCTGA TGTGGGAGAC CGAAGAGCCG GTGCCGACGC GCGATTTCGC CATCGCGCAG GAAACCAATG CCTTCGCCGG CCGCAAGCTA CTGCTCGACG GCCAGCAGCG CCTGACCTCG CTGACTGCCG TGTTGGGCGG CGAGATGGTG TCAGTCCGTG GCCGTAAGCG CCCCATCGAC ATCCTGTTCA ACCTCGAACA CCCCGAGGGG CCGCCCACCG ACGACACCGA GGTCGAAACC GATGAGCCTT CACCGCTCAC CCCGGACGAC GAAGTGAGTG ACGAGGCGGA GGAGTCCGAC GAGGCTGAAC AGGGCCTGCA GGAAAAGCTC AACCGCCGCA CCTTCGTGGT GGCGAGCAAG AACCTGCTGT CGCAACCCAA TTGGGTGTCG GTGAGCCATG TTTTCGGCAC GGCGAACGAC ACCGATATAT TGAAGAAAGC GGGTATCAAG GACCTCGACG ACCCGCGTTA CCAGAAGTAT TCCGACCGCC TGAAGAAGCT GCGTGCCATC AAGGACTACC AATACGTGGT CCACGTGCTC GAACGTGCCA TGAGCTACGA GGAGGTCACG GAAATCTTCG TGCGGGTGAA TTCGCTGGGT GCGAAGCTGC GTTCGTCGGA CCTGGCGCTG GCACAGATGA CTTCGCGCTG GCCGAATCTG CTGAAGGAGC TCGAGGCTTT CCAGGAGGAG TGCGAGCAGA GCTGGTTCAC GATTGAGCTT GGCCATCTTG TGCGGGCCAT CGTCGTCTTC GCCACTCAGC AGTGTCTGTT CCGCTCGGTG GCGAGCACGC CGGTGGACAA GCTCAAGGCG GGCTGGGAGC AGGCCAAGGA AGGACTGCGC TACGCCATCA ACTTCCTGCG CACCAATGTT GGCATCGAGG ATGAATCCTT GCTGTCCTCG CCGATGTTCA TCCATACGCT CGCGGCTGTG AGCCGGGTGA AGGACAACAA GCTCACCGCC GACGAGCAGA ACAAGCTGCT GCACTGGCTG CTGGTGGCCA ACGCGCGTGG TCGGTATTCG CGGGGCTCGA CCGAAACCCT GCTGAACGAG GATTTGGCCA TTGTTTTCCG CGACCAGGAC ATTGGGAAGC TGATGGAGCC GGTCAAGCGC CAGTTCGGCC GCTTGCATGT GGAGCCGGGC GACTTGGCGG GCCGCGGGGT GAACAGCCCT CTGTTCTCGC TGGCCTACCT GGCGTTGAAG GCCTCGGGCG CCAAGGACTG GTACAGCGGC CTGGGGTTGT CGCTGACCCA TCAGGGCAAG CTCCATTTCA TCCAGTGGCA CCACATCATC CCGAAGTCGC TGCTCAAGGC GCAGGGCTAT GAGACCGGTG AAATCAACGA AATCTCCAAC ATGGCCTTCA TCACCGGCCA GACCAACCGG CGCATCAGCA ACAAGGAGGC GACCGATTAC CTGGCCAACA TCGTAGACAA GCAGGGCGCG CAGGCGCTGA CCAGTCAGTG TGTGCCGACC GACCCTGAGC TCTGGGCGAC GGCGCGTTAT CGAGACTTCC TTCAGCAGCG CCGGGTCGCG CTGGCGGAGC GGATGAACAG CTTTATCCGG GAGAAAGCCA AGCTATGA
|
Protein sequence | MQQQNIPIGT LVDMYKRGEL RLPEIQRHYV WQATRVRDLL DSLYRGYPSG SILMWETEEP VPTRDFAIAQ ETNAFAGRKL LLDGQQRLTS LTAVLGGEMV SVRGRKRPID ILFNLEHPEG PPTDDTEVET DEPSPLTPDD EVSDEAEESD EAEQGLQEKL NRRTFVVASK NLLSQPNWVS VSHVFGTAND TDILKKAGIK DLDDPRYQKY SDRLKKLRAI KDYQYVVHVL ERAMSYEEVT EIFVRVNSLG AKLRSSDLAL AQMTSRWPNL LKELEAFQEE CEQSWFTIEL GHLVRAIVVF ATQQCLFRSV ASTPVDKLKA GWEQAKEGLR YAINFLRTNV GIEDESLLSS PMFIHTLAAV SRVKDNKLTA DEQNKLLHWL LVANARGRYS RGSTETLLNE DLAIVFRDQD IGKLMEPVKR QFGRLHVEPG DLAGRGVNSP LFSLAYLALK ASGAKDWYSG LGLSLTHQGK LHFIQWHHII PKSLLKAQGY ETGEINEISN MAFITGQTNR RISNKEATDY LANIVDKQGA QALTSQCVPT DPELWATARY RDFLQQRRVA LAERMNSFIR EKAKL
|
| |