Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3166 |
Symbol | |
ID | 7874307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3435769 |
End bp | 3440727 |
Gene Length | 4959 bp |
Protein Length | 1652 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643700095 |
Product | hypothetical protein |
Protein accession | YP_002890139 |
Protein GI | 237653825 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA CGCCTCGGCA GCTTCTGCAC AACCTTTTGG ATTACATACG AGAGCAGGCC AAGAACGTCG ATCCGAAAGG GTTTCGGCTT TCAGCTGCCA AGGGCTTCCT TCGTCGTCGG CCCGAACTCG CAGGCCTCCC GGGCGTCACC TTCGACCTCA AGGTCGAAGG TGACCACACG TGGCTCCAGG TTGAACGATT GGATGCACGC AGACCACCAG CGCCTCCTGA GCATGCAAGT GACCTGCTTG TAGTTGGCGC CGACCCGTTC GGTGCCCCGC CAGTCGTGAG CGAGGCAGGC ATCAAGTCGC GCATCACCAG GCTTCGAGAA CGGAGACCGG ACGATACCTT TGAGGCCGCA GCCGCCGAGA TACGCGCCAC CGCAGACGCA ACCCTGAAAG CATACTCGCC TATCTGGCAG GCCTGGGCTG AGGGCGAACG GCCTCGTAGG AAGACGATTG CGCTGTACGC TGAACTGTTC GCCCTGAAGC ACCAGCTCGA GGCGGAAGAG ACTGCTCGGC CTCAAGAGCT CGTGTGGGGC ATCGGCGTCG CCAGTTGGAA CATCCCGGTC GACTCGGACA GGATCGCGTT CGAATATCCG CTAATCACTC AGGCGCTCGA AATCTCCATC GACGAACGCA CCATGATGCT TGAGGTGCGC CCTCGCTCTA CCGATACTCG CGTCGAGTTG GACGGTTTCA CTGCTTGCGC CGTCATCGGA GCGGCCGAAC TCGAGCTCGC GGCAAAAGAA CACCTGAAGC GGCCTGGCGC CGCGCCAGTG ACGCCTTTTG ATGCATCGAC CTACGGTGAC ATCCTCCGGC TCGTCGCCAG CAACATGGAC AGTGGTGGGC AATATCGGGA AGTGCAGGGA AAGGCTGAGC CCGTCCCACC CCCTGGCTCG AACCTCATCG TCACAGACGA GTGGGTCCTT CTCTCACGCC CCCGGTCCAA CAATTACCTG TTTGACGACC TCCGACGCCT TCAGGAAAAG CTCGAGGCCG GCGTAGAGAT CCCTTCGGGC CCACTGTCCC TCGTTTCTCC CCCTTCAGAT GAGCCGGTCC GCCATTCAGA GATTCGCTTC CGCGGACTAT CGACCCGCGG CTCGGGGAGT GGTGGGGGTG CTGCAAGTAC GCAAGAACTG TACTTCCCGC TGCCCTACAA CGACGAGCAA GTCACCATCA TCCAACAGCT CGAGCAGTCC CCCGGTGTGA CCGTCCAGGG ACCGCCGGGC ACGGGCAAGA CTCACACAAT CGCCAACATC ATCTGCCACT ACCTCGCGAG CGGGAAACGC GTGCTGGTGA CCTCGCGCGG AGAGCAAGCA CTTGAGGTGC TGCAGTCGAA GATTCCCGAG GGTGTGCGCA AGCTCACGGT CTCGCTGCTC GCAAGCGACC GAGAGGGTGT GCGGCAATTC CAGGCATCCA TCGAGGCGAT ACAACACCAG ATTTCACAGC TGAACCCGGC ACTCACCCAA CGCGAGATTT CCGAGCGCCA AAGTGCCATT GACCGCGCAC ACGAGGAACT CGCGCTAATA GACCGGCGCG TAGACGAGAT AGCTGCCACG CAGTTGCAAG AGATCGAGGT GGACGGCGTC CCGCAGCGCG CAGCGAAGAT GGCCGAATTG GTCCTCTCCG GCCATGAACA TTTCGGCTGG TTCGACGATG TCATCGACCT GGACGCCAAA CATGCTCCCC CTCTGACGCA GGAAGAGGCA AGCCAGCTGA GAGAGTCCCG AAGGATCCTG CGCGACGACC TGTGCTATGT GGGCGTTCAG CTACCTGCCA CGGACGCCCT GCCCAGCGTG GCCGACATCG CTCGACTGCA CGAGATTCTT GTCAGAATGC GCCGGCTCGA ATCTCGTATC AGAAGTGGCG GGCTTCTTCC TCTCAAGGCA AGTAGCCCCG AAGTACTGCA GGCCGCTCAA GACTTGCTCG CACGAATCGA CGCTACGCTC GAAGTCGTCA CTGGTCTGGA AGGTCTCGGT GACGCTTGGC CCTTCGCGCT CCGTGAAAAG TGCCGACATG CCAACTTCAA GGCTGAACGC GAGGCCTTGG AGTCGTTGTT CAATGACATC GACACTTTGA TTGAAGCACG CGCAGGTTTC TTGAAACGCC CAGTCGATTT CCCTGAGCGC GGACTCACCT GCCAGAAGAC GAAGGAAGCC GTTAAGCGCG CCGCCGAGAC GGGCAAGCCT TTTGGCTTTA TCGCATTCGG GGCGAGCGAT GCCAAAGAAA ATGTGGGGCA GATTCGGGTT TCGGGCCTTG TCCCACAGTC GACGGCCGAT TGGCAACATG TAGACCGGTA TTTACAGCTG CACCAGGAAA TCCTGTCGTT CCAGGTTCGC TGGAATAACT TTGCGGATGA CCTCTCGCTA CCACGGCTCG AGGGCGGCGT TGCCAGCCTC CGAAGGATCG AGTTGATCGC GAGCGCCGCG AAGAACGCGC ACAAACTGGC GACGGAGCAT GACGCCCAAC TGGTGGCGAA GGCGCAGGAG GTCTTCCAGC GCGCCCCGCT CAAGGCGCTC ACCAGCGATT CAACCGAGCT GCGCTCGGTC CGTGAGCAAC TTGACGACCA CCTGACAAAT GCGACGCTGG CTCAGGCGAC TGTCCAACTC GCAACCCTGC AAGAAAAGCT CGTGGGCACG TCAGGGCCGG TTATCGACAC GCTCAGATCG TTTGTCCTTC GCGAGTTGGG CAAGGTCGAC CTCGAACCAG AGCGCGTTGC GGCAAGGTAT GCGGAGTTGC TGGCAGAGCT TCGTCGCCTC GCGGAGCTCG GCACCCATTT GTGCCGAGTG AACGACTACG CTAGCCGCAT TGAACGAGCG GGTGGCGTGA AGCTCGCGCA GCGAATCCGG ACCGAGGCTG TCGGCATGGC TGGGGAAGAC CCAGTCTTTC CTCCGACATG GCGAGAAGCC TGGAACTGGG CACGGGTACG GGCCCACCTG CAGTCGATTG AAGCACGCGA GGAACTCGTC AGACTGTTCG CACGACGCAA AGACCTGGAA AACGGGCTGG CGCGCCTGTA TCAGGAGCTC GTTGCAAAAT CGGCGTGGCT CTCAACCAAG CGTAACGCAA CGCAACGAGT CCTCCAGGCG CTGGCGGGCT ACGCTTCCGC CATTCGCCGC ATCGGCCAGG GGACCGGCCC GAATGCCACC CGTTATCGAC GGGACGCGCG CGAGGCGATG TTTGAAGCCG CTGGCGCAGT TCCGTGCTGG ATCATGAGTC ACGCCCGAAT CTCCGAGGCG ATGCCGGCCG ATATTGGCGC TTTCGACCTC GTCATTGTCG ACGAGGCAAG TCAGTCCGAC CTGTGGGCGC TGCCGGCGAT TCTGCGCGGA AGGAAGATTC TCGTGGTCGG CGACGACAAG CAGGTCTCGC CGGATGGCGG GTTCATTGCG AGTGCCACAA TTAACAATCT GAAGGCGCGC TTCCTGTCGG ACCAGCCGTA CGGCAACGAC ATGACGCCGG AGAAGTCCCT GTACGACCTG GCGGCACGAG TGTTCGCGGC GAACCAGGTA ATGCTCAGAG AGCATTTCCG TTGTGTCCCG GCCATCATCG CCTACTCGAA CCGTACCTTC TACAAGGACA ACATCCAGCC ACTACGCATT CCGAGCGCTC ATGAGCGCAT CGACCCTCCC TTGGTGGACC TTTACGTCCC ACATGGCATG CGAGATAAGC ACAGCTGCAA CAAGGCGGAG GCTGAAGCAA TTGCCGAGGA GATTGCTGCC ATTCTCGACA ACCCAAGCCT CGTAGGTCGC ACGATTGGCG TTGTCTCTCT GCTCGGCATG GAGCAGGCAA AGCTCATTGA TAGCGTCGTA CGTCAGAGGT GCGACGCGGC CGAACTCCTG CGCCGCCGGT TTGACTGCGG CGACGCACGG ACCTTCCAAG GGAGCGAGCG CGACATCATG TTCCTTTCCA TGGTGGTCGA CAGCAAGAGC TGCAAGGCAC TTTCCGGCAC CATGTACGAC CAGCGCTTTA ACGTTGCCGC GAGCCGCGCC CGCGACCGCA TGTACCTTGT CCGTTCGCTC AAAGTCTCGG ACCTGTCAGA CAAGGACCTG CGTCTCACCC TCGTAAGCCA TTTTGACAAA CCGCTCATTG CCGATGAGCA ACCGGCAGAA CTCCTCATTG ACCTGTGCGA GTCGGGATTC GAGCGCGAGG TCTTTACGAG GCTCTCGGAG CACGGCTATC GGGTCATTCC CCAGGTCAAG GCGGGCGCAT ACCGTATTGA TATGGTCGTC GAAGGTGCGG GAGACGCCCG CCTCGCCATT GAGCTCGATG GCGACGACTA CCACGGACCG GACCGGTGGG CCAACGACAT GGCCCGCCAA CGCGTTCTTG AACGCGCAGG CTGGGTCTTC TGGCGGTGTT TTGCGTCGAC CTGGTCACTG CAAAAGGACG AAGTCTTCGC AGAGCTCCTC TCAAGGCTCC AGGCGATGGG TATCGAGCCG ATTGGGGCGA TGGAAAGAGT ACCCTCGCTG GTCGAAAAGC GAACCTGGAC ACGTTCAGAC GGGGAGCGCT ACGGGGCCGA CGATCCCGCC GCTTACGTCC TGGAGGACGC AATCGCTCAA GCGAACCGCG AGAACGCCGC GGCGGCATCT CCGACGGAAA CCGATGCGCC TCGTTCACCC GAGCCGGCGC ACAGCGATGC CCTACCGACA ACTACCGAGA GCGACATCGC CTACCTGATC ACCGTACAGC AGTACGAGGG CGGACCGTGG TGGACGCGAG CATTCCCTGC GCACCACACC CTTGTTGCTC TTCGTCAGGC AAATGGAGGT CGGCGCGCCT CATCAACCCA GGACGCCGCC GCCGCTGTCA GTGCTCACGT AGATGCAGTC CTCGACGAGT TGCTGGAGAA GAAAATCGCT CCGGCCGCCG AAGCCGCTCA CCCTAAACGT GAGGAGGCGA TTCGCATTGC GACAGAAACA GTGAAGGCGT CAGGGTGGAA GGCCGTCCTG GTCGACTGA
|
Protein sequence | MKNTPRQLLH NLLDYIREQA KNVDPKGFRL SAAKGFLRRR PELAGLPGVT FDLKVEGDHT WLQVERLDAR RPPAPPEHAS DLLVVGADPF GAPPVVSEAG IKSRITRLRE RRPDDTFEAA AAEIRATADA TLKAYSPIWQ AWAEGERPRR KTIALYAELF ALKHQLEAEE TARPQELVWG IGVASWNIPV DSDRIAFEYP LITQALEISI DERTMMLEVR PRSTDTRVEL DGFTACAVIG AAELELAAKE HLKRPGAAPV TPFDASTYGD ILRLVASNMD SGGQYREVQG KAEPVPPPGS NLIVTDEWVL LSRPRSNNYL FDDLRRLQEK LEAGVEIPSG PLSLVSPPSD EPVRHSEIRF RGLSTRGSGS GGGAASTQEL YFPLPYNDEQ VTIIQQLEQS PGVTVQGPPG TGKTHTIANI ICHYLASGKR VLVTSRGEQA LEVLQSKIPE GVRKLTVSLL ASDREGVRQF QASIEAIQHQ ISQLNPALTQ REISERQSAI DRAHEELALI DRRVDEIAAT QLQEIEVDGV PQRAAKMAEL VLSGHEHFGW FDDVIDLDAK HAPPLTQEEA SQLRESRRIL RDDLCYVGVQ LPATDALPSV ADIARLHEIL VRMRRLESRI RSGGLLPLKA SSPEVLQAAQ DLLARIDATL EVVTGLEGLG DAWPFALREK CRHANFKAER EALESLFNDI DTLIEARAGF LKRPVDFPER GLTCQKTKEA VKRAAETGKP FGFIAFGASD AKENVGQIRV SGLVPQSTAD WQHVDRYLQL HQEILSFQVR WNNFADDLSL PRLEGGVASL RRIELIASAA KNAHKLATEH DAQLVAKAQE VFQRAPLKAL TSDSTELRSV REQLDDHLTN ATLAQATVQL ATLQEKLVGT SGPVIDTLRS FVLRELGKVD LEPERVAARY AELLAELRRL AELGTHLCRV NDYASRIERA GGVKLAQRIR TEAVGMAGED PVFPPTWREA WNWARVRAHL QSIEAREELV RLFARRKDLE NGLARLYQEL VAKSAWLSTK RNATQRVLQA LAGYASAIRR IGQGTGPNAT RYRRDAREAM FEAAGAVPCW IMSHARISEA MPADIGAFDL VIVDEASQSD LWALPAILRG RKILVVGDDK QVSPDGGFIA SATINNLKAR FLSDQPYGND MTPEKSLYDL AARVFAANQV MLREHFRCVP AIIAYSNRTF YKDNIQPLRI PSAHERIDPP LVDLYVPHGM RDKHSCNKAE AEAIAEEIAA ILDNPSLVGR TIGVVSLLGM EQAKLIDSVV RQRCDAAELL RRRFDCGDAR TFQGSERDIM FLSMVVDSKS CKALSGTMYD QRFNVAASRA RDRMYLVRSL KVSDLSDKDL RLTLVSHFDK PLIADEQPAE LLIDLCESGF EREVFTRLSE HGYRVIPQVK AGAYRIDMVV EGAGDARLAI ELDGDDYHGP DRWANDMARQ RVLERAGWVF WRCFASTWSL QKDEVFAELL SRLQAMGIEP IGAMERVPSL VEKRTWTRSD GERYGADDPA AYVLEDAIAQ ANRENAAAAS PTETDAPRSP EPAHSDALPT TTESDIAYLI TVQQYEGGPW WTRAFPAHHT LVALRQANGG RRASSTQDAA AAVSAHVDAV LDELLEKKIA PAAEAAHPKR EEAIRIATET VKASGWKAVL VD
|
| |