Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0960 |
Symbol | |
ID | 7085063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1052784 |
End bp | 1056191 |
Gene Length | 3408 bp |
Protein Length | 1135 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643697982 |
Product | hypothetical protein |
Protein accession | YP_002354622 |
Protein GI | 217969388 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGAC CACGCAAGAC CCCTGCCGCC GGTTCGGGCG CAAGCGGCGC AGCCGGTTCC AGTCGCGGCG GCAAGAAGCG CAGCTTCCAT CAGGAATTGG TGCTCAACCG CTGGGTGCTG GGTTTCTTTC AGGGCGGCAC GCTCGCGGCA CTCAAGATGC GCTTGGGCGA TGACCGCTTT GAGGGCATCG ACGAGGACGG CCAGACCAAG TTCTTCCACG AACTGATCCG AGGCTTGTTC GATCCCAACA AGGTGCCCGA GGCCGACTTG CGACGCTACG ACTTGAACGT CGTCGCCCAC TGGCAGGCCA TTACCGCCCA GCGCAACAAG CTGGAAGGCC ACGAACTGCA GATGAAGTAC TTCCAGTACC TCTCGCTGCT GTTCACCGAG CTGTATCTGG ACTGGTACTT CAACCACCGT CAGGACTTGC TCGATGGCCT GAACGAGGAA ATGACGCGCT ACCGCGCCGA GACGGGCGCG GAACCGTTCC GCGATTTCGA GGCGGACGAC CTGAACAAGA TCGCCTTCTG GAATGCCACC GGCAGCGGCA AGACCCTGCT GCTGCACGTC AACATCCGGC AGTACCTGCA CTACTTCCAG GCGGGCAGCA GTGGGCAAAT GCGCGATCAC TTTCCCGACA AGATCATCCT GCTCACGCCC AACGAAGGGT TGAGCCGTCA GCATCTGGAG GAGCTGCACC TGTCTGGCTT CGGGTTCTCG CAGTTCTTCA ACAAGGCCCA GTCACCGGCG CGCGGCACCA TCGAAATCAT CGATATAAAC AAGCTGGGCG ATGAGATGGG AGACAAGACC GTCGCCGTGG ATGCCTTCGA GGGCAACAAC CTGGTGCTGG TGGACGAAGG CCACCGTGGT ACGGGTACGG CGGCGGGCGC GTGGATGGCG CGGCGCGATG CGCTGGTGCG CGGTGGTTTT GCCTTCGAGT ATTCGGCCAC ATTCGGGCAG GCGGTGGCTA AGGGGCTGAT CGTTGAAAAG GCGGAGGAGG AAAGCCTCAA GTCCAAATGG AAGCTGGCTT ACCCTGGGCA ACGCTTCAAC CTTGCCTCCG CCAAGGCGGA TACGGGTTTG GCATTGACCG CCGAGGATAA GCGCCGCGCG CGTATCACTG CTACGCGCGA AATCTACGCC AAGTGCATCC TGTTCGATTA CTCGTACAAA TTTTTCTATG AAGACGGCTA CGGCAAGGAG TCGCTGATCC TCAACATGAA CGGCGAGGCC TACGAGCAGG CGGACAACGC GCGCAAGTAT TTCACCGCCT GTCTGCTGGC CTACTACCAG CAGCTCTGGC TGTGGAGCAC GCACCGTTTG GCGCTTGCCG ACTTCAACAT CGAAAAGCCG CTGTGGGTGT TCGTGGGCAA CACGGTGTCA GGCGAGGAGT CGGACATCCT GGAAGTGGTG AACTTCCTCG CCGATTTCCT GAACAGCGAC ACGCAGATCA AGAGCTGGCT GACCGACCTG ATTGCGGACA AGGCGCAGAT TCTCGATGCC AAGGGCAAAA ACATCTTCAG CGGGCGCTTC ACACCCTTGA TGGGCTTCAG CGGTCGTGTG GACGAGCTGT ACGCCGACAT CCTGTTGCGC GTGTTCAATG CCTCGGCCCG CCAGCGCTTG AAACTGGTCA ACATCAAGAG CAGCAAGGGT GAGCTGGCGT TGCGCGTGGG CGATGCAGAG CCCTTCGGCC TGATCAACAT CGGCGACGAT GCAGGCTTCT TCGGGATGGC GGAGGACGTC GAGACCTTCG ACAGCGAACG CGACGACTTC GGCGGCGCGC TGTTCGGCAC ACTGAACGAC AAAGACAGCC GCCTCAACGT GTTGATCGGA TCACGCAAGT TCACCGAAGG CTGGAGCAGT TGGCGCGTGT CCACGATGGG TCTGCTGAAC ATGGGCCAGG GCGAGGGCTC GCAGATCATC CAGTTGTTCG GACGCGGGGT GCGCCTCAAG GGCAAGGGCT TCTCGCTTAA GCGCAGCTTG CCCCAGGACA GGCCCAAGGG CGTGCATCTG GACAAGCTGG AGGCGCTGAA CATCTTCGGC GTGCGCGCCA GCTACATGGC CGCGTTCAAG GACTACCTGC GCGAGGAAGG CATCACGCCC AGCGATGAAG TGCTGACACT GGACTTCGAG ACCAAAGCCA ACCTGCCCGC CAGTAGATTG AAGACCTTGG CGCTTAAGGA TGGCTACAAG GACAACCAGA AACTCGGGTT CAAGCGCACG CACTTCCCGT GGCTGTACGA AATTCCCGCG CAGTTCCAGG GCAAGATCAA GCCCCCCCAT GTGGTACTCG ATCTTTACCC GCGCGTCGAG GCGCTGTCCA GCAAGGACAA GGGCACAGCC CCCACGACGG AAGCCCGCAA CAAGGGCAAG CTGAACCAGA CACTGTTCCC GGCTTTCAAC TGGGATCGGG TCTATCTGGC CTTGCAGGAT TACAAGCTGC AGCGCAGCTG GAGCAACTTG CGGCTGGAAC GTCAGAAGCT GATCGACTTC TGCGCGGGCG CGCAGGACTG GTACACGCTG TTCATTCCGG CGGCGGAGCT GAACGTGAAG ACCTTCGCCG ACATCCGCAA GCAGGAAGAC ATCCTGCTGC GACTGCTGAC GGACTACACC GACCGTTTCT ACAAGGTCCT GAAGACGGGC TACGAAGGCC AGTTCTACGA CATCGCCCAC ATCGATGAAG ATCACGGGTC GATGCTCAAG CTGTACCAGT TCGAGATCGA GAACAGCGAC GATGGTCTGG AGTATCAGGC GAAGCTGGAA GTGCTGAAGA AGCTGGTGAC GGACGGCAAG ATCGGCGAGG CCAGCAAATG GAACGCGCCG CACATGGTGG CCATCAGCTT CGACCGGCAT TTGTACTACC CGCTGCTGGC ATTGGAGGAC AAGGATGCGG TGCCGCTGAA GCTGCGTCCG CTGGCGTTTG ACGCGCCGAG CGAATGGGAG TTTGTCCGCG ATCTGGAGGC GTTCTACAAC TCCGGCGAAG GCAAGGAAGC GATTGGCCCA CGCAGTCTGT ACCTATTGCG TAATGCGGAT CGGGAAGAGA AAGGGCTGGG CTTCGCGTTG GCGGGTAACT TCTATCCGGA CTTCCTGCTG TGGCTGGTGG ACGATGCCAG CGGCAAGCAG TGGCTGACCT TTGTCGATCC GAAGGGACTG CGCAATCTCG ACCTGTCGCA TCCCAAGCTG GGTCTGTACA AGGAAGTGAA GACCCTGGAA ACAACGCTGG CGGCGCAGGC CACGGCGGGC GAAGCACCGC TGGTCTTGAA TGCTTTCGTG CTGTCACCGA CGAAGTTCGC TGATCTGCTC AACGTTGGAG ATCCGACGAA GAAGGCCGAT CTGGAAAACC GCCACGTACT GTTCATGGAG GATGGCAGGG ACATGTACTT AAAGAAGCTG TTTACGGTAC TGACATAG
|
Protein sequence | MARPRKTPAA GSGASGAAGS SRGGKKRSFH QELVLNRWVL GFFQGGTLAA LKMRLGDDRF EGIDEDGQTK FFHELIRGLF DPNKVPEADL RRYDLNVVAH WQAITAQRNK LEGHELQMKY FQYLSLLFTE LYLDWYFNHR QDLLDGLNEE MTRYRAETGA EPFRDFEADD LNKIAFWNAT GSGKTLLLHV NIRQYLHYFQ AGSSGQMRDH FPDKIILLTP NEGLSRQHLE ELHLSGFGFS QFFNKAQSPA RGTIEIIDIN KLGDEMGDKT VAVDAFEGNN LVLVDEGHRG TGTAAGAWMA RRDALVRGGF AFEYSATFGQ AVAKGLIVEK AEEESLKSKW KLAYPGQRFN LASAKADTGL ALTAEDKRRA RITATREIYA KCILFDYSYK FFYEDGYGKE SLILNMNGEA YEQADNARKY FTACLLAYYQ QLWLWSTHRL ALADFNIEKP LWVFVGNTVS GEESDILEVV NFLADFLNSD TQIKSWLTDL IADKAQILDA KGKNIFSGRF TPLMGFSGRV DELYADILLR VFNASARQRL KLVNIKSSKG ELALRVGDAE PFGLINIGDD AGFFGMAEDV ETFDSERDDF GGALFGTLND KDSRLNVLIG SRKFTEGWSS WRVSTMGLLN MGQGEGSQII QLFGRGVRLK GKGFSLKRSL PQDRPKGVHL DKLEALNIFG VRASYMAAFK DYLREEGITP SDEVLTLDFE TKANLPASRL KTLALKDGYK DNQKLGFKRT HFPWLYEIPA QFQGKIKPPH VVLDLYPRVE ALSSKDKGTA PTTEARNKGK LNQTLFPAFN WDRVYLALQD YKLQRSWSNL RLERQKLIDF CAGAQDWYTL FIPAAELNVK TFADIRKQED ILLRLLTDYT DRFYKVLKTG YEGQFYDIAH IDEDHGSMLK LYQFEIENSD DGLEYQAKLE VLKKLVTDGK IGEASKWNAP HMVAISFDRH LYYPLLALED KDAVPLKLRP LAFDAPSEWE FVRDLEAFYN SGEGKEAIGP RSLYLLRNAD REEKGLGFAL AGNFYPDFLL WLVDDASGKQ WLTFVDPKGL RNLDLSHPKL GLYKEVKTLE TTLAAQATAG EAPLVLNAFV LSPTKFADLL NVGDPTKKAD LENRHVLFME DGRDMYLKKL FTVLT
|
| |