Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2754 |
Symbol | |
ID | 7873494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2981172 |
End bp | 2984129 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699676 |
Product | hypothetical protein |
Protein accession | YP_002889731 |
Protein GI | 237653417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGA CGGAGGATGC CGGGCCGGCC ACGCCGACGA CGGTCGAGCT CGAACTCGAA CTCAGGGTAG GCGGCCGTCC ACTTGGGTTG CGCGGGACGC TCACCGGTGG CGGGCTGACG GGCCTTGCGG GGACACTGCG CGGCGAGAAG ACCGATACCC TGGGCGGCCT GCTCGCCGGT CTCGGCGAGG CCTTCGCCGG CCCCGGCGCG GCGCTCGATG CCCTGGGCGG GGGGGGCTTG GCGGCGATCC ATTTCGAGGC GGTCGAGCTC GCCTATCGCA GTACCGCAAC GCCCTGCGTC GTGCTGAACA CCACCCTGGG CGTCGGGGTC GGCCACACCC GGATCGTCGT CGCAAAGCTC CTTCCCGTCC CCGGGCGTGC GGAGGATGAA GGCGGCCTGG TGGCGGGTTT CGAGCTCGAG CTCGAACCGC GGCCGATCGG TGACAAGGGC CTCTCGGAGC TGATCGGCGG GATCGTGGTA GAGCGCCTCG GCGTCTTCCA TGCGAGCCGG GACTGCAGCG GGCTGCACCT CGAGCCGGCG ACCGCGCCGG GCGCCTTCGC GCCCCTCGTC GAGCGCGCCG GGGAGGGCCG CAGCCTGCAC GCGGGGCTGA ACCTGTCGGG GCGAGTGCTC ATCGGTGGCG TCGATCTGCT CGAAGGTTTC GGGCACGACA CCGGCAATAC CCTCGACGCA CAGGCCCCGG CCGCCGAGCC TTCCCCCGCA GCCGCCCCCG CACTCGCATC GCCCGCGCCC GAAGCACCCC TGCCGGCGCG CGGCAGGGCG TTCTGGAAGC GCATCGACAA GCGCATCGGT CCGCTGAGCT TCCGCCGCGT CGGGCTGGTC TACGAGTCCT CGCGGCTCGC CATCGCCCTC GATGCCGGCC TGCGCATCGG CGGCCTGTCC TTCGAGCTGG TCGGCTTCGG CGTCGATTGC CCGATCGCGG CGTTGCTGAA GGGGCCCCAC GCGGCGATGG ACGGCCTCGG CGTCCGGCTC GAGGGCGCGA TGCTCGGGCT CGAGACCGGG GCGATCGCGA TCAGCGGCGG GTTGACGCGC GCGCCGGGAG AGACGCTGCG TCTGGACGGC AGCCTGCGCG TGAGCACGCC GGTGCTCAGC GTGAGCGCGA TAGGCTCCTA CGAGCGCATC GAGGGCGCGC ACTCCTTCTT CGCCTTCGCC GCCTTGCACA AGGAGATCGG CGGTCCGCCC TTCTTCTTCG TCACCGGGCT GTGCCTGGGC ATCGGCATCC ACCGCCGGCT CACCCTGCCG CCGATCGAGC AGGTGCACAG CTTCCCCTTG CTGCGCGCCG CGGCCGAGCC GGATTACCTC GGCAAGGATG CGGATCCGCG TCGCATCGGC GCGCTGGTGG ACACCTGGCT CGCCCCCGAG CGCGACAGCA TGTGGATCGC CGGCGGCGTG CGCTTCACCT CCTTCGGCAT CGTGGAGTCG GTGGCGATGC TGTCGGTCGC CTTCGGCAAC CGCCTGGAGA TCGGCGTGCT CGGCCTGTCC CGCCTGCAAC TGCCGCGCAA GGCGGGCGGC GAGGCTGCCA TGGCCTGCGT GGAGATGGAA CTGCGTGTGG TGATCGCGCC CGACGACGGC CTGGTCGCGG TCGAGGGCCG GCTCACCGAG AACTCCTTCG TGCTGCGACG CGATTTCCGC CTGCGCGGCG GTTTCGCCTT CTTCGCCTGG TACGCGGGCC CGCATGCGGG TGATTTCGTC GTCTCGGTGG GTGGCTACCA TCCATCCTTC CAGGTTCCGG CGCATTACCC GCGACCGGAG CGGGTCGAGT TCAACTGCCA GATCGGTCAG GTCACGATCA GCGGGCAGTG CTACTTCGCC CTGTGTCCCG CGGCGATCAT GGCCGGCGGC AGCCTGTCGA TCGTCTACGC TTCGGGCGGC ATCCGCGCCT GGCTGGTCGC GCGCGCGGAT TTCCTGATGC AGTGGAAGCC GCTCCACTAC GAGGCCGCGG TTGCGGTCTC GCTGGGCGTG CAGCTGAACA TCAAGATCTG GTTCGTGCGC ATCCGCCTGT CGATCGAGCT CGGCGCGGCG ATCGCGCTCT ATGGTCCGCC GCTCGCCGGC AGCGTGCGTA TCTCGCTCTA CGTGGTCAGC TTCACGGTCG GCTTCGGGCC GCCCAAGAGC CTGCCGCCGC CGATGGTCTG GGAGTCGGAC GACCCCGAGC GCTCGTTTGC GCACAGCTTC CTCGGCAACC CCGACGTCAC CCGCATCTCG GTCGTGGATG GCCTGCTCGA TACGCCCGCC GCGCCCCCGG GCAGCGCGCC CCGGCGCCCG GTGCTGCAGG CGCATCGCCT GCACCTGCGC TGCCAGAGCA GCGTGCCGGC GACCGAGCTG CGCTTCGACG GCCGCGAGCT GCACCCGCGC GGGGGTGGCA CCTGGCCGCA GCTGGGCGTG CAACCGATGG GCCTGGGGCG CTTTCATTCA CGGATCGAGC TCACGCTCGA GGCGCTCCAC CCGGATGGCA GCGTCCGGGG AGACGCGCAG GCCGAACTCG ACCTCGCGCC CCTCACCGTC AGTGTCCCCA GCGCGCTGTG GAGTCCGCGC CCGCCCGGCA TCGATATCCT GTCCGGGAAA GCCCTCATCG ACGGCGCGCC GGTGGGCATC GAACTGCGCG GCCGGGTCGA TCCGGACACA AGGGTCGGCC CGGCGCTCGA GCTGGAGACC TTCGCGTACG ATCGGGTCGA GTATCCCTGC AGCGATGTCG GTGCGCTCCG CCCGGCGACG GCCCTGCCCG GGAGCGCGGC GAGGCTGGGC GACACCCTCA TGGCGGGCGT CGTGGTCGAG CGCCGCAGGG CAATCGTGGA CTGCCTCAAC GCCGGCCGCG GCGTGCGCAA GCTGAGCGCC GACGCCGAGC TGCCCATCCT CGCCGCGGCG CCCGAGCACG TCCTCGACGT CGAGCCCCTG ATGGCGCGGA TCGGCCAGGA CGTGCCGCGC ATGTTCGCCG AGATCTGA
|
Protein sequence | MSTTEDAGPA TPTTVELELE LRVGGRPLGL RGTLTGGGLT GLAGTLRGEK TDTLGGLLAG LGEAFAGPGA ALDALGGGGL AAIHFEAVEL AYRSTATPCV VLNTTLGVGV GHTRIVVAKL LPVPGRAEDE GGLVAGFELE LEPRPIGDKG LSELIGGIVV ERLGVFHASR DCSGLHLEPA TAPGAFAPLV ERAGEGRSLH AGLNLSGRVL IGGVDLLEGF GHDTGNTLDA QAPAAEPSPA AAPALASPAP EAPLPARGRA FWKRIDKRIG PLSFRRVGLV YESSRLAIAL DAGLRIGGLS FELVGFGVDC PIAALLKGPH AAMDGLGVRL EGAMLGLETG AIAISGGLTR APGETLRLDG SLRVSTPVLS VSAIGSYERI EGAHSFFAFA ALHKEIGGPP FFFVTGLCLG IGIHRRLTLP PIEQVHSFPL LRAAAEPDYL GKDADPRRIG ALVDTWLAPE RDSMWIAGGV RFTSFGIVES VAMLSVAFGN RLEIGVLGLS RLQLPRKAGG EAAMACVEME LRVVIAPDDG LVAVEGRLTE NSFVLRRDFR LRGGFAFFAW YAGPHAGDFV VSVGGYHPSF QVPAHYPRPE RVEFNCQIGQ VTISGQCYFA LCPAAIMAGG SLSIVYASGG IRAWLVARAD FLMQWKPLHY EAAVAVSLGV QLNIKIWFVR IRLSIELGAA IALYGPPLAG SVRISLYVVS FTVGFGPPKS LPPPMVWESD DPERSFAHSF LGNPDVTRIS VVDGLLDTPA APPGSAPRRP VLQAHRLHLR CQSSVPATEL RFDGRELHPR GGGTWPQLGV QPMGLGRFHS RIELTLEALH PDGSVRGDAQ AELDLAPLTV SVPSALWSPR PPGIDILSGK ALIDGAPVGI ELRGRVDPDT RVGPALELET FAYDRVEYPC SDVGALRPAT ALPGSAARLG DTLMAGVVVE RRRAIVDCLN AGRGVRKLSA DAELPILAAA PEHVLDVEPL MARIGQDVPR MFAEI
|
| |