Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2824 |
Symbol | |
ID | 7873232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3055185 |
End bp | 3058910 |
Gene Length | 3726 bp |
Protein Length | 1241 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699745 |
Product | hypothetical protein |
Protein accession | YP_002889800 |
Protein GI | 237653486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0780717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGA AGATCGACCA ACTGGTGGCC GAACGGCGCG CGGCCCTCGG GCGCAAGGAC GGCGTCGCGG ACAAATGGGG CCTGGCCCTG TCGGGCGGCG GCATCCGCAG CGCGACCTTC TGTTTCGGCC TGCTCGGCGT GCTCGCCCGC AACCGTCTGC TTGAACGCTT CGACCTGCTC TCGACCGTGT CCGGCGGCGG CTACATCGGC GGCATGCTCG GCCGCCTGCT GCACGGCGCA CGCTCCGCGG ACGACACCCG CGCCATCTTC GCGGCCTTCG GCTCCCAGGA GCCACGCTGG TTTCGCTGGT GGCTGCGCGC CAACGGCCGC TACCTGGTGC CGCGCGGACC GGCCGACAAG ACCTTCGCGC TCGCCGTCTT CCTGCGCAAC CTGCTCGCGA TCCACCTCGA GCTCGGCGCG CTGGCCCTCG CGCTCGGCGT GGTGCTGGCG GCGGTCGATG TGGGCGTGTG GGCGGCGATC GCGCACTGGG TGACGGACGG GTGGGTGACG GGCGAGCGCA TCGAGGCCCT GCGCCGGCTG CCCCCCTGGC TGCCCACGCT GTGGCCGGTC GCGATCCCGG TGCTGGCCGT GCTCGGCGGC CTGGCGACCG CGGCGTACTG GGTCGTGCCC TGGGTGGCGT CGGCCGGGCG GCCGGGATGG CGGCCCGGCT TCGTCGCCGT GCAATGCGTG ATCCTGACGA TCGTCCTGGG CCTGTTGGTG CATTACGGCG ACGCCATCAT CGGTCCGCAG TGGCAGCCCG GTCATGCGAT CCGGATGGCG CTGGCCTGGG GCGCGATCGG CCTGGCCGTC GCCTGGCTGA TCGCGATCCC GTGGTCGCGC TTCCTGTTGC GCCACCTCTT CGACGAGACC GAGCGCGATG CGCTGCGGCT CCGCGAGGAG GCGGTGCGGC GTTGGCTGGC CGACTGGCAA TCCCTGTGCA TGAAGGCGAT CGCGGTCGTG CTGCTGCTCG GACTGGTGGA CCGGGTGGCG TGGTTCCTCG CCTTCGAGTT CAAGGCGCAG GCCAATACCG CGCTCGCCCT GGCGGTGGCC GCCGCCGTGC TGCGGGCGGC GATGCCGGCG CTGGGTGGCG GCCCCCAAGG CGGAATCAGC GGCAAGCTGC TGCTGAGCCT GGGCCAGCTC TCCGGCTACG TACTCAGCTT CCTGCTGTGC GCGTGGTGGG TCTCGCTTGT GCACGCGGCG GCGCTGGGGC CGATGTTCGC GACCGAGGGC ACGGTGGACT TCGGCGCGCC CTGGCTCCGG CTGCTCGCGA TCGGTACTGC GGCCGGCGGC TACGTGCTCG CCACCGGGCG GAATTTCGAG TTCCTCAACC TGTCCTCGCT GCACACCTTC TACCGCGCGC GCCTGGTGCG CAGCTACCTG GGCGCGGCCA ATCCGGCCCG CTTCGGCCAG CAGGACACAC TCGGGCCGAC CGCCGTCGTG TCCGAGCGCG GCGCCGCACA CCTCGCCCAC AAGAGCGTGT TCGACCCCGA CCCCGACGAT GACCTCGCGC TGGAAGCCTA CGCGCCGCAT CGGCAAGGCG GCCCGGTGCA TCTGATCGGG GTCTGCATCA ATCAGACGCA CGACCCGCGC GGCGGACTGT TCAACCGCGA CCGCCGCGGC CTGCCGCTGA CGGTGGGCCC CGAAGGCTGG GTGCGGGTGG GCCAGGAACC GTGGTTCCAG GTGACCGGTG CAGGCGCCCT CGGCCTGGGT TCGTGGGTGG CGATCTCGGG CGCCGCGGTG TCGCCCGGCC TGGGCAGCCA GACGCGCGGC GGCATCTCGG CGCTGCTGGC CTTCGCCGGC ATTCGCCTCG GCTACTGGTG GACGCGCGCT GCACGGGAGA ACCTGCAGCG CCCGAAACGC CGCCTCGCGG CCAAGTCACG CGGCCTGCTC AGCGAGGTGT TCTGCAACTT CAAGGGCAGC AGCGGACCCG ACTGGTTCCT CAGCGATGGC GGCCACTTCG AGAACACGGC CGCCTACGCC CTGCTCGCCG AACGCTGCCG CATGATCGTG CTCGCCGACT GCGGCGCCGA CCCGGACTAC CGCTTCGGCG ACCTGGAGAA CCTGGTGCGC AAGGCGCGCA TCGACCTCGG CGCGGAGATC GAGTTCCTGC GCCCGCGCCC CCGCGCAGCG GGCGAGAACG ATCGACCGCC CGGGGTCGAG CTGTTCGGCT CGCTCAACGA TCTGGCCTCG CGCAAGAGCA ACGCCTGCCT CGCGCTCGCC CGTGTGCGCT ACGTCGGCGC GACCGAGCCG GGCTGGCTGG TGCTGGTCAA GCCCAACATC AGCGCCGGGC TGCCGGTCGA CCTGATCAAC TTCGCCGCCA GCAACCCGGC CTTCCCGCAG CAGACCACCG CCGACCAGTT CTTCGACGAA GCGCAGTGGG AGAGCTACTA CCAGCTCGGC TTCAATCTCG GCGACGTGCT CGACAGCGAC TTGCTGGAGG CACTGCGCAG TCGCCACGCG ACGCTGTTCG AGTCCGACAC CGGCGCCCTC ACCGCCGCGA CGCCCCGGGC CTCCGTTACC CCGGCGACGG TGCCGGCGGG CGCGGGCGCG GCCGCCGCGG CCGCCGCCGA GGCCGCGGCC AGCCGGCTGC CCGAGCGCAT CCGCAACAGC GCGGTGAGCG CCTCGATCGG CATCGGCGCG GTGGCCACCG TGCTGGTGTC CTCATGGCAG GCGATCGACG GCGCGATCGG CGCCCTGGCC GAGGAACGCA AGACCGAACG CGCCGCGATC CTGAAGATCA CCGAGGACTG GGCCGCCCTG CCCGGACGCG ACCGCTGCTC GACGACACTC ATGAGCGAGG AGCATCCGAA GCTCACCGCG CTGGCAGGCA GCTTCCTGCA CCATGCGGAC GCGTTGTGCG CCCGACGGGA GGCCGACTGG CTCGCGGAAT CCAGCCTGGT GCAGGAGATC TTCGACGAGG TTGCCGAGCG CTGCGGCCTG GTCGAGACGC GCATGCGCAG CCGCGCGTGC GCCGCGCTCC TCGTCGCCAA CACCCGCCCG GACGCCGGGG GCGCCGAATG GCCACAGATC TGCTTCGCCG GGCGCCCGGC CGCACCGGCC AAGACCAGGC TGCCGACTTA CTGGGCCTAC CGCTACGGCT TCGAGGCACA GCGGGGCAAG TCCGCACACC CGGCCGACTC CTTCGCCGAG GCACGCGAGT TCGCCCGGGA GGAACACAAC GCCGCGCTGG CCTGCCCCAC CGTGGCGCTC GCCGGCGAAC AGCCGACCCC ACCGACCGCG CAGTCCCCGA CGGCGAAGCC GCCCCCTCCA CCCACCGCTC CGCCCCCACG ACCGGCGCCC GTGCCGCCCG TCGCGCCCGT CGCGCCTGCG CCCGCGCCCG CGCCCACTGC GCCCGCCGCG GTGGAGGCGA ACTCGAAGAT CTGCGCCGGC ACGACCATCT ACCTGCAGAT CTTCGGCCCT ACGCAACGCG ACACGGTGCG CGACTACCGG GACGCCTGGC GCAAGCTCGG CGCCTCGGTG CCGCCGATCG AAGACGTCGA GGCCACGGCG CGCGCGGCGC AGCGCACGCC GCCGCGCCAG GTCGCGCTGA CCACCGTGCG CCATCACGAT CCCGCCTCGA AGCGCTGCGC CGAAGAGCTG GGCAAGGCGG TCGGTTTCGG GAACTGGAAG GTCGAGCCGC TCGCGGCATC GCTCAAGCCG ACCCGCGGCG TGATCGAGGT CTGGATCGGC CGCGAGATCC CCGCCAGGCA GGGCGAAGCC AAGTAA
|
Protein sequence | MSEKIDQLVA ERRAALGRKD GVADKWGLAL SGGGIRSATF CFGLLGVLAR NRLLERFDLL STVSGGGYIG GMLGRLLHGA RSADDTRAIF AAFGSQEPRW FRWWLRANGR YLVPRGPADK TFALAVFLRN LLAIHLELGA LALALGVVLA AVDVGVWAAI AHWVTDGWVT GERIEALRRL PPWLPTLWPV AIPVLAVLGG LATAAYWVVP WVASAGRPGW RPGFVAVQCV ILTIVLGLLV HYGDAIIGPQ WQPGHAIRMA LAWGAIGLAV AWLIAIPWSR FLLRHLFDET ERDALRLREE AVRRWLADWQ SLCMKAIAVV LLLGLVDRVA WFLAFEFKAQ ANTALALAVA AAVLRAAMPA LGGGPQGGIS GKLLLSLGQL SGYVLSFLLC AWWVSLVHAA ALGPMFATEG TVDFGAPWLR LLAIGTAAGG YVLATGRNFE FLNLSSLHTF YRARLVRSYL GAANPARFGQ QDTLGPTAVV SERGAAHLAH KSVFDPDPDD DLALEAYAPH RQGGPVHLIG VCINQTHDPR GGLFNRDRRG LPLTVGPEGW VRVGQEPWFQ VTGAGALGLG SWVAISGAAV SPGLGSQTRG GISALLAFAG IRLGYWWTRA ARENLQRPKR RLAAKSRGLL SEVFCNFKGS SGPDWFLSDG GHFENTAAYA LLAERCRMIV LADCGADPDY RFGDLENLVR KARIDLGAEI EFLRPRPRAA GENDRPPGVE LFGSLNDLAS RKSNACLALA RVRYVGATEP GWLVLVKPNI SAGLPVDLIN FAASNPAFPQ QTTADQFFDE AQWESYYQLG FNLGDVLDSD LLEALRSRHA TLFESDTGAL TAATPRASVT PATVPAGAGA AAAAAAEAAA SRLPERIRNS AVSASIGIGA VATVLVSSWQ AIDGAIGALA EERKTERAAI LKITEDWAAL PGRDRCSTTL MSEEHPKLTA LAGSFLHHAD ALCARREADW LAESSLVQEI FDEVAERCGL VETRMRSRAC AALLVANTRP DAGGAEWPQI CFAGRPAAPA KTRLPTYWAY RYGFEAQRGK SAHPADSFAE AREFAREEHN AALACPTVAL AGEQPTPPTA QSPTAKPPPP PTAPPPRPAP VPPVAPVAPA PAPAPTAPAA VEANSKICAG TTIYLQIFGP TQRDTVRDYR DAWRKLGASV PPIEDVEATA RAAQRTPPRQ VALTTVRHHD PASKRCAEEL GKAVGFGNWK VEPLAASLKP TRGVIEVWIG REIPARQGEA K
|
| |