Gene Information |
Locus tag | Tmz1t_1390 |
Symbol | |
ID | 7084511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1545370 |
End bp | 1548972 |
Gene Length | 3603 bp |
Protein Length | 1200 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698407 |
Product | Heparinase II/III family protein |
Protein accession | YP_002355045 |
Protein GI | 217969811 |
COG category | |
COG ID | |
TIGRFAM ID | |
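The coordinates and lengths listed above agree with one another if two common conventions are assumed: start and end positions are 1-based and inclusive, and the CDS ends with a single stop codon that is not counted as a residue. The record does not state these conventions, so the following is only a sketch of the arithmetic under those assumptions (variable names are illustrative):

```python
# Values copied from the Gene Information table.
start_bp = 1545370
end_bp = 1548972

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (3603 bp) and Protein Length (1200 aa).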
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.274329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGATT CCTACGGCCC CCCCGCGGCG CAAGCACCTC GCAGCTACCT CCCGATCGCC CTGCTGTGGG CGCTTTTCAT CGCCTACGGC AGCCTGGTGC CCCTCGAGTT CCGCCCTCGC GCCGACGCCT GGCAGGCATT CATGGACACG CCCTGGCTGT CGCTCGGGGT CGGTTCGCGC GCCGACTGGG TGGCGAACGT GCTGCTTTAC CTGGTGCTGG CCTGGTTCGC TACCGGGGCG GTGTGGACCA GCCGCCTCTC CGCCTGGGTG CGCACGCCGC TGCTCGTCGG CGTGCTCGGC ACCATCCTCG CCCTCGCGGT GGGAATCGAA TACCTGCAGC TGTTCTTCCC GCCGCGAACG GTGTCGCGCA ACGACCTGCT CGCAGAGGCG CTCGGCACCG GCATCGGCAC GCTGCTCTGG TTCGCGGCCG GCCCTCGCCT CGCGGCGATG TGGCGGCGTT TCATCGACGG CGGCACGCAC AGCCTGCGCG CCGTGCTGGG TCTGTACGCA CTGGGCTACC TCGGGCTCGC GCTGTTTCCC TACGACTTCC TGGTGAGCAT GGACGAGCTC GCCGCCAAGC TCGCCCGCCC CGACAGCCTC GGGTGGTTGC CCGGCCTGTC CTGCGGACCC GCATTCGCCT GCGGCATCAA GCTGCTGGTC GAGGCCGTCC TGATGATTCC CTTCGGCATC CTGCTCGCGC TCGGTGTGCG CGACCACGCC GCGCGGCGCC CCCCGGGGAT GGCGGCCGGT CTCGCCGCGG GCGCCCTCGC CGGCGTGGCG ATCGAGGCGG TGCAGGTGGT GCTCGCGTCC GGAACGACAC AGGGCATCTC GGTGCTCACC CGCGCGCTCG GCACCCTGTG GGGACTCGTA CTCGCGCGTA GCGGAATACG GCGCTGGCTG GAGTACTCGC CACAACGCCT GCTGCGCGCG GCGCTGTGGC TCTCGTCCGT CTGGCTCGCC CTGGTGCTGG CGACGAACGG CTTGCTGCCG CTGCGGCTCC AGGCCTCCTG GGCTGCGCTG GAAAAACTCG AGACGCTGCG CTTCCTGCCC TTCTACTACC ACTACTACAG CACCGAGACG GCCGCCGTGC GCAGCCTGCT CTTCGTGGCG GGAAGCTTCG CACCGGTGGG GGTGGTCGCC GCCCTCGCCT TTCCGCACCA TCGCTTCGGC GCCAGCCTGC TCGCGCTGCT GGTCGCCGCC CTCGTCGCGG CGGCGGTCGA GCTGCTCAAG CTGTTCACCG AGGGCAAGCA TCCCGACCCC ACCAACCTGT TGATCGCGGT CGCGGCCGCC TGGCTCGCGC ACCGCCTCGT CGCGCACCTG CTGCCGATCC TGCACCACCA CGGCACGCGC ACGACACCCC CGACGTCCGC TGCGCAGCCC CGGCGCAGAG TCGCGACGCT GCTCGCCGTG GGTGTGGCGC CGGCGGCGCT GCTGCTCGCC ACGGTGCTGC TCGGCCTGCC GCTGGCCGAG CCGCCCGCCG TGGGGGCGTC CGCGCCCACG TACCCGCCTC CTTCGGCCCT GCCACCGGCC GACATCGCCG GCTTTCGCAC CGCCCACCCC CGCCTGCCCC ACCCCTCCCC CGCCGACCTC GCGGCGCTGC GCGCCGGCAA CCCCGCCTAC CTGCAGCAGA CGGCGAGCGC GGCGCGCAGC AACCCGAACG CCCTCTTCGC GATCACGCTG GCGGCCTTCG TCCAGCCCGG CAGCGTCGAT CTCGCCCCGC TGCACGCGCG CCTGGTGGCG AGTCGCTTCA GCGACCGCGG CAGCGGCCAG GTCGAACCGC TCGCGCTCGC CTACGACTGG 
CTGCACGACC AATGGAGCGC GCAGGAACGC GAGAGCCTGC GCGAGCGCCT CGCCGAGGGC TGCGACTTCC TGATCGAGGT GATCCGCAAG GAACAGCTCT CGCCCTACAA CGCCTTCCTC TACAACACCC CGCTGCAAGG CCTGATGGCG TGCAGCATCG CGCTGTACGG CGACCATCCG CGCGGCGAGG CCTTCATGCG CTTCACCCAC GAGCTGTGGA AGAAGCGTGT GCTGCCGGTC TGGCGCCAGG TCTTCGGGCG CCACGGCGGC TGGCACGAAG GCGGCGAATA CGTGGCGGTG GGCATCGGCC AGGCCATCCA TACCCTGCCC GCGCTGTGGC GCACCGCCAC CGGCGAGGAC CTGTTCGCCA GCGAAGCGGG GATCCGCGGC TTCCTCGACT TCCTCGTCTA TCGCACCCGC CCCGATCGCA CCCATATGCG CTGGGGCGAC GGCGCCTGGT TCGACCGCCA TCCGCGCGAC GCCGCGGCGC TCGCCCTCGA GTACCGCCAC GCCGCCGCCT ACACGCTGGC ACCGCCCAAC GCGGCGCGCG CGCGCGACGG CCGTCGGGTC GGCCCGGTGC CCACCGGGTG GCCATGGGGA CCGCTGTCGG ACGACGGCCT GATCGACCCC GCCGCGCAGA CCCGCATGCC GCTCGCGCGC CTGTTCGACG GCATCGGCCT GCTCGTGGCG CGCAGCGACT GGTCGGAGGA CGCCACCTGG CTCAGTTTCA AGGCCGGCGA CAACTTCTGG TCGCACAGCC ACCTCGACCA GGGCGCGTTC ACGATCTTCA AGGGGGGGCC GCTGGCGATC GACAGCGGCT GGTACGGTCC CGCCTACGGG TCGAATCACC ACATGAACTA CACCTACCAG AGCATCGCCC ACAACCTGGT CACGGTGACC GACCCCGCCG ACGAGCAGCC CGGCCCCGGC TTCGACGCCG CAAACCCGCG CCACTACCCC AACGACGGCG GCCAGCGCCG CATCGGCTCG GGCTGGGGCG TGGATGCGGC GCCGCTCGAC GTCGCACAAT GGCAGGAGCG CAGCGAGACC TACCACACCG GCCGCATCGC GGCCCACCTC GACGACGACG ACCTGGTCGT CGCCGTGGCC GACGTCGGCG CCGCCTACAC CAACCGCAAC TCCGGACGCG GCAGCTTCGC CGACCGCACC CGCCGCGTCG AGCGCATGTG GCGGGTGCTG GGCTATGACC GGATCAACGA TGCGGTGGTG GTGTTCGACG ACGTCGTCGC CAGCCGTGCC GGCTTCGCCA AGCGTTGGCT GCTGCATGCG GTGGAGCCGC CGCTCGTGCG CGGGGACCGC TTCGACCTGT TCATCCCGGG CGACACCCGC CCGGGGCGGC GGGGCGGCAG CCTGCACGGC CACGTCCTCC TGCCGCGCGA CGCGGTGCTC GACACCGTAG GCGGCCCCGG CTTCGAGTTC TTCGTGGACG GGCGCAACCA CGACGAGGAC GGGAAGGTGC AGGCGGCGAT CGCGAAGCTC GGCCACGGCC GCGCCGAGCC GGGCGCCTGG CGCATCGAGC TGCGGCCGCG CGCCGCGGCC GCCGAAGACC GCTTCCTCGT CGTGATGCTG CCCACGCTCG CCGGCGACCA GCCCCAGGCC CGCGTGCGCC TGCTCGAAGC CGGCGCGGAG GTGGGTGCGG AGATCGCCGG GCCGCGGCGC ACCACACGCT GGTGGTTCGT GCCCGGACGC CTGGGCGCAC GCGTGGAGGT GCTCGAGGAC GGTCGCACGC GCAGCCGCGA GATCGTGCCC GGCGGATCCC CCGCCGGAAA CATCACGGAT TGA
|
Protein sequence | MHDSYGPPAA QAPRSYLPIA LLWALFIAYG SLVPLEFRPR ADAWQAFMDT PWLSLGVGSR ADWVANVLLY LVLAWFATGA VWTSRLSAWV RTPLLVGVLG TILALAVGIE YLQLFFPPRT VSRNDLLAEA LGTGIGTLLW FAAGPRLAAM WRRFIDGGTH SLRAVLGLYA LGYLGLALFP YDFLVSMDEL AAKLARPDSL GWLPGLSCGP AFACGIKLLV EAVLMIPFGI LLALGVRDHA ARRPPGMAAG LAAGALAGVA IEAVQVVLAS GTTQGISVLT RALGTLWGLV LARSGIRRWL EYSPQRLLRA ALWLSSVWLA LVLATNGLLP LRLQASWAAL EKLETLRFLP FYYHYYSTET AAVRSLLFVA GSFAPVGVVA ALAFPHHRFG ASLLALLVAA LVAAAVELLK LFTEGKHPDP TNLLIAVAAA WLAHRLVAHL LPILHHHGTR TTPPTSAAQP RRRVATLLAV GVAPAALLLA TVLLGLPLAE PPAVGASAPT YPPPSALPPA DIAGFRTAHP RLPHPSPADL AALRAGNPAY LQQTASAARS NPNALFAITL AAFVQPGSVD LAPLHARLVA SRFSDRGSGQ VEPLALAYDW LHDQWSAQER ESLRERLAEG CDFLIEVIRK EQLSPYNAFL YNTPLQGLMA CSIALYGDHP RGEAFMRFTH ELWKKRVLPV WRQVFGRHGG WHEGGEYVAV GIGQAIHTLP ALWRTATGED LFASEAGIRG FLDFLVYRTR PDRTHMRWGD GAWFDRHPRD AAALALEYRH AAAYTLAPPN AARARDGRRV GPVPTGWPWG PLSDDGLIDP AAQTRMPLAR LFDGIGLLVA RSDWSEDATW LSFKAGDNFW SHSHLDQGAF TIFKGGPLAI DSGWYGPAYG SNHHMNYTYQ SIAHNLVTVT DPADEQPGPG FDAANPRHYP NDGGQRRIGS GWGVDAAPLD VAQWQERSET YHTGRIAAHL DDDDLVVAVA DVGAAYTNRN SGRGSFADRT RRVERMWRVL GYDRINDAVV VFDDVVASRA GFAKRWLLHA VEPPLVRGDR FDLFIPGDTR PGRRGGSLHG HVLLPRDAVL DTVGGPGFEF FVDGRNHDED GKVQAAIAKL GHGRAEPGAW RIELRPRAAA AEDRFLVVML PTLAGDQPQA RVRLLEAGAE VGAEIAGPRR TTRWWFVPGR LGARVEVLED GRTRSREIVP GGSPAGNITD
|
| |
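The gene sequence above is shown in coding orientation (it begins with ATG and ends with TGA), so the protein sequence can be reproduced by reading it codon by codon with translation table 11. The sketch below is a minimal illustration, not the database's own method: the helper name `translate` is hypothetical, the amino-acid string follows the NCBI ordering for table 11 (bases in T, C, A, G order), and it assumes an ATG start, so alternative initiator codons are not handled.

```python
from itertools import product

# Codon table built in NCBI order (T, C, A, G); the amino-acid
# assignments of table 11 are the same as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases,
    # so strip all whitespace before reading codons.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCACGATT CCTACGGCCC CCCCGCGGCG"))
```

Applied to the first 30 bases, this yields MHDSYGPPAA, which matches the first block of the protein sequence in the record.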