Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1124 |
Symbol | |
ID | 7084653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1232219 |
End bp | 1234069 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698139 |
Product | Heparinase II/III family protein |
Protein accession | YP_002354779 |
Protein GI | 217969545 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.781044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTCG CAATCAAGGC GCGGACGGCG CTGGCGCTCG GTGTGCCGAA CCTGGTGCGG GCGCTGTCGT ATCGCCTGGG GGTGAAGACG GGGCTCAACC CGGTGCGCCG CCTGCGCGGG GCTGCGCCGG CGGGACCGTT CTTTGCCATG CCCGCGTGCG CTGCACGCCC TCCGAAGCCC GAGCGCTTGC TCGTGGCGTC GCAGGCGTGG CAGACGAGCG CGCGGCTGTT CAGCCATTGG CCGATCGCGG TGACGGACGA GCCGCCCGAC TGGCACAGCA ATCTGCTTAC AGGTAAGCAG ATTGCCGAAC CCGATCGGGA GTGGTGGCGG ATCCCGGACT TCGATCCGGC AGTGGGCGAC ATCAAGCAGA TCTGGGAGGC GTCGCGTTTC GACTGGGTGC TGGCCTTCGC GCAGCGTGCG CGCAAGGGCG ACGCGGCCTC TTACGAGCGT CTTGAGCGCT GGCTTGCCGA CTGGAGCGCA CGCAATCCGC CCTACTGCGG GCCGAACTGG AAGTGCGGGC AGGAAGCGTC GATGCGGGTG ATGCATCTCG CCATGGCGGC GGTGATGCTG GGGCAGGTCG ATACGGCCAG CCCGGCGCTG CTGGAGCTGG TGCGCCTGCA CCTGCAGAGG ATCGCGCCGA CGCTGCGCTA TGCGATGGCG CAGGACAACA ACCACGGCAC GTCCGAGGCG GCGGCGCTGT TCATCGGCGG GAGCTGGCTG TCGCGCAAGG GCGTGGCGGG TGGTGCGCGC TGGGCGGCGC TGGGGCGCAA ATGGCTGGAG AACCGGGCTG CACGGCTAAT TTCGCTGGAT GGCACCTTCA GCCAGTATTC GGTGACCTAC CACCGGGTGA TGCTCGACAC CTATTGCATG GTGGAGGTGT GGCGCAGGCG CCTGGCCCTG ACGGCGTTTT CGGGGCAGAT GGCGGTGTTC ATGCAGGCGG CGACGCGCTG GCTGTATGTG ATGACCCGCC CCGAGAACGG CGATGCGCCC AACCTGGGGG CCAACGACGG TGCCCGCCTG CTGCAACTGA CGGATACCGA CTACCGCGAT TTTCGGCCGA GCGTGCAGCT TGCTGCGGCG CTGTTCTGGG GCATGCGCGC CTACGCCGAG GACGGCGCCT GGAACCAGCC GCTGCGCTGG CTCGGGGTGG ACGTGCCGGA GCGCGTCTTT ACGCCCGGCT GGCTGGAGCT TTTCGACGAC GGCGGTTTCG GAGTGTTGCG CCGTGGCGAT GTGCTCGCGG TGCTGCGTTA TCCGCTGTTC CGCTTCCGGC CCAGCCAGGC GGATGCGCTT CACCTCGACC TGTGGGTGGG TGGGCGCAAC CTTCTGCGCG ACGGTGGCAG CTACAGCTAC AACACCGAGC CGAAGTGGTT GAACTACTTC GGCGGGACAG AAAGCCACAA CACGGTGCAG TTTGACGATC GCGACCAGAT GCCGCGCTTG AGCCGCTTCT TGTTCGGCGA CTGGCTGCGG ACGATCGAGC ACGAAGGGCG CAGGGAGGCT GCCGATGCGC TGACCTATGG CGCGGCTTAC CGCGACCGCC AGGGCGCCTA TCACCGTCGC CGGGTCGCGT TGGCAGGCGA TGCCTTGCAT GTGCAGGACC GCATCGAAGG CTTTGCGCGC AAGGCCGTGC TGCGCTGGCG GCTTGAGCCG GGCGAGTGGC AGGTGGAGGG CCATTGCGTG CGCAACGGCG CGCATACGCT GCGCGTGCAG GCCGACGTGC CGCTGGTCCG CTTCGAGCTG ACGACCGGCT GGGAGTCGCG CTACTACCTC GAGAAAACCG AACTACCGGT GCTGGAGATC GAAGTCGATC AGCCCGGTAC ATTGATGAGC GAATATCGCT GGGCACCATG A
|
Protein sequence | MSVAIKARTA LALGVPNLVR ALSYRLGVKT GLNPVRRLRG AAPAGPFFAM PACAARPPKP ERLLVASQAW QTSARLFSHW PIAVTDEPPD WHSNLLTGKQ IAEPDREWWR IPDFDPAVGD IKQIWEASRF DWVLAFAQRA RKGDAASYER LERWLADWSA RNPPYCGPNW KCGQEASMRV MHLAMAAVML GQVDTASPAL LELVRLHLQR IAPTLRYAMA QDNNHGTSEA AALFIGGSWL SRKGVAGGAR WAALGRKWLE NRAARLISLD GTFSQYSVTY HRVMLDTYCM VEVWRRRLAL TAFSGQMAVF MQAATRWLYV MTRPENGDAP NLGANDGARL LQLTDTDYRD FRPSVQLAAA LFWGMRAYAE DGAWNQPLRW LGVDVPERVF TPGWLELFDD GGFGVLRRGD VLAVLRYPLF RFRPSQADAL HLDLWVGGRN LLRDGGSYSY NTEPKWLNYF GGTESHNTVQ FDDRDQMPRL SRFLFGDWLR TIEHEGRREA ADALTYGAAY RDRQGAYHRR RVALAGDALH VQDRIEGFAR KAVLRWRLEP GEWQVEGHCV RNGAHTLRVQ ADVPLVRFEL TTGWESRYYL EKTELPVLEI EVDQPGTLMS EYRWAP
|
| |