Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1134 |
Symbol | |
ID | 7084663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1245576 |
End bp | 1247453 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698149 |
Product | 5'-nucleotidase |
Protein accession | YP_002354789 |
Protein GI | 217969555 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01530] NAD pyrophosphatase/5'-nucleotidase NadN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00179734 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCACC CCGGCATCCC CAAGCTGCTG CTGCTCACCG GCGCGATCAT GCTCGCCGGC TGCACCGACA CCTCCTCCGG CGAGGCCAAG GCCGACGCCG AGCCGGTGCC CGCAGCGGCG CAGCCCGATG CGCCCAGGCC CGACTACCGC CTGGCCATCC TCCACATCAA CGACCACCAC TCCAACCTCG ACGAATCCGA CGCCAGCCTG CGCCTGCGCA CCGGCAGCGA CGACACGCGC GCGAGCGTGA CCGTCAAGCT CGGCGGCTTC CCGCGCGTCG CCACGGCGAT CGCCGAGCTC GCCGCGCGCC ACGAGCACGT CCTCAAGCTG CACGCCGGCG ATGCCATCAC CGGCACGCTC TACTACACCC TCAGCGAAGG CGAAGCCGAC GCCGCGCTGA TGAACCGCGT CTGCTTCGAC GCCATGGCCG TCGGCAACCA CGAGCTCGAC TCCGGCGATG CCGGCCTCCA GACCTTCATC GACCACCTCT GGTCCGCCCC CGACTGCCGC ACCCCGGTGC TCTCGGCCAA CCTCGCCCCG CGCGCCGGCT CGCCGCTCGG CAGCGACAGC GTGCGCCCCT CGGTGGTGGT CGAGCGCGGC GGCGAGCGCA TCGGCATCGT CGGCCTCACC ACCGCCGCCA AGACGCAGAA CGCCTCGCGC CCCGACCCCG GCACCCGGCT GCTCGACGAG GCGGACTCCG CCCAGCGCGA GATCGACCGC CTGCGCGCCC AGGGCATCGA CAAGATCGTC CTGCTCTCGC ACCTCGGCTA CGCCCAGGAC CAGGCGATCG CCGCGCAGCT CTCGGGCGTC GACGTCATCG TCGGCGGCGA CTCGCACAGC CTGCTCGGCG ACGACAGCCT GAAGACCTTC GGCCTCTCGC CCGCCGGCGC CTACCCCACC GCCGCGCGCA ACAAGGACGG CGACGCGGTC TGCGTCGTCC AGGCCTGGCA ATACAGCGCC GTGGTCGGCG AGCTCGACGT GCTCTTCGAC GGCCAGGGCG AGGTCAAGTC CTGCGCCGGC CAGCCGCACA TCCTCATCGG CAGCACGCTC GGCACGCTCG CCGGCGACGC CCTCGCCGCC GCCCGCGCCG ATCTCGCCAG CCAGCCGGCG CTGCGCGTCA CCGAACCGGA CGCCGCCGCC AGCGCGGTGC TCGCCGACTA CGCCAGCCAG GTGAAGGCCT TCGGCGCCGA GCCGGTCGCC GTCGCGCAGC AGAACCTCTG CCTGCGCCGC GTCCCCGGCA CCCGGCGCGA CCCCTCGCGC TCGAAGCTCG ACGGCTGCAA CCTCGATCCG CACTTGATCG CGCACGGCGG CGACGTCCAG CAGCTCGTCG CCGAAGCCTT CCTGCGCCAG GGCCAGCGCT TCGGCGGCGC CGACCTCTCG CTGCAGAACG GCGGCGGCGT GCGCGTCGAC CTGGCCGCGG GCACGGTCAC GATCGGGCAC ATCTACACCG TGCTGCCGTT CAAGAACACG CTGGTCGCGC TCACCCTCAC CGGCGCCGAG CTGCGCGCCA CCCTCGAGGA CGCGATGCAG AGCGTGGTCG CCGGCAACAC CGGCTCCTAC CCCTACGCCG GCGCGCTGCG CTGGCAGGTC GACCTGCGCC AGCCCCTCGG CCAGCGCATC GGCGCGCTCG AGCATCGCAA CGCCCAAGGC CAGTGGGTGG CGCTCGACGA GGCCGCGACC TACCGCATGA TCACCAACGA CTTCATCGCG GCCGGCCAGG ACGGCTACAC CACCCTCGGC ACGCTCGGCG CCGACCGCCG CGAGGAGACC TTCCTCGCCT ACGCCGACGC CTTCCTGCAG TACGCCCGCC AGAACCCCAC CCTCACCCGC CCCGCCACCG CCGACTTCAG CACCCAGGGC TTCGTCGATA CGGAATAA
|
Protein sequence | MRHPGIPKLL LLTGAIMLAG CTDTSSGEAK ADAEPVPAAA QPDAPRPDYR LAILHINDHH SNLDESDASL RLRTGSDDTR ASVTVKLGGF PRVATAIAEL AARHEHVLKL HAGDAITGTL YYTLSEGEAD AALMNRVCFD AMAVGNHELD SGDAGLQTFI DHLWSAPDCR TPVLSANLAP RAGSPLGSDS VRPSVVVERG GERIGIVGLT TAAKTQNASR PDPGTRLLDE ADSAQREIDR LRAQGIDKIV LLSHLGYAQD QAIAAQLSGV DVIVGGDSHS LLGDDSLKTF GLSPAGAYPT AARNKDGDAV CVVQAWQYSA VVGELDVLFD GQGEVKSCAG QPHILIGSTL GTLAGDALAA ARADLASQPA LRVTEPDAAA SAVLADYASQ VKAFGAEPVA VAQQNLCLRR VPGTRRDPSR SKLDGCNLDP HLIAHGGDVQ QLVAEAFLRQ GQRFGGADLS LQNGGGVRVD LAAGTVTIGH IYTVLPFKNT LVALTLTGAE LRATLEDAMQ SVVAGNTGSY PYAGALRWQV DLRQPLGQRI GALEHRNAQG QWVALDEAAT YRMITNDFIA AGQDGYTTLG TLGADRREET FLAYADAFLQ YARQNPTLTR PATADFSTQG FVDTE
|
| |