Gene Information |
Locus tag | Tmz1t_2540 |
Symbol | |
ID | 7873979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2737758 |
End bp | 2739803 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699462 |
Product | protein of unknown function DUF839 |
Protein accession | YP_002889519 |
Protein GI | 237653205 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
|
|
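The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon (neither is stated in the record). The function names are hypothetical and chosen for illustration only.

```python
# Coordinates copied from the Gene Information table.
START_BP = 2737758
END_BP = 2739803


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2046 681
```

Both values match the Gene Length (2046 bp) and Protein Length (681 aa) fields in the table.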
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC CCGATACGCT CGACGACCTT CCGACCAACC TCTCGTCCAA CGAGCACTTC CAGTCCGTCG TGGAGCGTGC GGTCAGCCGC CGCGGCTTCC TGAAGAGCGG CCTGGGCCTC TCGGCCGTCA CCTTCCTGTC GGGCTCGCTC GCTGCCTGTA CCTCGGACGA CGACACGCCC GTGGCCGGCA CCCCGACCGC CGGGACGCCC CCTGCGCCTG CACCTGCCGC GGGGCCGCTC CTCGGCTTCG CCGCCGTCGC CACATCGAGC GGCGATGCGA TCGTCGTCCC CGCGGGCTAT TCGGCGCAGA TCTTCACTCC CTGGGGATCG CCGCTGTTCA GCGACTCCCC CGCATGGCGA GCGGACGGCA CCAACACCGG TGAAGAGCAG GCTCGCCAGG TCGGCGACAA CCACGACGGG ATGAGCTACT TCCCGATCGA CGGATCGAAC GAAGGCCTCC TGGTGATGAA CCACGAGTAC TGCAACTACG AGTATCTGTT CGGCGCCGAA TTCATGACGC CGTGGACGGC GGACAAGGTC TCCAAGGCGC TCAACGCGCA TGGGGTCTCG GTCCTCCACG TCAAGAAGAA CGGCGCGGGG CGCTGGGAAG TCCACATCGG CTCGCCGTAC AACCGCCGGA TCACCGGCAA GACGCCGATG ACGCTGACCG GCCCCGCCGC CGGCGACGCC CTCCTGCGCA CCACGGCGGA CCCCAGTGGC CTCAACGTCT TGGGAACGCT GAACAACTGC GCCAACGGCA AGACGCTGTG GAACACCTAC CTGACCTGCG AAGAGAACTT CAACGGCTAT TTCGCCACCG CAGCGAGCCC GGCACCGACG CGCAGCGCGG CCTTCGTCCG CTACGGCATC AGCGCCGGCG GTTCCGGCTA CCGCTGGCAC GAGCATGAGG ACCGCTTCGA CTACGCCAAG GAACCCAACG AGGCCAACCG CTTCGGCTGG GTCGTCGAGA TCAATCCCTT CGAGCCCGGC TCCACGCCGA AGAAGCGCAC CGCGCTCGGC CGCTTCAAGC ATGAAAACGC CGAGATGAGG CTCGCCGCAG ACAAGCGCGT GGTCGTCTAC ATGGGCGACG ACCAGGCCAA CGATTACATC TACAAGTTCG TGTCCGATGG CGTTTTCGAC GCAAGCCGTG GGCTTGCAAA CGGCAACCTG CTCGACGCTG GCAAGCTGTA TGTGGCCAAA TTCGACGCCG GCGCGGCCAG CGGCGACTTC ATGGGTGTGG GGGAATGGCT GCTGCTCGAC AAGGCGGCCA ACCCCACGCT GGCGGCAGAC GCCCGCTTCG CCACCCAGGC CGAAGTCCTG ATCCACGCCC GCCTCGCCGC CGACGCCGTC GGTGCGACGA AGATGGATCG CCCGGAGTGG ATCACCACGC ATCCGCAAAC CGGCGAGGTC TATTGTGCCC TGACCAACAA CTCCGGCAGG ACCACGACGG ACGAGGCGAA CCCGCGCGCA CAGAACCGCT ACGGACAGAT CGTGCGCTGG CGCGAGGCCG GCGACGACGC CGCCGCGATG ACTTTCGAGT GGGATCTCTT CGTGCTCGCA GGCAACCCGG TGGCTTACCC CGACCGCCAG GACCTGCGCT CCGGTTCCGC GAACGTATCC GCCGACAACA CCTTCAACAG CCCCGACGGC ATCGGCTTCG ACGGCGCGGG CCGGCTGTGG ATCCAGACCG ACGGAAACTT CTCGAATAGC GGCGACTACG CGGGCCAGGG CAACAACCAG ATGCTGGTTG CCGACCCCGA GAGCAAGGAG ATCCGCCGCT TCCTGGTCGG ACCTTCGGGC 
TGCGAGATCA CCGGCCTTGC ATTCTCCCCC GACTACCGGA CCATGTTCAT CAACGTGCAA CATCCCGGCG AGGCCGGTTC GCATCCGCGC GCACCGGACG CGAGCATGCG CGGGAGCCTG TCGATGGACG AGTATCTTGC GCAGAACCCG CTCGCGTTCA GTCAGTGGCC CGAGGCCGGC GGCGGTCGCC CACGCTCGGC GACCGTCGTG ATCACGAAGG ATGACGGGGG CGTGGTCGGC TCCTGA
|
Protein sequence | MKKPDTLDDL PTNLSSNEHF QSVVERAVSR RGFLKSGLGL SAVTFLSGSL AACTSDDDTP VAGTPTAGTP PAPAPAAGPL LGFAAVATSS GDAIVVPAGY SAQIFTPWGS PLFSDSPAWR ADGTNTGEEQ ARQVGDNHDG MSYFPIDGSN EGLLVMNHEY CNYEYLFGAE FMTPWTADKV SKALNAHGVS VLHVKKNGAG RWEVHIGSPY NRRITGKTPM TLTGPAAGDA LLRTTADPSG LNVLGTLNNC ANGKTLWNTY LTCEENFNGY FATAASPAPT RSAAFVRYGI SAGGSGYRWH EHEDRFDYAK EPNEANRFGW VVEINPFEPG STPKKRTALG RFKHENAEMR LAADKRVVVY MGDDQANDYI YKFVSDGVFD ASRGLANGNL LDAGKLYVAK FDAGAASGDF MGVGEWLLLD KAANPTLAAD ARFATQAEVL IHARLAADAV GATKMDRPEW ITTHPQTGEV YCALTNNSGR TTTDEANPRA QNRYGQIVRW REAGDDAAAM TFEWDLFVLA GNPVAYPDRQ DLRSGSANVS ADNTFNSPDG IGFDGAGRLW IQTDGNFSNS GDYAGQGNNQ MLVADPESKE IRRFLVGPSG CEITGLAFSP DYRTMFINVQ HPGEAGSHPR APDASMRGSL SMDEYLAQNP LAFSQWPEAG GGRPRSATVV ITKDDGGVVG S
|
| |
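The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, together with a GC fraction helper and a spot check that the first ten codons of the gene sequence agree with the first ten residues of the protein sequence. The helper names are hypothetical, and the codon table is deliberately partial: it covers only the codons that occur in the first 30 bases, not all of translation table 11.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Partial codon table: only the codons found in the first 30 bases of this gene.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AAG": "K", "CCC": "P", "GAT": "D",
    "ACG": "T", "CTC": "L", "GAC": "D", "CTT": "L",
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def translate_prefix(seq: str) -> str:
    """Translate whole codons using the partial table above."""
    bases = clean(seq)
    usable = len(bases) - len(bases) % 3
    return "".join(PARTIAL_CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last two blocks of the gene sequence, copied from the record.
first_30 = "ATGAAGAAGC CCGATACGCT CGACGACCTT"
last_blocks = "CGTGGTCGGC TCCTGA"

prefix_protein = translate_prefix(first_30)
last_codon = clean(last_blocks)[-3:]

print(prefix_protein)             # prints: MKKPDTLDDL
print(last_codon in STOP_CODONS)  # prints: True
```

Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, and `len(clean(...))` can be compared with the Gene Length field.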