Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3010 |
Symbol | |
ID | 7874399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3261229 |
End bp | 3262230 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643699931 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002889985 |
Protein GI | 237653671 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.29026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCC CCGCCGCATC GCTCGCCACG CCGCTGCGCC ACGCCGCCGA CCGCTGGCTG TTCCGCATCG GCGCCCCGGA GCCCGCGCCG ATCCGCCTCG GCCAACGCCG CATCTACGTG CTGCCGAGCG CCGCGGGCAT CGGCTTCGCC TTCGCGCTGC TGGTGATGCT GATCGCGGCG ATCAACTACA ACCTGAGCCT CGGTTACGCG CTCGTGTTCA CCCTCGGCGG CAGCGCCGCA GCGAGCATGG TGCACGCCTT CCGCAACCTC CATGGCCTGT CGATCCGCCC GGGGCGCTGC GCACCGGTGT TCGCGGGCGA GACTGCCGTG TTCTCCCTCC TGGTCGATAA CCCCGCCCCG CGACGACGTC CTGCCTTGCG CCTCGCCGCG CACGGGCAAT GGAGCGCATT CGCGCTCCCC CCCGCACAGG AAAGCGCCGT CACGCTGGCC TGCCCCGCGC TCCGGCGCGG CCGGCTCGCG CTCGGGCGCA GCGTGCTCGA GACGCACTGG CCGCTCGGCC TCATCCGCGC CTGGAGCGTG TTCGTCCCCG AGGCCGAGTG CCTGGTGCTG CCCACGCCGG AGTCCGATCC GCCGCCCCTG CCCCTCGATG CAGGCGGCGA TGCCGACGGC GGCCGGCGCG CACGCGAGGG CGACGACGAC TTCGCCGGCC TGCGCGCGCA TCGCAGCGCC GACTCGCCGC GCCATGTGGC GTGGAAGGTG CTCGCGCGCG GAGGTCCGAT GCTGACCAAG GAGTTCGCCG CCGGACAGGA GCGCGCGCTC CTGCTCGACT GGGAGCGTCT TCCAGCCGGA CTCGACGACG AACGCCGGCT GTCGCGGCTC ACCGCCTGGG TGCTCGCCGC CGAGCGCGAA GGCCTGCGCT ACGCGCTCGC GCTGCCCGGC GTGCGCGTGC CTGCGGCGAA CGGCAGCGCC CATCACGCAC GCTGCCTGCG CCTGCTCGCA CTCCATGGGC TTGCCGACGC AGCGGTGGAG GACGCAGCGT GA
|
Protein sequence | MARPAASLAT PLRHAADRWL FRIGAPEPAP IRLGQRRIYV LPSAAGIGFA FALLVMLIAA INYNLSLGYA LVFTLGGSAA ASMVHAFRNL HGLSIRPGRC APVFAGETAV FSLLVDNPAP RRRPALRLAA HGQWSAFALP PAQESAVTLA CPALRRGRLA LGRSVLETHW PLGLIRAWSV FVPEAECLVL PTPESDPPPL PLDAGGDADG GRRAREGDDD FAGLRAHRSA DSPRHVAWKV LARGGPMLTK EFAAGQERAL LLDWERLPAG LDDERRLSRL TAWVLAAERE GLRYALALPG VRVPAANGSA HHARCLRLLA LHGLADAAVE DAA
|
| |