Gene Information |
Locus tag | Tmz1t_1874 |
Symbol | |
ID | 7084297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2116906 |
End bp | 2118558 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698897 |
Product | hypothetical protein |
Protein accession | YP_002355522 |
Protein GI | 217970288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
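The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2116906
end_bp = 2118558
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not add a residue.
codons = span_bp // 3
coded_residues = codons - 1

print(span_bp, codons, coded_residues)  # 1653 551 550
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.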
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.480052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGCA TCGATTCCGC CCTGCGCGCC GAGCGCCCGA TGCGTCCCGC CCGCGGCGCC CCCTCCGCGG CTCCCCACCA GCCGCAGGTC ATCGCCCCCT TCTTCGCCGG CGAAGGCGAG GGGCTGACCC TGCATCGCTA CTTCGACGAG GACTTCGTCG CGCGTTTCCT CAACGACGCC CAGGCTGGCC GGCTCACCGG GACTGCCGCG CAGCCATGGT TCCGCGCCGA CCGTTTCGGC CGCTTCGGCG CCGAGCCGAC CCTGCGCCTG CCCCTGCACC GCTGCTTCTA CCTCGTGTGC GCCGAGGTGC GCTGCATGGC CGCAGGCGGC CCCGCCTTCG ATCCGCGCAA GATCCTCGGC GCCGGCCTCG TGGTGCGCCG GGTAGCGGCC GACGGCTCGG CGCAGCGCTG GATGATCGCC GACGGCCAGC CATTGGGCTG GCGCGACGGC AGGATCCCCG ATCACGACCC CGACGACGTG CACCGCCTCC GCGCGCGCGG GCTGCTGCCG GTGCGCTTCC CCGAACCGCC CTATTCCGGA GAGGAGGTCG CCCCCCTGCA CACGCTGCTG GCGACCCGCC GCGACGCACG CGGAGTCGAG CGCAAGCACA CCCTGCTGTG GGGCTACCTG CCGCTCTCGG GCAGCGCCCG CGTGCAGGCC GAGGCGCCGC TGCCGCAGAC CGACGGCGGC GACGCCCCCG ACTTCGGGCT GGAGCATGCA TGGCCGCTCG GCACCCGCGA CGCCAGGCCC TGGGTGGACG GCGACGGCCT CGTCGTCACC GAGGGTGTCG CCAGCGTCGG CTTCGTCGAG CTGCTGCAGA CCCTGGTCGC GCAGTTCCGC ATCGACGACG CCACCGACCC CGACAACGCC GGCCTCCGCA CCCTGCTCGG CCAGATGCGC TTCCACCACT TCGAGGTCCG CCTCGTCGAG GGCGAAGGCT TCCCGCAACG GGTGCCGGTG TTCGGCGACA CGGCACTCGA ATGGCTGGAC CGCAGCGCGG CGGCCCTGGT CGAGTTCTTC GCCCGCATCG ACCGCGGCGA GTTCGTCGCC GGCCGCACCG CGCTGCCCGA CGGCACGGGC GGAAGCCGCA AGGACCGCCT GCTGCTCTCC GAACAGCAGG CCGAGAACCT GCGCAGCCTG GTCGGGCTGC GCCTGGCGCG CGCCCAGGTG CGCTTCGACG ACGGCCTGCC CATGCCGCGC TACGGCCAGG GCGACGACGA CCGCTTCACC GCCCTGCCCT TCATCCGCTG GCACGACGAG TGCGGTTGCG AGCGCGTCGC CTGGGGCCCG GCCAGCCACG TCTTCCGCGT CGCCTCGCCC TTCGACCCCG AGGCGCAGCG TCCGACCACG GTGGTCCTGC CCGCGCTCGA CGACCTCAAG CGCGGGCTTC CGCGCGGCGT GGCGATGCTG GCGCCCAAGT CGCTCGCCGA CGTGCTGCGC AAGATCAGTC CGGACGCCGA CATGAAGGGC GACGGCCCGG GCAACCGCAG CGGCGCGTGC TGGAGCTTCA GCTTCAGCCT GCCGGCGATC ACGCTGTGCG CGACCATGCT GCTGATGGTC GTGATCAACC TGCTGAACCT CTTCCTCGGC TGGCTGCCGC GGGTATTCCT CGCGCTGCCG CGCCTGTGCC TGAAGGCGCT CCGGGGCAAG TGA
|
Protein sequence | MVSIDSALRA ERPMRPARGA PSAAPHQPQV IAPFFAGEGE GLTLHRYFDE DFVARFLNDA QAGRLTGTAA QPWFRADRFG RFGAEPTLRL PLHRCFYLVC AEVRCMAAGG PAFDPRKILG AGLVVRRVAA DGSAQRWMIA DGQPLGWRDG RIPDHDPDDV HRLRARGLLP VRFPEPPYSG EEVAPLHTLL ATRRDARGVE RKHTLLWGYL PLSGSARVQA EAPLPQTDGG DAPDFGLEHA WPLGTRDARP WVDGDGLVVT EGVASVGFVE LLQTLVAQFR IDDATDPDNA GLRTLLGQMR FHHFEVRLVE GEGFPQRVPV FGDTALEWLD RSAAALVEFF ARIDRGEFVA GRTALPDGTG GSRKDRLLLS EQQAENLRSL VGLRLARAQV RFDDGLPMPR YGQGDDDRFT ALPFIRWHDE CGCERVAWGP ASHVFRVASP FDPEAQRPTT VVLPALDDLK RGLPRGVAML APKSLADVLR KISPDADMKG DGPGNRSGAC WSFSFSLPAI TLCATMLLMV VINLLNLFLG WLPRVFLALP RLCLKALRGK
|
| |
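The sequence fields are printed in blocks of ten characters, so the spaces need to be stripped before any analysis. The sketch below is an illustration rather than the database's own tooling: it removes the spacing, computes a GC fraction, and translates the first three blocks of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard amino acid assignments, codons ordered T, C, A, G at each position.
# Translation table 11 uses the same assignments (assumption stated in the lead-in).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
gene_prefix = "ATGGTGAGCA TCGATTCCGC CCTGCGCGCC"
print(translate(gene_prefix))  # MVSIDSALRA
```

The translated prefix agrees with the first block of the protein sequence field. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.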