Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1970 |
Symbol | |
ID | 7084438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2221486 |
End bp | 2222556 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698995 |
Product | hypothetical protein |
Protein accession | YP_002355617 |
Protein GI | 217970383 |
COG category | [S] Function unknown |
COG ID | [COG4255] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00552765 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCC AGCTCCTGAT CCCCGGCCTG CTATGGCCTG TCGCAACCTT GCTCGGGCCG GCCTCCGGAC TCGCGCTCGA GGGTCTCGCG ACCCTGCTCG GCCGCGGCCG CCGGGAGCTG ACCGCCTTCG AGCCTTACGA TCGCCAGCTC GCGCGCCTGT TCGGCGTGCA CGGCGACACC CTGCCGATGG CGACCCTGCG CCGCCTGGGC GAGGCCGATG CGCCGGCGCC CGAGCCCGGC AGGCACTGGC TGTGCGCGGA TCCGGTGAAC CTGTCGTTCG CCCGCGAACA CCTGCTGCTG CAGGCTTTTC CGGACGAAGA GCTGGACGCG GCGGAGAGCG CCGAGCTGGT CGCCGAGCTC AACGCGATCT TCGCCGACCT CGGCCGCTTC GAAGCCTGCA CGCCCACGCG CTGGTACCTG CGCCTGCACC GCCCGACCGC GGTCACGCTC TACCCGCTCG ACGACGTGAC CGGGCGCCCG GTCAAGCACT TCCTGCCCGA AGGCGAAGAT GCCCGGTTGT GGCAGCGCAC CATGAACGAA GCGCAGATCG TGCTGCACAA CCATGCGCGC AGCCGCGCCC GCGAGGAGGC CGGCCACCGC GCGGTCAACA GCGTCTGGCT ATGGGGCGCG GGCGCGCTCG ATGCGCCGCC GCGGGCGCCC GCCCGCCAGG TCCAGGCGAG CGATCCGGTC AGCATCGGCC TCGCGCGCGC TGCCGGGGTG GCGTTCGGTG CGCCGGATCC CGCTGCAGCG CTTGCGCAGG ACACGCTGGT CGTCCTCGAC GAGCTGCGCA AGCCCGCACA GCAGCTCGAC CTCGACACCT GGCGGCGCGG CCTCGAGGCG ATGGAGCGCG ACTGGTTCGG CCCGCTCGCC GAGGCCTTCC GCGCCGGTCG CATCGACACC CTGCGTCTGA CCGCCCCCGG CGATCGCGGC ACGCTGCAAC TCGAGCTGCG CGCCGGCGAA CGCTGGAAGT TCTGGCGCAA GCCCTACGCC TTCGACGCGC TGCTGAAGTC CATCGCCCCC GCGCCGATGC AGATGCCCGA CGCCCCGCGC CCCGCCCATG GCGCCCCATA G
|
Protein sequence | MQIQLLIPGL LWPVATLLGP ASGLALEGLA TLLGRGRREL TAFEPYDRQL ARLFGVHGDT LPMATLRRLG EADAPAPEPG RHWLCADPVN LSFAREHLLL QAFPDEELDA AESAELVAEL NAIFADLGRF EACTPTRWYL RLHRPTAVTL YPLDDVTGRP VKHFLPEGED ARLWQRTMNE AQIVLHNHAR SRAREEAGHR AVNSVWLWGA GALDAPPRAP ARQVQASDPV SIGLARAAGV AFGAPDPAAA LAQDTLVVLD ELRKPAQQLD LDTWRRGLEA MERDWFGPLA EAFRAGRIDT LRLTAPGDRG TLQLELRAGE RWKFWRKPYA FDALLKSIAP APMQMPDAPR PAHGAP
|
| |