Gene Information |
Locus tag | Tmz1t_0307 |
Symbol | |
ID | 7085608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 350282 |
End bp | 352636 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697346 |
Product | type III restriction protein res subunit |
Protein accession | YP_002353994 |
Protein GI | 217968760 |
COG category | [S] Function unknown |
COG ID | [COG4951] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
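The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(350282, 352636)
print(length, protein_length(length))
```

With the values in this record, the result agrees with the listed Gene Length (2355 bp) and Protein Length (784 aa).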
Plasmid Coverage Information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCGGACC ACGATGAACT CCGCGCGCTC CGCGCAGAGA ACGGCCGGCT GATCGCGCTG CTCGAATCGC ACGGCATCGA ATGGCGTGCG CAGCAGCGGC CAGCCTCATC GCCTGTCGAG CCAGCGCGGC TGTCCGCTGA AGAGAAGGTC TCGCTCTTCC GACGGCTTTT CCGCGGTCGC ACGGACGCCT ACCCGGTCCG ATGGGAAAGC AAGACCACGG GCCAATCGGG CTACGCGCCG GCCTGTGCAA ACGAGTGGCG TGCCGGAGTG TGCGAAAAGC CGCGCATCAA GTGCGGCGAC TGCGCCAATC GCCTGCTGAT CCCACTCTCC GATGCCGTGA TCTATGACCA TCTTGCTGGC GAGCACACGG TCGGGGTCTA TCCGCTGCTG GAAGATGACA CCTGTTACTT CCTGGCGGTC GATTTTGACG AAGCTGCCTG GCGCGATGAC GCGCGCGCCT TTATGCAGTC CTGCGAGGAA CTGGGCGTTC CGGCCGTGCT GGAAATTTCA AGATCAGGCA AGGGCGCACA CGCGTGGGTG TTCTTCGCCA GTCGCGTTGC CGCCCGTGAC GCCCGGCGCC TGGGTACGGC CATCATCAGC CATACCTGTT CGCGTACCCG GCAACTGAAG CTGGAGTCTT ACGACCGCCT GTTTCCGAAC CAGGACACGA TGCCCAAGGG CGGCTTCGGC AACCTGATCG CCTTACCGTT GCAGAAACGG CCTCGCGGGA GTGGCTGCAG CGTATTCGTT GATGCTGACC TGCGGCCATA CCCGGATCAG TGGGCGTTCC TGGCGTCCGT CCGGCCGATG GCGCCGCACG ACATCGAACC GACCATCCTG CTGGCGACGG GCGGCGTCCA CCCGTTGGAT GTGACATTCA TCGAAGATGA AGAGTTGGCC ACACCTTGGA AGCGGCAGAG CACGTCAATC AAGAAGCTGG CCGGGCAGAT GCCCAAGTCC CTGACCGTGA CGCTGGCCAA CCTGATCTAT TTTGAGAAGG CCCAACTGCC GCAGGTACTC GCCAATCGTC TGATTCGTCT GGCCGCCTTC CAGAACCCAG AGTTCTACAA GGCTCAGGCC ATGCGGATCT CGGTGTGGGG CAAGCCGCGC GTCGTCGGCA ATGCGGAGAA CTACCCGCAG CACATCGCTT TGCCTCGCGG TTGCCTCGAT GCTGCCCTGG ATCTACTGCG GGACAACGGT ATCGCCTGTG ATTTGCGGGA TGAGCGCTTC GGCGGCGACC CCATCGATGT CGCCTTTGCC GGCACTTTGC GTTTGGACCA GGAAGCCGCA GTTGCCGGAA TGCTGCACCA CGACACCGGC GTGCTCTGCG CTCCGACAGC CTTCGGCAAG ACGGTCACCG CCGCCGCAAT GATCGCGCGG CGGGGCGTGA ACACCCTGGT GCTGGTGCAT CGCACAGAAC TGCTCAAGCA GTGGCAGGAG CGTCTGCAAG CCTTCCTGGG TGTTGGTCAG GGTGTGGTCG GCACCATTGG CGGTGGCAAG GCCAAGCCCA CGGGCAAGAT AGACATCGCC GTTATGCAAT CCTTGTCCCG CCAGGGCGAG GTCAACCCAC TGGTCGAAAA CTACGGGCAG GTGATTGTGG ACGAGTGCCA TCATGTCGGC GCAGTGTCAT TCGACGCGAT CCTGAAACGG TCCAAGGCCA AATACGTGCG GGGACTGACG GCAACGCCCA TCCGCCGTGA TGGTCAGCAG CCCATCATCT TCATGCAGTG CGGGCCGATC CGATACACGG CGGCGAAGCC AGTCGGTGCA CCGCACGATC TCGAAGTGCT GCCGCGTTCG 
CGCTTCACAC GGATCGACCT GCCGACCGAT GCAGGCATCC AGGACGTTTT TCGGCATCTC GCCAATGACC GGGCCAGAAC TGAAGCTATC GCCGACGAGG TGCGCGATGC CCTCGGGCAG GGGCGCAAGG TGCTGGTCCT GACCGAACGT ACTGAACACC TTGATGCGAT CAAGGCAACC CTTGATGGAT TGGAGCCCGC GCCTTTCGTT CTGCACGGTC GAATGTCCAG AAAGCAGCGA GCGGCGCTGG TTGCTGACCT GGATGCACTG CCGCCCGACG CGCCGCGCGT CCTTCTTTCG ACGGGGAAGC TGGTTGGCGA GGGCTTCGAT CACCCGCCGC TCGACACGTT GGTGCTGGCC ATGCCTGTGT CCTGGAAGGG CACTCTGCAG CAGTACGCCG GACGCCTGCA CCGGGAGCAC GCTAGCAAGA CCGACGTACG GATCATCGAT TTCGTGGATG CGGGTCATCC GGCGTTACTG CGGATGTGGG ACAAGCGGCA GCGCGGTTAC CGTGCGATGG GGTACAAGGT CGGCCCCGAT GGCCCTGCGG AATGA
|
Protein sequence | MADHDELRAL RAENGRLIAL LESHGIEWRA QQRPASSPVE PARLSAEEKV SLFRRLFRGR TDAYPVRWES KTTGQSGYAP ACANEWRAGV CEKPRIKCGD CANRLLIPLS DAVIYDHLAG EHTVGVYPLL EDDTCYFLAV DFDEAAWRDD ARAFMQSCEE LGVPAVLEIS RSGKGAHAWV FFASRVAARD ARRLGTAIIS HTCSRTRQLK LESYDRLFPN QDTMPKGGFG NLIALPLQKR PRGSGCSVFV DADLRPYPDQ WAFLASVRPM APHDIEPTIL LATGGVHPLD VTFIEDEELA TPWKRQSTSI KKLAGQMPKS LTVTLANLIY FEKAQLPQVL ANRLIRLAAF QNPEFYKAQA MRISVWGKPR VVGNAENYPQ HIALPRGCLD AALDLLRDNG IACDLRDERF GGDPIDVAFA GTLRLDQEAA VAGMLHHDTG VLCAPTAFGK TVTAAAMIAR RGVNTLVLVH RTELLKQWQE RLQAFLGVGQ GVVGTIGGGK AKPTGKIDIA VMQSLSRQGE VNPLVENYGQ VIVDECHHVG AVSFDAILKR SKAKYVRGLT ATPIRRDGQQ PIIFMQCGPI RYTAAKPVGA PHDLEVLPRS RFTRIDLPTD AGIQDVFRHL ANDRARTEAI ADEVRDALGQ GRKVLVLTER TEHLDAIKAT LDGLEPAPFV LHGRMSRKQR AALVADLDAL PPDAPRVLLS TGKLVGEGFD HPPLDTLVLA MPVSWKGTLQ QYAGRLHREH ASKTDVRIID FVDAGHPALL RMWDKRQRGY RAMGYKVGPD GPAE
|
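The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It builds the codon-to-amino-acid mapping used by translation table 11, which shares its amino acid assignments with the standard code, strips the spaces that separate the 10-base groups, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 24 bases of the gene sequence are used here.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout; the amino acid string is the
# standard assignment, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 24 bases of the gene sequence in this record.
prefix = "ATGGCGGACC ACGATGAACT CCGC"
print(translate(prefix))    # MADHDELR
print(gc_fraction(prefix))  # 0.625
```

The translated prefix matches the first eight residues of the listed protein sequence. Passing the full gene sequence to the same functions would be the way to cross-check the complete protein and the reported GC content.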