Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0056 |
Symbol | |
ID | 7083439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 61359 |
End bp | 62705 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697103 |
Product | protein of unknown function DUF88 |
Protein accession | YP_002353752 |
Protein GI | 217968518 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAGCG CCCTCTTCGT CGACTTCGAT AACGTCTATT CCGGCCTGCG CAAGCTCGAC CCGGCCATCG CCGACCGCTT CGCCCGCCAG CCCCTGGAGT GGGTGAACTG GGTGATCGGC GAGCTCGAGC TGCCCGACCA CGCCCCCGCC GGCGCGCGCC GCCGGCTGCT GGTGCGACGC TGCTATCTCA ACCCGCAGGC CTATCAGCGC TTCCGGCCGT CCTTCAACCT CGCCGGCTTC GAGATCATCG ACTGCCCCGC GCTGACGAGC GAGGGCAAGA CCAGCACCGA CATCCACATG GTGCTCGACA TCATCGACCT GCTGCAGCAC GAGGCCCGTT ACGACGAGTT CATCGTGTTC TCGGCCGACG CCGACTTCAC CCCGGTGCTG CGCAAGCTGC GGCGCTGGGA TCGGCGCACC ACGGTGCTGG CGATCGGCTT TCCGTCCGCG GCCTATCGCG CCTCGGCCGA CCTGCTGATC GACCCCGACG AGTTCGTCCG CAACGCCCTC GGCTTCAAGG ACGAGGACGA TGCGCCCGGC GCCGGCGCGC CCCCTGCCGA CGTCGCCCCG GAAAGCCTGT CGCTCGCAGC GAGCGCACGT TTCGCGCAGG CGTCGCCGGA AGCGGCCCGG AGCGTTCCCC CGGCCTCGTT CTCCAGCGGC GCCGACGCCC TCGACCACGA CCTCGCCGAG ATCGTCGAGA CCATGCGCGC CGAGGTTGCG CGCTCGGCCG CGCCCGTCCC CTGCAGCCGC CTGGCGAGCC TGATCACCAC GCGCCACCCC GCCCTCGCCG CCGACTGGAA CGGCAAGGGC AGCTTCCGCA GGTTCGTCGA CAGCCTCGAG CTCGCGCCGC TGCGTTTCGA CTGGAGCAAC AGCGGCGGCT TCGTCTTCGA TCCCGCCCGC TCCGGCGCCG GCATGCAGGA CGGCAGCGTC ACGCCCGACT GGGGCGAGGA TCAGGACATC CTGCCGCTCG CCATGCAGAT CCACGAGGTC TCCGGCGTCC CCCTGCTCGG CCCGGCCGAA TACCGCGCGC TGTTCTCGAT CATCGCGGAC GACGTCGGCC GCCATCCCTT CGACCTCAAG GAAACCGGCA AGCGCGTGCG CGACCGCCTG CGCGAAGCCG GTCTCGGCGG CAGCCGGCTC GACGTCAACT GGATCCTGCG CGGACTGCTG ATGCGCGGCC ACGCCTTCGG CGAGGGGCAG GACCGCGCCG CGACGCTCGC CACCAAGACG GCGAACAACG TGCGCTCGCT CTGCCTGCGC GAGCAGATGA TCCTCGACCG CGCGGCCGAG ACGGCGATCG CGCGCTGGTT CGGCACCCAT CCGGCTCCCA CCCGCAGCGG GCAGTGA
|
Protein sequence | MKSALFVDFD NVYSGLRKLD PAIADRFARQ PLEWVNWVIG ELELPDHAPA GARRRLLVRR CYLNPQAYQR FRPSFNLAGF EIIDCPALTS EGKTSTDIHM VLDIIDLLQH EARYDEFIVF SADADFTPVL RKLRRWDRRT TVLAIGFPSA AYRASADLLI DPDEFVRNAL GFKDEDDAPG AGAPPADVAP ESLSLAASAR FAQASPEAAR SVPPASFSSG ADALDHDLAE IVETMRAEVA RSAAPVPCSR LASLITTRHP ALAADWNGKG SFRRFVDSLE LAPLRFDWSN SGGFVFDPAR SGAGMQDGSV TPDWGEDQDI LPLAMQIHEV SGVPLLGPAE YRALFSIIAD DVGRHPFDLK ETGKRVRDRL REAGLGGSRL DVNWILRGLL MRGHAFGEGQ DRAATLATKT ANNVRSLCLR EQMILDRAAE TAIARWFGTH PAPTRSGQ
|
| |