Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0438 |
Symbol | |
ID | 7084948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 499580 |
End bp | 500776 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697470 |
Product | domain of unknown function DUF1745 |
Protein accession | YP_002354113 |
Protein GI | 217968879 |
COG category | [S] Function unknown |
COG ID | [COG4398] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCGA CCCGCTTCGC TGCCGGACAT GCCGCCCACG CCGACTGGCG GGTGGCGCTC GCACGCGCGC TCGACGCCCT GCTGCTCGAC CTGCGCGCCG CGGGAGGCCA TGACCATGCG GGGCGGGACG GGCTGCCCGA ATATACCCTC GGCTTCTGCT ACCTGAGCGA CCGCTTCGCC GCCGACGCCG ACGAGATCGT CGCCGAGTTG CAGCGCCGCC TGCCCGGCGT GCATTGGGTC GGCACTGTGG GCATCGGGGT GGCCGCGACC GGTGCCGAGC ACTTCGACGA GCCGGCGCTC GCGCTGATGC TCGCACCCCT GCCGCGGCGC GCCTTCCGCG TGTTCTCGGG GGTGCAGCCA CTGCGTGCGG ATGCCGAGGA CTTCCACCCG CACACCGCGC TGGTGCACGC CGACGGCCGC ACCCCGGACC TGCAGGAGCT GCTGCCCGAA CTCGCCGAGC GCACCGCGAG CGGTTACCTG TTCGGCGGGT TGACCGCCTC GCGTGGGCGC AGCGTGCAGA TCGCCGACGG TGTGTTCGAC GGCGGCGGCC TGTCAGGGGT GGCCTTCTCC GCCGAGGTCG GCGTCGTCTC GCGCGTCACC CAGGGCTGCC AGCCGATCGG GCCGCGGCGG CTGGTGAGCC GCGCGCGGGA CAACTTCGTC ATCACACTGG ACGGTCGCGG TGCGCTCGAC TGCGTGATGG AGGATCTCGG CCTGCCGCCC GACATGGCGC TGGAGCCGGT CGCCGAAGCC TTGTCGAACA CCCTGGTCGG GCTGCATCCG GGGGCGGACG CGGAGCTCGT CGCCCCCGCC GCCTTCGGTG CCGACATGCT GGTGCGCAAC ATCATCGGCC TCGATCCACG TGCCGGCGTG GTGGCGATCG GCGATGAGGT GGAGCAGGGC GCCTGGCTGA TCTTCTGCCG GCGCGACCCT GCTGCGGCGC TCGCCGACCT GCGCCGCGTG GCGCGCGAGA TTCGCGACGA GCTGGTGGAC AGCGACCGTG TCGCGCTCGG CGCGGTGTAT GTGAGCTGCT CGGGCCGCGG TGGCCCGCAC TTCGGCCGTC CGGCGGCCGA GCTCGAGCTG ATCCGCGACG TGCTCGGCGA CGTGGCGCTG GCGGGCTTCT TCGCCGGCGG CGAGATCGCC CACAGCCGCC TCTACGGCTA CACCGGGGTG CTCACCGTGT TCGCCGCGCC GCGCTGA
|
Protein sequence | MNPTRFAAGH AAHADWRVAL ARALDALLLD LRAAGGHDHA GRDGLPEYTL GFCYLSDRFA ADADEIVAEL QRRLPGVHWV GTVGIGVAAT GAEHFDEPAL ALMLAPLPRR AFRVFSGVQP LRADAEDFHP HTALVHADGR TPDLQELLPE LAERTASGYL FGGLTASRGR SVQIADGVFD GGGLSGVAFS AEVGVVSRVT QGCQPIGPRR LVSRARDNFV ITLDGRGALD CVMEDLGLPP DMALEPVAEA LSNTLVGLHP GADAELVAPA AFGADMLVRN IIGLDPRAGV VAIGDEVEQG AWLIFCRRDP AAALADLRRV AREIRDELVD SDRVALGAVY VSCSGRGGPH FGRPAAELEL IRDVLGDVAL AGFFAGGEIA HSRLYGYTGV LTVFAAPR
|
| |