Gene Information |
Locus tag | Tmz1t_3596 |
Symbol | |
ID | 7873101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3943993 |
End bp | 3944982 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700536 |
Product | hypothetical protein |
Protein accession | YP_002890566 |
Protein GI | 237654252 |
COG category | [S] Function unknown |
COG ID | [COG3586] Uncharacterized conserved protein |
TIGRFAM ID | |
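The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq):
    """Percentage of G and C bases; whitespace used for grouping is ignored."""
    bases = "".join(seq.split()).upper()
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(3943993, 3944982)
print(length)                  # 990
print(protein_length(length))  # 329
```

Under those assumptions the reported Gene Length (990 bp) and Protein Length (329 aa) agree with the coordinates. `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content.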
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000023029 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACA TCAAGCTTTT TCGCCTGCAG TCCGGCAAGG CGTTAGAACT CCAAGGTGAC GCCTCCGACC TTGAGAAGCC GCTGCAGACG CTGATCGAGG CCAACCTCGA CACCCTGCTC GGCATCCGTT TCCTGGCCAC CGAATACTCC ACCGGCAAGA CCCACGCAGG CCGGATCGAC TCGCTGGGCC TGGACGAGAA CAACTGCCCC GTCATCCTGG AATACAAGCG CTCGGTCGGC GAGAACGTCA TCAACCAGGG CCTGTTCTAC CTGGACTGGC TGATGGACCA CCAGGCCGAG TTCAAGCTGC TGGTGATGGA CCAGTTGGGC AAGCCCGCGG CCGATGCCAT CGACTGGACC GCGCCGCGGC TGGTGTGCAT CGCGGCCGAC TTCACCAAGT ACGACGGGCA TGCGGTTCAG CAGATCAACC GCAACATCGA GCTGATTCGC TACCGGCGCT TCGGCGACGA GTTGCTGCTG CTGGAGCTGG CCAACGCGGC CAGCGCCAAT GCGGGCAAGG CGACGACAAC CAAGGCCGTG AAGCCAGCCA AGGCAGCGCC TGCGCCAGCG GCAGAGCCCA CCAGCGCCAA AGGTGCGGGC GGCGACCGTT CCTACGCCGA CTGGCTTCCG CTGTTGCCGC CGCACCTCTG CGAGCTGGTG GCATCGCTGG AGGGGCACGT CTTGTCGCTG GGCGACGACG TGCAGCGCAA GGAGCTCAAG CTCTACGTGG CCTTCAAGCG CCTGAAGAAC TTCGCCACGG TGGTGCCGCA AAGGGCTCGG TCGCTCCTTT ACCTGCACGT CGACCCCGAT CAAGTGCAGC CGCTGCCGTC GAATGGGCGG GATGTTCGGC AGCAAGGGCA TTGGGGGACC GGCGACCTGG AACTGGCTCT GGCGTCTCAG GCGGATCTCG ATGCCGTGAA ACCGCTGATC CTGATGGCGT ATGAGGGGCG CGCGGTAGCG GCAACGTCCG CCTCCGGGTA CGGGAGCTGA
Protein sequence | MSDIKLFRLQ SGKALELQGD ASDLEKPLQT LIEANLDTLL GIRFLATEYS TGKTHAGRID SLGLDENNCP VILEYKRSVG ENVINQGLFY LDWLMDHQAE FKLLVMDQLG KPAADAIDWT APRLVCIAAD FTKYDGHAVQ QINRNIELIR YRRFGDELLL LELANAASAN AGKATTTKAV KPAKAAPAPA AEPTSAKGAG GDRSYADWLP LLPPHLCELV ASLEGHVLSL GDDVQRKELK LYVAFKRLKN FATVVPQRAR SLLYLHVDPD QVQPLPSNGR DVRQQGHWGT GDLELALASQ ADLDAVKPLI LMAYEGRAVA ATSASGYGS
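The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The sketch below, with hypothetical helper names, strips the grouping and runs basic sanity checks on a coding sequence. It assumes the stop codons of translation table 11 (TAA, TAG, TGA); the ATG check is informational only, since table 11 also permits alternative start codons.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq):
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def check_cds(seq):
    """Return basic sanity checks for a coding sequence given as grouped text."""
    s = normalize(seq)
    return {
        "length_bp": len(s),
        "multiple_of_three": len(s) % 3 == 0,
        "starts_with_atg": s.startswith("ATG"),  # informational, not required
        "ends_with_stop": s[-3:] in STOP_CODONS,
    }


# Toy input in the same 10-character block layout (not the record's sequence).
report = check_cds("ATGAAACCCG GGTTTTGA")
print(report)
```

Passing the full Gene sequence field to `check_cds` gives a quick check that the pasted text was not truncated; `length_bp` should then match the Gene Length field.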