Gene Information |
Locus tag | Tmz1t_3594 |
Symbol | |
ID | 7873099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3942247 |
End bp | 3943302 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643700534 |
Product | Appr-1-p processing domain protein |
Protein accession | YP_002890564 |
Protein GI | 237654250 |
COG category | [R] General function prediction only |
COG ID | [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 |
TIGRFAM ID | |
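The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3942247, 3943302)
print(length)                  # 1056
print(protein_length(length))  # 351
```

Both values agree with the Gene Length and Protein Length fields of this record.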
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000210462 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATCAAGC TCACGCAAGG TGATCTGCTG AAGCAGGACG ATGTCGACGC CATCGTGAAC ACGGTGAACT GTGTCGGCGT GATGGGCAAG GGCATCGCGC TGCAATTCAA GAACAAGTGG CCGGACAATT TTGCTGAGTA CGCGGCAGCT TGCAAGGCGG GGCAAGTGCG TCCGGGCCGA ATGTTCATCC ACGACTCAGG CGGCCTAGTC AAGCCGAACT ACATCATCAA CTTCCCGACC AAGGACCATT GGCGCGGCGC CTCTAGGATG GCGTTCATCC GCGACGGTTT GATCGACCTA GTGACGCAGG TGCGGCGCCT CGGCATTCGG TCAATTGCCA TCCCGCCGCT AGGTTGCGGG AACGGTGGGC TAGACTGGAC CCAAGTGCGG CCTTTGATCG AAGCTTCATT CGAAGCGCTT CCCGATGTTG AAGTGCGACT CTTCGAACCT GGGGGTGCGC CCAATCCAAA GACGATGGAA GTTCGGACCA AGCGTCCCCG CATGACGCCC GGCCGGGCAG CAATCGTCAA GGTCTTGAGC ACGTACGGTG AGCTGAACTA CGGGCTATCC AAGATCGAGG TTCAGAAGCT TGCGTACTTT CTGCAGGAGG CCGGCGAGCC GCTGCAGCTT CAGTTTGTGA AGCACCACTA CGGTCCGTAC TCCGACACGC TACGCCACGC GCTGAACACG ATGGAAGGGC ACTTCATTCG CGGCCTGGGC GATGGTGTTG TCGAAGCCGA AATCGAGCCC ACGGAAGACG CACTTGCCGA AGCCGAGGCG TTCATCGCAA ACGAAGGCCA TTCGGCGCTC TCAGCCCGTG TTGAGCGCGT GGGGCGGCTT ATCGATGGCT ACCAATCGTC GTATGGCATG GAACTGTTGG CCTCGGTTCA CTGGGTCGCG GCACACGAGC CCGGCGTACG CTCGGTCGAT GAAGCGATTA CGGCGGTGCA CGGCTGGAAC GATCGGAAGA AGCTGCTCAT GCAGCCCGAT CACGTTAAGT TTGCTTGGCA TCGGCTTGCT GAGGAAGGCT GGCTTTCGTC TAGCGCTTTT CCGTAG
Protein sequence | MIKLTQGDLL KQDDVDAIVN TVNCVGVMGK GIALQFKNKW PDNFAEYAAA CKAGQVRPGR MFIHDSGGLV KPNYIINFPT KDHWRGASRM AFIRDGLIDL VTQVRRLGIR SIAIPPLGCG NGGLDWTQVR PLIEASFEAL PDVEVRLFEP GGAPNPKTME VRTKRPRMTP GRAAIVKVLS TYGELNYGLS KIEVQKLAYF LQEAGEPLQL QFVKHHYGPY SDTLRHALNT MEGHFIRGLG DGVVEAEIEP TEDALAEAEA FIANEGHSAL SARVERVGRL IDGYQSSYGM ELLASVHWVA AHEPGVRSVD EAITAVHGWN DRKKLLMQPD HVKFAWHRLA EEGWLSSSAF P
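The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below strips that spacing, computes GC content, and translates a coding sequence. It assumes the displayed gene sequence is the coding strand read 5' to 3', and it relies on translation table 11 assigning the same amino acids as the standard genetic code (the two differ in permitted start codons). The function names are illustrative, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (the NCBI table layout);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGATCAAGC TCACGCAAGG TGATCTGCTG"
print(translate(prefix))  # MIKLTQGDLL
```

The translated prefix matches the first block of the Protein sequence field. If the assumptions hold, passing the full Gene sequence field to `translate` and `gc_fraction` should be comparable with the Protein sequence and GC content fields.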