Gene Information |
Locus tag | Tmz1t_4040 |
Symbol | |
ID | 7873682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4435962 |
End bp | 4437452 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700973 |
Product | hypothetical protein |
Protein accession | YP_002890996 |
Protein GI | 237654682 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
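The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4435962
end_bp = 4437452


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.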
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTGT CCCCTGAACA CGAAGCGCAG ATCCTGCGCT ACTACCACGC CGAGCGCTGG CGCATCGGCA CCATCGCCGT GCAGCTCGGG CTGCACCGCG ACACCGTCGC GCGCGTGCTC GCCCAGGCCG GCCTGCCCCG GCACGGCCCC GTGCAGCGCG CCTCGGCGAT CGACCCCTAT CTGCCCTTCC TCCACGAGAC GCTCGCGCAG TTTCCGCGCC TTACGGCCGC GCGGCTCTAC GACATGGTGC GCGCACGCGG TTACCCCGGG CGCCCCGATC ACTTCCGCCA CCTCATCGCC CGCCACCGCC CGCGCCCGAG CGCCGAGGCC TACCTGCGGC TACGCACCCT GCCCGGCGAG CAGGCCCAGG TCGACTGGGG GCACTTCGGG CACCTGACGA TCGGGCGCGC ACGCCGTCCG CTGATGGCCT TCGTCATGGT GCTGTCGTGG TCGCGCCAGA TCTATCTGCG CTTCTTCCTC GATGCGCGCA TGGAGAACTT CCTGCGCGGC CATGTCGGCG CCTTCGAGTG CTGGGGCGCG GTGCCGCGCA TCGCCCTCTA CGACAATCTG AAGAGCGCGG TGCTCGAGCG TTGCGGCAAC GCGATCCGCT TCCACCCGAC CCTGCTCGCG CTCGCCGGCC ACTACCGCTT CGAGCCGCGC CCGGTGGCGG TGGCGCGTGG CAACGAGAAG GGGCGCGTCG AGCGCGCGAT CCGCTACGTG CGCGAGGCCT TCTTCGCCGG ATGCGCCTTC GCCGATCTGG ACGACCTCAA CGCGCAGGCC CAGGCCTGGT GCGAGGGCGC CGCCGGCGCG CGGCGCTGCC CCGAGGACGC CTCGATGACG GTGACCGAGG CCTTCGCGGC CGAGCGCGAG CGCTTGCTCG CGCTGCCCGA GGCGCCGTTC CCGACCGATG AGCTGCGCGC GGTGTCGGCG GGCAAGACTC CGTACGTGCG CTTCGATCTG AACGACTACT CGATCCCCCA CACCCATGTG CAGCGCCCCC TCACCGTGTG CGCCGACCCG CTGCGGGTGC GCATCCTCGA CGGCGAGGAC GTCATCGCCA CCCACGCGCG CAGCTACGAC CGCCGCCAGC AGATCGAGTG TGCCGCGCAC CTCGAGGCGC TCGTCGCGCA CAAGCACGCG GCCCGCGCCC ACCGCGCCAC CGACCGCCTG ACGGCGGCCG TGCCGGCCTG CCAGGCGCTG CTCGCCCAGG CCGCCGAGCG CGGTGAGCCG CTCGGGCGCA CCACGCGTGC GCTCACCGAC CTGCTCGATC GCTACGGGGC GGGCGAACTG GCCGTCGCCG TCGACGAAGC ACTCGCGCGC GGCGTGCCGC ATCCCAACGC GGTACGCCTG GCGCTCGAGC GCCGGCGCGA GGCGCCCCCG CCGCTGGGCG TGCCGCTACC CGCGCATCTG AAGACGCGCG ACGTCACCGT GCGCGCCCAC CCCTTGGCCG GCTACGACCG CCTGCTGGAG GACGACCATG ACGACGCCTG A
|
Protein sequence | MALSPEHEAQ ILRYYHAERW RIGTIAVQLG LHRDTVARVL AQAGLPRHGP VQRASAIDPY LPFLHETLAQ FPRLTAARLY DMVRARGYPG RPDHFRHLIA RHRPRPSAEA YLRLRTLPGE QAQVDWGHFG HLTIGRARRP LMAFVMVLSW SRQIYLRFFL DARMENFLRG HVGAFECWGA VPRIALYDNL KSAVLERCGN AIRFHPTLLA LAGHYRFEPR PVAVARGNEK GRVERAIRYV REAFFAGCAF ADLDDLNAQA QAWCEGAAGA RRCPEDASMT VTEAFAAERE RLLALPEAPF PTDELRAVSA GKTPYVRFDL NDYSIPHTHV QRPLTVCADP LRVRILDGED VIATHARSYD RRQQIECAAH LEALVAHKHA ARAHRATDRL TAAVPACQAL LAQAAERGEP LGRTTRALTD LLDRYGAGEL AVAVDEALAR GVPHPNAVRL ALERRREAPP PLGVPLPAHL KTRDVTVRAH PLAGYDRLLE DDHDDA
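The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example, not the tool that produced the record. It assumes the gene sequence is listed 5' to 3' in the coding orientation (it begins with ATG and ends with TGA), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code, differing only in the permitted start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first and last blocks of the sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence listed above.
head = "ATGGCCCTGT CCCCTGAACA CGAAGCGCAG"
tail = "GACGACCATG ACGACGCCTG A"

print(translate(head))  # first ten residues
print(translate(tail))  # last six residues, then the stop codon
```

Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full protein sequence and the GC content field to be checked in the same way.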
|
| |