Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2084 |
Symbol | |
ID | 7085354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2357813 |
End bp | 2359117 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643699104 |
Product | hypothetical protein |
Protein accession | YP_002355721 |
Protein GI | 217970487 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0757382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCGCAA ATTTGTTTGA GCAATTCCGC ACAAATCTTC AGGTCGCCAA CAGCGACGCT ATTTCAAATT CCTATCGGGC GATTACGAAG CGCCTGAACA AGGATTTCTG GGATATCGAC TCGGAAGAAC AGCACTGCTT GCAGGTGGGC TCCTACGGCC GACATACGGC CATCAGGGAC GTGAGCGACC TCGACATGGT CTTCGAGCTC CCTCAATCGG TCTATGAACG TGTCTCGAAA GTCAATGCTA ACGGCCCGTC ACAACTTCTT CAGGAGGTAA GAAAATCACT GAAGGAACGT TACCCGGCAA GCGACATTCG AGCAGCGCAA CAAGTTGTCC AAGTCCACTT CGAAAAGTAC CGTATAGAGG TTTTGCCCGC TTTCAAGCAA GCGGACGGTC GATACCGCTA CGGCGACGCC AACGACGGAG GGTCTTGGGA CAACTACTGC AACCCGCGGG CAGAAATCAA TGCCCTCAAC ACTCAGAATA AAACCTCAAA TAGAAATCTG AAGCGTGTCT GCAAGATGCT CCGCGCTTGG AAGAACAAGC ACGGCGCCCC GATGAGCGGG ATGCTGATTG ATACGCTTGT AAGCAGATTC TTTCAGGAAA ACACTTCGTA CAATGATAAA TCTTACTCTT CATATCCAGC ACTGACGCGT GACGTTTTCG CCTTTCTCGG GAACTTGGAC GAGCAAGACT ATTGGTTAGC TCCAGGCAGT CGCTCACGAG TACAGACGAA AGGAAAATTT CAACGCAAAG CCAAAAAGGC TGCTGCAAAG TGTCAGGACG CGTTGGACGC GGGAAAAGAC ACAAAGAAGG TCAAACTCTG GAAGGAGGTG TTCGGTCGCC GCTTCCCCAG TCTGCAGACC GATTCTGCTA CCAAAGCTTT CGTCGAGGCG CGTCAAGATA CCAAGACGGA GCAGTTTATT GAGGACCTAT ACCCCGTCGA TATTCAGTAC AGCCTCGACA TCGAGTGCAT CGTTGCTTAT GCCGGCAACG AGGAAACGCG TTATCGATTC ATGGAAAGTG TATTTCCATG GTTGAAGCTT GGCCGCAGCC TCACTTTCAT AATCGTGAAC TGTAATGTGC CCGAGCCGTA TGAGATATTT TGGAAGGTGC GGAACGTAGG CGTCGTCGCG GAGCGGCGAA ACATGATTCG GGGCCACATT ACTCGCGACA CCGGGCGTCG TCAGGCAGTC GAAACCACAA GCTTTGGAGG CGAACACTTC GTGGAGTGCT ATGTGATAAA GGACGGAATT TGCGTCGCGC GAGACCTCAT TGAGGTTCCT ATTGAGATTA CATAG
|
Protein sequence | MSANLFEQFR TNLQVANSDA ISNSYRAITK RLNKDFWDID SEEQHCLQVG SYGRHTAIRD VSDLDMVFEL PQSVYERVSK VNANGPSQLL QEVRKSLKER YPASDIRAAQ QVVQVHFEKY RIEVLPAFKQ ADGRYRYGDA NDGGSWDNYC NPRAEINALN TQNKTSNRNL KRVCKMLRAW KNKHGAPMSG MLIDTLVSRF FQENTSYNDK SYSSYPALTR DVFAFLGNLD EQDYWLAPGS RSRVQTKGKF QRKAKKAAAK CQDALDAGKD TKKVKLWKEV FGRRFPSLQT DSATKAFVEA RQDTKTEQFI EDLYPVDIQY SLDIECIVAY AGNEETRYRF MESVFPWLKL GRSLTFIIVN CNVPEPYEIF WKVRNVGVVA ERRNMIRGHI TRDTGRRQAV ETTSFGGEHF VECYVIKDGI CVARDLIEVP IEIT
|
| |