Gene Information |
Locus tag | Tmz1t_2527 |
Symbol | |
ID | 7873966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2724313 |
End bp | 2726274 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699449 |
Product | helicase c2 |
Protein accession | YP_002889506 |
Protein GI | 237653192 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | |
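The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions are consistent with the values reported in this record but are not stated by it. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table; names are illustrative only.
START_BP = 2724313
END_BP = 2726274


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1962 653
```

Both results match the "Gene Length" and "Protein Length" fields in the table.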
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0816709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACC CCGCCCCGAT CTTCGCCGAC GACGGCCCGC TCGCCGCCGC GATCCCCGGT TTTCGTGCCC GTCCGCAGCA GATCGAGATG GCGCAGAAGA TCGCCGAGGC GCTGCGCGAG AACCGCGTGC TGGTGGCGGA GGCCGGCACC GGCACCGGCA AGACCTTCGC CTACCTGGTG CCGGCGCTGC TGGCCGGGGG CAAGGCCATC ATCTCCACCG GCACCAAGAC GCTGCAGGAC CAGCTCTTCA ACCGCGACCT GCCCACCGTG CGCGCCGCGT TGAAGGTGCC GGTGACGATC GCGCTGCTGA AGGGGCGCGC CAACTACGTC TGCCACTACC ACCTCGAGCG CAACGCGCGC GACGGCCGCT TCCTCACCGC GCAGGACGCC GCAGACCTGC GCGCGATCAC GCGCTTCGCC AGGCTCACGC AGACCGGCGA CAAGGCCGAA TGCACCGACG TGCGCGAGGA CTCGCTGGCC TGGCTCGCCG CGACCTCGAC ACGCGACAAC TGCCTCGGCC AGGACTGCCC GCACAAGGAC GAGTGCTTCG TGATGCAGGC GCGGCGCAAC GCGATGGAGG CCGACGTGGT GGTGGTCAAT CACCACCTCT TCTTCGCCGA CGTGATGTTG CGCGACGAGG GCATGGGCGA GCTGCTGCCG GCGTGCAACG CGGTGATCTT CGACGAGGCC CACCAGCTCC CCGAGACCGC GAGCCTGTTC TTCGGCGACA GCGTGTCCAC CGCCCAGGTG CTGGAGCTGG CGCGCGATAC GCGCTCCGAG ACCGTGGCCG CGGCGCGCGA CTGCGTGGCG ATGATCGACC AGACCCGCAA CCTGGAAAAG GCCGCACGCG ACCTGCGCCT GGTGTTCGGG CCGGAGAGCG CGCGGCTCTC CGCCGCGCAG GCCGGCGAGC ACGAGAACTT CGACGTCATG GTCGAGGCGC TGGAGAAGGC GCTCGCCGAC TTCCACGCGG TCCTCGCCAC CCAGGCCGAG CGCTCCGAGG GCCTGGGCAA CTGCCTGCGC CGCACCGAGG AGATGTCCGA ACGCCTGGCG AGCTGGCGCA AGCCCGAAGA CAAGGAGCTG ATCCGCTGGG TCGAGGTCTT CACCCAGTCG CTCGCGCTCA ATGCCACCCC CCTGCACGTG TCGGACGTGT TCAAGCGCCA GCTCGAAGGC CACCCGCGCG CGTGGATCTT CACCTCGGCG ACGCTGGCGG TGGGCAAGGC CGACTTCGGT CACTACTGCC GCGAGCTGGG CCTGGCCTGG ATGGACCCGC CGCCGCTCAC CGCGGTGTGG GGCAGCCCCT TCGACTACGC CGAGCAGGCG CTGCTGTACG CGCCGGCCGG CATGCCCGAG CCCAACTCGC CCGACTACAC CGAGCGTGTC GCGAAGGTCG CGCTGCCGCT GATCCGCGCC GCAAGAGGGC GCGCCTTCGT GCTGTGCACC TCGCTGCGTG CGATGCGCCG CATCCACGAG CTGATCCTCG ACGGGCTTGC GCAGAGCGGT GACGCGCTGC CGGTGCTGCT GCAGGGCGAG GGTTCGCGCA CCGAGCTGCT CGAGCGCTTC CGCCGCCTGG GCAACGCGGT GCTGGTGGCG AGCCAGAGCT TCTGGGAGGG CGTCGACGTG CCGGGCGACG CGCTCTCGCT GGTGGTGATC GACAAGCTGC CCTTCGCGCC GCCCGACGAC CCGGTGCTGG CGGCGCGCGT CGAGCACATG CAGAAGCAGG GCCTGAGCCC CTTCGTGCAT CACCAGCTGC CCAAGACCGT GATCAACATG AAGCAGGGCG CCGGGCGCCT GATCCGCAGC 
GAGCGCGACC GCGGCGTGCT GTGCATCTGC GACCCGCGCA TGATCGACAA GTCCTACGGC AAGGTCGTCT GGCGCAGCCT GCCGCCCATG CGCCGCACGC GCGCCGAAGC CGACGCGGTC GCCTTCCTCG AGAGCCTGCC GCCACCAAGG CAAGCCGGCT GA
|
Protein sequence | MSDPAPIFAD DGPLAAAIPG FRARPQQIEM AQKIAEALRE NRVLVAEAGT GTGKTFAYLV PALLAGGKAI ISTGTKTLQD QLFNRDLPTV RAALKVPVTI ALLKGRANYV CHYHLERNAR DGRFLTAQDA ADLRAITRFA RLTQTGDKAE CTDVREDSLA WLAATSTRDN CLGQDCPHKD ECFVMQARRN AMEADVVVVN HHLFFADVML RDEGMGELLP ACNAVIFDEA HQLPETASLF FGDSVSTAQV LELARDTRSE TVAAARDCVA MIDQTRNLEK AARDLRLVFG PESARLSAAQ AGEHENFDVM VEALEKALAD FHAVLATQAE RSEGLGNCLR RTEEMSERLA SWRKPEDKEL IRWVEVFTQS LALNATPLHV SDVFKRQLEG HPRAWIFTSA TLAVGKADFG HYCRELGLAW MDPPPLTAVW GSPFDYAEQA LLYAPAGMPE PNSPDYTERV AKVALPLIRA ARGRAFVLCT SLRAMRRIHE LILDGLAQSG DALPVLLQGE GSRTELLERF RRLGNAVLVA SQSFWEGVDV PGDALSLVVI DKLPFAPPDD PVLAARVEHM QKQGLSPFVH HQLPKTVINM KQGAGRLIRS ERDRGVLCIC DPRMIDKSYG KVVWRSLPPM RRTRAEADAV AFLESLPPPR QAG
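The GC content reported in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base blocks used above; `gc_fraction` is a hypothetical helper name. It is demonstrated only on the first two blocks of the gene sequence, so the printed value describes that 20-base fragment and not the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGTCCGACC CCGCCCCGAT"
print(gc_fraction(first_blocks))  # prints: 0.7
```

Passing the full gene sequence string to the same function would give the fraction to compare against the reported GC content.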