Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1706 |
Symbol | |
ID | 7084126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1916378 |
End bp | 1918003 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698727 |
Product | NERD domain protein |
Protein accession | YP_002355357 |
Protein GI | 217970123 |
COG category | [R] General function prediction only |
COG ID | [COG3972] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.516506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAAC TCTTCCCAAC GATCGACGCC CTTCGCCACG ACGGCGGCCT GTATCGCGAG CTCGATGTCC TCGAACGGCT GCAGCAGTCG CTTCCCGACG GCTATGAGAT CTTCCACAGC GTCGCCTGGC AGACCGCGCA TCACGGTGAG GGCCGGCACG GTGAGATCGA CCTCGTCGTC CTGGCGCCGA GCGGCAACAT CCTCCTGGTC GAGGTCAAGG CCGGCGAGCT TTCGCTGGTC GAGGGCAATC TCGTCAAGCT CTACGGGCAA CGCGAACACG ATGTCGCCCG CCAGACCCGC GTGCAGCATG CGGCCATGCT CAACCGCCTT GCCGAGGCCG GTCTGCATGC GCACGTCACG GGCTGCGTCG TCCTTCCCGA TCATCAGGTG GAAGAGGCCC GCATCGTCTC GATGCCGCGT GAGCGCATCA TCGACGCGAA TGACTACGCG CACCTTGGTA CCCGTGTGCG AGAGCTGCTC GAACAGGGGA GCAGTCGCAG CGACGTGGAA TCGCTCCGCC GCTTCCTGGG CAACGTCTTT AAGGTCTCGA TCGACCTTCA GGTGCTCGGT GACCAGGTGC GCCAGACGAG TCGCCGCCTC GCCGACGGCC TTGCGACCTG GGTGCCGCGC ATCGCCTCGC CGTCCGGCGT CTTACGTATT CAGGCGACGG CAGGCTCCGG CAAGACGCAG CTTGCGCTGC GCCTGCTCGA CGACGCCGCC GCCGCAGGGC AGCGCGCGCT TTACGTCTGC TTCAACCGCA CGCTGGCAGA CCACATCGGC CGCATCGCGC CGGCGCGTGC GCGGGTGTCG AGCTTTCATG AGCTCTGCGT GGAGCACTGG CGGCGCACGC AGGGTGAGCC GGACTTCACC GCCGAGGGCA TCTTCCAGGC GGTCGTCGAG CGCTACGGCA GCGACGCGCA GGACTTCGAG GCGCTGTACG ACCTCGTGAT CATCGACGAA GGGCAGGACT TCGACCCCGC CTGGGTCGTG AGCCTGCTGC CGCAATTGAA GGAAGATGGG CGGCTTTACC TGCTCGAGGA CGAGGCGCAG CGTCTCTACG AGCGGGACGG ATTCGATCTT GACGGCGCCG TCAGCGTGCG CTGCAACGAC AACTTCCGCT CGCCGCGCGC CATTGTCGAT GTGATCAACG CCCTGGGCTT GGCAGGGGGA ACCGTCGAGG CACGTAGCCC CTACGTAGGC GAGCTCCCGA CCTTCCGCGC GTACGACGAC GAGCGTGGCC TGCGCCGCGA GACGCTGGCG GCGGTGGAGG CGTTGCGCGA GCGCGGGATT CCGCTTGACG ACATCGTCGT CCTGAGTGCG CGCGGACACG GGCGTTCTCT CCTGCTCAAG GAGGCGAAGC TGGGGGCGTT CGGGCTGCGG AGGTTTCTGG GGCGGTACAC GGCCGACGGG GAACCGGTGT GGTCCGAGGG CGAGCTCGTG ATCGAGTCCG TGCATCGGTT CAAAGGGCAG AGCGCCATGG GGGTCGTGCT GACGGAGGTG GATTTCGAGC AGTTCGACGA GGCTGTGCGC AGGCGTCTGT TTGTGGGGAT GACGCGGGCG CAGTTGGGGT TGGAGGTGGT GGTGTCACGA GCGGCTGAGG CGGCGCTGAG CGGGGCGCTT GCGTAA
|
Protein sequence | MAQLFPTIDA LRHDGGLYRE LDVLERLQQS LPDGYEIFHS VAWQTAHHGE GRHGEIDLVV LAPSGNILLV EVKAGELSLV EGNLVKLYGQ REHDVARQTR VQHAAMLNRL AEAGLHAHVT GCVVLPDHQV EEARIVSMPR ERIIDANDYA HLGTRVRELL EQGSSRSDVE SLRRFLGNVF KVSIDLQVLG DQVRQTSRRL ADGLATWVPR IASPSGVLRI QATAGSGKTQ LALRLLDDAA AAGQRALYVC FNRTLADHIG RIAPARARVS SFHELCVEHW RRTQGEPDFT AEGIFQAVVE RYGSDAQDFE ALYDLVIIDE GQDFDPAWVV SLLPQLKEDG RLYLLEDEAQ RLYERDGFDL DGAVSVRCND NFRSPRAIVD VINALGLAGG TVEARSPYVG ELPTFRAYDD ERGLRRETLA AVEALRERGI PLDDIVVLSA RGHGRSLLLK EAKLGAFGLR RFLGRYTADG EPVWSEGELV IESVHRFKGQ SAMGVVLTEV DFEQFDEAVR RRLFVGMTRA QLGLEVVVSR AAEAALSGAL A
|
| |