Gene Information |
Locus tag | Tmz1t_2039 |
Symbol | |
ID | 7083799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2299831 |
End bp | 2302218 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699066 |
Product | RNA binding S1 domain protein |
Protein accession | YP_002355683 |
Protein GI | 217970449 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
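The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2299831, 2302218)
print(length_bp)                  # 2388
print(protein_length(length_bp))  # 795
```

Both results agree with the Gene Length (2388 bp) and Protein Length (795 aa) fields.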
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.883528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCCCC CGATCGAATA CCGCATCGCC GAAGAGCTCG GCGTCTCGCC GCGCCAGGTC ATCGCCGCCG TCCAGCTCAT CGACGATGGC GCCACCGTGC CCTTCATCGC CCGCTACCGC AAGGAAGTCA CCGGTGGCCT GGACGACACG CAGTTGCGCA CGCTGGAGGA GCGCCTGGGC TACCTGCGCG AGCTCGAAGA CCGCCGCGCC ACCGTGCTGG CCTCGATCGA GGAGCAGGGC AAGCTCACCG CCGAGCTGCG TGCCGAGGTC GAGGACGCCG ACACCAAGCA GCGCCTCGAG GACCTCTACC TTCCCTACAA GCCCAAGCGC CGCACCAAGG CACAGATCGC GCGCGAGGCC GGCATCGGCC CGCTGGCCGA GACATTGCTC GCCGACCCGA CCCTGACGCC CGAAGTCGAA GCCGAAAAGT ACATCAACGC CGAGGCCGGT TTTGCCGACA CCAAGGCGGT GCTCGACGGC GCGCGCCAGA TCCTGATGGA GCACTTCGCC GAGGACGCCA CCCTGCTCGG CGAGCTGCGC AGCTACCTGC ACGAACATGG CCGGGTGCGC TCGAGCGTGA TCGAGGGCAA GGAGAACGAA GGCGCCAAGT TCCGCGACTG GTTCGACTTC GCCGAACCGA TCGCCACCAT GCCCTCGCAC CGTGCGCTGG CGCTGCTGCG CGGGCGCAAC GAAGGCGTGC TGCGCCTGGC ACTCGACGTC GAGCGCCCCG ACCCCGACGC CCCGCATCCC TGCGAAGGCC GCATCGCCGT GCGCTTCGGC ATCCGCAGCC AGGGCCGCCC GGGCGACCGC TGGCTCGCCG AGACCGCGCG CTGGGCGTGG AGCGTGAAGA TCTCCATCCA CCTCGAACTC GAGCTGATGA GCGAGCTGCG CGAGCGCGCC GAGGAAGAGG CGATCCGCGT CTTCGCGCGC AACCTCCACG ACCTGCTGCT CGCCGCCCCC GCCGGCACGC GCGCCACCAT CGGCCTCGAC CCCGGAATCC GCACCGGAGT GAAGGTCGCG GTGGTGGATG CCACCGGCAA GCTGGTCGAC ACCGCCACCG TGTATCCCTT CGAGCCGCGC CGCGACCGCG AATCGTCGAT CGCCACCATC GCCGCGCTGG CGAAGAAGCA TGCGGTCGAG CTGATCGCCA TCGGCAACGG CACCGCCTCG CGCGAGACCG ACGCGCTGGT GGCCGAGCTG ATGAAGCGCT ACCCGGAGCT GCAGCTGACC AAGATCGTCG TCTCGGAGGC CGGCGCCTCG GTGTATTCCG CGTCCGAACT CGCCGCACGC GAGTTCCCCG AGCTCGACGT CAGCCTGCGC GGCGCGGTTT CGATCGCCCG CCGCCTGCAG GATCCGCTCG CCGAGCTGGT CAAGATCGAT CCCAAGTCGA TCGGCGTCGG CCAGTACCAG CACGACGTCA ACCAGGGCCG CCTGGCGAAG AGCCTGGACG CCGTGGTGGA AGACTGCGTG AACGCGGTCG GCGTGGATGT GAACACCGCG TCTGCACCGC TGCTCGCACG CATCTCGGGT CTGAACGCCA CGCTCGCCGG CAACATCGTC GAGTATCGCA ATACCCGGGG CCCCTTCCGC AGCCGCAGCG CATTGAAGGA CGTGCCCCGC CTCGGCCCCA AGACCTTCGA GCAGGCCGCG GGCTTCCTGC GCATCCCCAG CGGCGACAAC CCGCTCGACG CCTCCTCGGT GCACCCCGAG GCCTACCCGG TGGTCGAGCG CATCCTCGCG CGCGTGCGGA AGAGCGTGCG CGAACTGATG GGCAACGGCG GCTTCCTCAA GAGCCTGAAA 
CCGGAGGAGT TCACCGACGA GCGCTTCGGG CTGCCCACCG TGCAGGACAT CCTCGCCGAA CTGGAAAAGC CCGGCCGCGA CCCGCGCCCC GAGTTCCGCA CCGCGACCTT CCGCGAGGGC GTGGAGACCT TGAAGGACCT CGAGCCCGGC ATGCTGCTCG AAGGGGTGGT CACCAACGTC ACCAACTTCG GCGCCTTCGT CGACATCGGC GTGCATCAGG ACGGCCTGGT GCATATCTCC GCGCTGTCCA ACACCTTCGT CAAGGACCCG CACAGCGTGG TCAAGGCGGG TCAGGTGGTG AAGGTGAAGG TGCTGGAGGT GGACATCCCG CGCCACCGCA TCGCGCTCAC CATGCGCCTG GGCGACGAGG TGACTGCGAA GCGCGGCGAG TCCGGCGCCC GGCGCGACGA GGGTGCGCCG CGGCGCGGGG AAGGCGGTCA GGCCCCGCGT AGCGAGCAGC GCCCGCGCGG CGGTGACAAC CGGCGCCCGC CCGCGCAGGG CAAGGGTGGC GCGCGCCAGG AGGCTTCCCG CCCCGCCGCT GCCAACGCAC TCGCCGAAGC ATTCGCCCGC GCCCGCGGCA ATCGCTGA
|
Protein sequence | MLPPIEYRIA EELGVSPRQV IAAVQLIDDG ATVPFIARYR KEVTGGLDDT QLRTLEERLG YLRELEDRRA TVLASIEEQG KLTAELRAEV EDADTKQRLE DLYLPYKPKR RTKAQIAREA GIGPLAETLL ADPTLTPEVE AEKYINAEAG FADTKAVLDG ARQILMEHFA EDATLLGELR SYLHEHGRVR SSVIEGKENE GAKFRDWFDF AEPIATMPSH RALALLRGRN EGVLRLALDV ERPDPDAPHP CEGRIAVRFG IRSQGRPGDR WLAETARWAW SVKISIHLEL ELMSELRERA EEEAIRVFAR NLHDLLLAAP AGTRATIGLD PGIRTGVKVA VVDATGKLVD TATVYPFEPR RDRESSIATI AALAKKHAVE LIAIGNGTAS RETDALVAEL MKRYPELQLT KIVVSEAGAS VYSASELAAR EFPELDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDVNQGRLAK SLDAVVEDCV NAVGVDVNTA SAPLLARISG LNATLAGNIV EYRNTRGPFR SRSALKDVPR LGPKTFEQAA GFLRIPSGDN PLDASSVHPE AYPVVERILA RVRKSVRELM GNGGFLKSLK PEEFTDERFG LPTVQDILAE LEKPGRDPRP EFRTATFREG VETLKDLEPG MLLEGVVTNV TNFGAFVDIG VHQDGLVHIS ALSNTFVKDP HSVVKAGQVV KVKVLEVDIP RHRIALTMRL GDEVTAKRGE SGARRDEGAP RRGEGGQAPR SEQRPRGGDN RRPPAQGKGG ARQEASRPAA ANALAEAFAR ARGNR
|
| |
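The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly to check it against the protein sequence. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares with table 1 for internal codons; the helper names `clean`, `translate`, and `gc_fraction` are illustrative only:

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGCTCCCCC CGATCGAATA CCGCATCGCC"))  # MLPPIEYRIA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.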