Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2056 |
Symbol | |
ID | 7085326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2327561 |
End bp | 2329588 |
Gene Length | 2028 bp |
Protein Length | 675 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643699080 |
Product | hypothetical protein |
Protein accession | YP_002355697 |
Protein GI | 217970463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.12326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGC GTGAAAATCC ACTAGGCAGA CAATTGCCCT GTCTGCTGAG AACTGGGAAT GAAGATCTAG ATGTTTGGCC CACTCCAGAT GATGGAGTCC TCACAGAGTC CGATCGAAAA AAATATCTGA TGAGAAAAGA AGCCGTAAGA CTATACCTCG CAGGCTACGG CGATAGCTTA ATTCGAGAAA AATGCGAGAT CGGACTTAAA CAGACTTACC GATTGATCAC TGAGCGATGC CTTAAGACCC ATCCCGATGG TCGCATATAT GGATTCCGTG GGTTAATACC GAAACTAAGA ATCAAACCTT ACAAGCGAAA GCGAAAAGTA AAAGTCAACG AATTTGGTGG AGGTGCTGTC GGTGCGATGG GCCTGGTTCT TGATTTGAAT CCAGATCTAC GTAGATCCTT CGATAAGAAA ATATTGGAGT CCGCCAGTTC AAGCACCCTA TCTCTGATAA GAAAGCCGCG CCAAGCTCTA TGGAAGTGGT TCTTAGATCG CCTGCGAGAC TTAGGCTATG AGGTTAGGGG CGAGTGGCCG TTCAATACAA TCACTAATGG ATATAATTCC GTATGCCGAT ACATCGATCT TGTTTACAAA TTAAATCCAA AAATGGCCGC GCGAGCCATT GGTGGCAAAA CCCTTGAAAG AAAGTTGATT ACCGGGGATG GCGTGGACCG TCCCGTAAAG AGGGTGCTGC AGCGGGTTGA GATGGACGCC CATAAACTAG ATGGGCGATT CTGTGTATTG ATGCCCAATG GTGATGGCGA TCATGTGCCG CGAATCGTCC ACAGGTTATG GGTGATCGTA ATTTTGGATG TTGAATCACG AGCCGTTCTG GGATACCACT TTGCATTAGG AAAAGAGGTT TCTAAGACTG ACGTCCTTCG AGCGATTAAA TCTGCTCTGA CTCGATGGCG ACGACGAATA TTATCTCCCG GCCTCCAACA TATCTATTGG GATAATGCTA ATTTTCCTTC AGGGATGTCG GACAGATATG TTGGCATGTG CTGGGACGAA ACGAGCGTTG ACGGCGCTCT CGCCGAGACG TGCAAAAGGG TAGAACAGTT TCTAAGGGAC GTCGTAAGTT CGAGATTAAT TTCGCCAAAG GAGGGATTTG CATCCCGGCG AGGCCTTGAT GAGCGCCCAT TTATTGAAAC ATTCTTTCGA GTTCTTGCAG GCGGGGGGTT GCACAAAATG TCAAATACCA CAGGCTCAAA ACCCAAGGAT CGTAAGGGGC GGGAGCCGGA GGAGGTTGCA TTGAATAGTC AGTTTCAACT TGAATATGCG GAGGACTTAA TTGATGTCTT GATTGCGAAT TATAATGCAA CGCCCCATAC CAGTCTTGGG TATCGCACTC CATTGCAGTT TCTCGAGTAT GCAACTAATC GCCCTGACTT CTCTTTTCGA TACGCAGACC CCGAGCAGGT TAGCCTCCTG CTAAGCTTAA GAAAAAAATG CAAAGTGCAT GGCGGCGTCC AAGAAGGAAG AGCGCCTTAT GTGAATTTCG AGAACGCCAG GTATACAAAC GAGATACTTT CGCAGCGTTT TGACCTAGCG GGGAAATATA TAGACGTGAT CAATCATGAA GAGAATGACG CAAGAATCGC ACTAGCCTCC ACGTCGGACG GACAAAGCCT CGGCGTTCTT CGTGCTGGTC CGCCATGGCA TAGATTCCCC CATAGCTTGC GAATTAGGGC CGCCATTAAA GGCATGATAA GGAAGAAAAT GTTCTACGTG GCATCAGGCA CCGATGCGAC CGAGGCGTTT GTGGAATATT GCGAGTCTCA GTCCAATAAA AAACTTCCGC CACATCCATC TTACCTCGAA TTTCGCAGAA TAATCGCGAA CTCGGAGCGG AACGAATCCG AAGAACTCCT CAGGGTTGCG CTAGATACCA TTAGCTCGAG CGATGAGGCG GAAATTGCTC GAAGAGATCA AGCCATCGGT GGAAAGGATT CTGATAGTGA GGTGAATGAG TGCTCTCAAG GAAGTTCCCC TCGACGCCGA AAAGCAGCAT CGAGTTAA
|
Protein sequence | MKTRENPLGR QLPCLLRTGN EDLDVWPTPD DGVLTESDRK KYLMRKEAVR LYLAGYGDSL IREKCEIGLK QTYRLITERC LKTHPDGRIY GFRGLIPKLR IKPYKRKRKV KVNEFGGGAV GAMGLVLDLN PDLRRSFDKK ILESASSSTL SLIRKPRQAL WKWFLDRLRD LGYEVRGEWP FNTITNGYNS VCRYIDLVYK LNPKMAARAI GGKTLERKLI TGDGVDRPVK RVLQRVEMDA HKLDGRFCVL MPNGDGDHVP RIVHRLWVIV ILDVESRAVL GYHFALGKEV SKTDVLRAIK SALTRWRRRI LSPGLQHIYW DNANFPSGMS DRYVGMCWDE TSVDGALAET CKRVEQFLRD VVSSRLISPK EGFASRRGLD ERPFIETFFR VLAGGGLHKM SNTTGSKPKD RKGREPEEVA LNSQFQLEYA EDLIDVLIAN YNATPHTSLG YRTPLQFLEY ATNRPDFSFR YADPEQVSLL LSLRKKCKVH GGVQEGRAPY VNFENARYTN EILSQRFDLA GKYIDVINHE ENDARIALAS TSDGQSLGVL RAGPPWHRFP HSLRIRAAIK GMIRKKMFYV ASGTDATEAF VEYCESQSNK KLPPHPSYLE FRRIIANSER NESEELLRVA LDTISSSDEA EIARRDQAIG GKDSDSEVNE CSQGSSPRRR KAASS
|
| |