Gene Information |
Locus tag | Tmz1t_3577 |
Symbol | |
ID | 7873082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3919652 |
End bp | 3921835 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700517 |
Product | protein of unknown function DUF323 |
Protein accession | YP_002890547 |
Protein GI | 237654233 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
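The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, although the reported numbers are consistent with them. The function names are illustrative and do not come from any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3919652
END_BP = 3921835
REPORTED_GENE_LENGTH = 2184      # bp
REPORTED_PROTEIN_LENGTH = 727    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    n_aa = protein_length(n_bp)
    print("gene length matches:", n_bp == REPORTED_GENE_LENGTH)
    print("protein length matches:", n_aa == REPORTED_PROTEIN_LENGTH)
```

The strand field (`-`) does not affect these lengths. It only indicates that the gene is read from the reverse strand of the replicon.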
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.296354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCCG CATCGCAAGC GACCCCTGGC GACCGTCCTT CCCCCGCTTC CTCTTCCGCC GGCAGCCCTG CGCCCCTCTT CCCGCGCACC CCGCTGCTCT CCGGCACCGA TGTCGAGGCC AAGCGCGCCG AGCTGCTGCA TTACTTCCAC GCCACCTTCG ACCGCTACGA GTCGCTGTTC GAGGTGCTCA CCTGCGACGA GGCCTACTAC CGGAAACCGA TCAGCCTGCG CCACCCGCTG ATCTTCTACT TCGGCCACAC CGCCACCTTC TTCATCAACA AGTTCATGCT CGCCGGCCTG ATCGAGCAGC GCATCGACCC GCGCCTGGAA TCCATGTTCG CCGTCGGCGT CGACGAGATG AGCTGGGACG ACCTCGACGA CGCGCGCTAC GACTGGCCGA GCGTGGCCGA AGTGGCCGCC TACCGGGGCA AGGTGCGCGC GGTGGTCGAG CGCGTGATCC GCGACACGCC CTTCACCTTG CCGATCGGCT GGAAGGACCC GTTCTGGGCG GTGGTGATGG GCATCGAGCA CGAGCGCATC CACCTCGAGA CCTCCTCGGT GCTGATGCGC CAGCACGCCC TGCAGTACGT GAAGCCGCAT CCCGCCTGGC AGCCCTGCGC CGAGCATGGC GAGCCGCCCG CCAACGACCT GGTCGACATC CCCGCGGGCA CGGTCACGCT CGGCCGCAGC TTCGACGAGC CGATCTACGG CTGGGACAAC GAGTACGGCA CGCACCGCGC CGAGGTGCCG GCCTTCCAGG CCGCCCGCCA TCTGGTGACC AATCGCGAAT ACCTCGGCTT CGTCGAGGCC GGCGGCTACG CCGACGACAG CGTCTGGGAC GAGGAGGGCC TGGGCTGGCG CAGGTTCGCC CGCGCCGAGC ATCCGACCTT CTGGGTCCCC GACGCCGCGG GCTGGAAGCT GCGCCTGATG ACCGAGGAAG TGCCGATGCG CTGGGACTGG CCGGTGGAGG TCAACTGTCT GGAGGCGCGC GCCTTCTGCC GCTGGAAAGC GCGCGAGACC GGCCAGCCGG TGCGCCTGCC CACCGAGGAC GAGTGGAACC GCATCCACGA CCACGCGGGC CTCGCCGATG TGCCGCACGA CGCCCCCGCC AACGCCAACC TGCATCTGGA CCACTGGGCC TCGAGCTGTC CGGTGACGCG CTTCGCCCAC GGCGAGCTCT TCGACGTGGT CGGCAACGTC TGGCAATGGT GCGAGACCCC GACCTATCCC TTCGACGGCT TCGACGTGCA CCCGATCTAC GACGACTTCA CCACGCCGAC CTTCGATGAC CGCCATGCCA TCATCAAGGG CGGCAGCTGG ATTTCCGCCG GCAACGAGTC GCGCCACGCC TCGCGCTACG CCTTCCGCCG CCATTTCTTC CAGCACGCGG GCTTGCGCTA CGTCGTCAGC GAGGCGCCGG TTATCAACCC GGCCTCGAGC TACGAGACCG ACACCCTGCT GTCGCAGTAC GCCGAGTTCC ACTACGGCGA CGAGGCCTTC GGCGTGCCCA ACTTCCCGAA GGCGCTCGCC GACATCGCGA TCGCGGCCCA GCGCCGCTTC GGCAAGGGCC GCTTCGGCCG CGCGCTCGAC CTCGGCTGCG CCACCGGTCG CGCCAGCTTC GAGCTGGCGC GCGCCTTCGA GCGCGTGGTC GGCATCGACT TCTCGGCGCG CTTCATCCAG GCTGGCGTCA AGCTCGCCGA GACCGGCGTG CTGCGCTACA CGATGCCCGA CGAGGGCGAG CTGGTCACCT ACCACGAGCG CCGGCTCGAC GCCCTCGGGC TGGCCGGGAC CGCCGGCCGC 
GTCGAGTTCT GGCAAGGCGA CGCGTGCAAC CTCAAGGACG TGTTCACCGG CTTCGACCTC ATCCTCGCCG CCAACCTGAT CGACCGCCTG TACAGCCCGC GCCGCTTCCT CGCCGACGTG CCGCGGCGGC TCAAGCCCGG CGGCCTGCTG CTGCTCGCCT CGCCCTACAC CTGGCTGGAA GAGCACACCA AGCGCGAGGA GTGGATCGGC GGCTTCAAGA AGGATGGCGA GTCGTACACG ACCCTCGACG GGCTCAAGGA CCTGCTCGCC ACCGACTTCG AGCTGGTGCA GGGGCCGCAG GCGGTGCCCT TCGTGATCCG CGAGACACGG CGCAAGCACC AGCACACGCT GTCGGAGCTG ACCGTCTGGA GGAAGCGCTC CTGA
|
Protein sequence | MNPASQATPG DRPSPASSSA GSPAPLFPRT PLLSGTDVEA KRAELLHYFH ATFDRYESLF EVLTCDEAYY RKPISLRHPL IFYFGHTATF FINKFMLAGL IEQRIDPRLE SMFAVGVDEM SWDDLDDARY DWPSVAEVAA YRGKVRAVVE RVIRDTPFTL PIGWKDPFWA VVMGIEHERI HLETSSVLMR QHALQYVKPH PAWQPCAEHG EPPANDLVDI PAGTVTLGRS FDEPIYGWDN EYGTHRAEVP AFQAARHLVT NREYLGFVEA GGYADDSVWD EEGLGWRRFA RAEHPTFWVP DAAGWKLRLM TEEVPMRWDW PVEVNCLEAR AFCRWKARET GQPVRLPTED EWNRIHDHAG LADVPHDAPA NANLHLDHWA SSCPVTRFAH GELFDVVGNV WQWCETPTYP FDGFDVHPIY DDFTTPTFDD RHAIIKGGSW ISAGNESRHA SRYAFRRHFF QHAGLRYVVS EAPVINPASS YETDTLLSQY AEFHYGDEAF GVPNFPKALA DIAIAAQRRF GKGRFGRALD LGCATGRASF ELARAFERVV GIDFSARFIQ AGVKLAETGV LRYTMPDEGE LVTYHERRLD ALGLAGTAGR VEFWQGDACN LKDVFTGFDL ILAANLIDRL YSPRRFLADV PRRLKPGGLL LLASPYTWLE EHTKREEWIG GFKKDGESYT TLDGLKDLLA TDFELVQGPQ AVPFVIRETR RKHQHTLSEL TVWRKRS
|
| |
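The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before their length or GC content can be compared with the reported fields. The following is a minimal sketch of that cleanup, with hypothetical helper names. To stay short it uses only the first two blocks of the gene sequence as input, so its result is not the 69% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record (illustration only).
    prefix = clean_sequence("ATGAACCCCG CATCGCAAGC")
    print(len(prefix), "bases in prefix")
    print("GC fraction of prefix:", gc_fraction(prefix))
```

Applying `clean_sequence` to the full gene and protein sequences would allow `len()` of each to be compared with the Gene Length and Protein Length fields.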