Gene Information |
Locus tag | Tmz1t_0174 |
Symbol | |
ID | 7085271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 203028 |
End bp | 205391 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643697216 |
Product | hypothetical protein |
Protein accession | YP_002353865 |
Protein GI | 217968631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
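The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS coordinates include the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 203028
end_bp = 205391
gene_length_bp = 2364
protein_length_aa = 787


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the assumptions above match the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.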
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCCG GCGCGCCCGC CCCCGCCGAC GACGCCTTCG ACGCGGTCTT CGCGATCCGC CTGCCGCCCC GCCCCTATCC CGGCCTGCGC CCCTTCGAGC AACATGAGTG GCCGATCTTC TTCGGCCGCG AGCGCATGAC CGACGAGATC GTCGACCGCC TGCTCGGCCA TCGCCTGCTC GTGGTCCATG GCGACTCGGG CTGCGGCAAG AGCTCGCTGG TGCGCGCCGG CGTGCTGCCG CGGCTGGAGC AGGAGAGCGC GCGCGGCGGC GTGCGCTGGC GTACCTGCAC CACACTGCCG CGCGCTGCGC CGCTGTGGAA CCTCGCGCGT GCGCTCGCCG CGCTCGACAC GCCCGGCGGC GACGGGCACG GGCTGGAGCT GCGCACGATC GCCTTCCGCC GCGCGCTCAA CTTCGGCCGC GAAGCCCCGG CGGCGCTCGC CGAGCTGCTC GGCTGCGGCC CGCGCAGCCA GGTCTGCATC CTCATCGACC AGTTCGAGGA GCTCTTCGAG CACGCCCGCC GCCATGGTGC GGAAGAAGCG ACCCTGCTGG CCGCCTGCCT GGTCGGCCTG CTCGATGCGC CGCCGGCCGG GCTGTACGCG GTGCTGACCA TGCGCTCGGA GTTTCTCGGC GCATGCTCGC GCTACGAGGG CTTCGCCGAG GCGGTCAATC GCACGCAATA CCTGCTGCCG CGCATGGAGC ACGACGACCT CATGCGGGCG ATCCGCGAGC CGGCGGTGCT CTACGACGGA GAGGTGACGC GCGAGCTCGC CGAGCGCCTG ATCGTGGACG GCGGCGGCGG ACAGGACCAG CTGCCGCTGA TCCAGCACGG GCTGATGCTG CTCCACGACG AGCGTGCGCG TGCCGCGGGC CTGGCCCTTG CCGGACCGAC CCCGGGCGCG CCGGCGTGGC GGCTCGGGGT GGAGCATTAC CACGCCGAGC ACGGCCTCGC CGGCCTGCTC TCGGCACACG CCGACGCCGT GCAGGCGCGC GCCGAGCAGC AATGCCTCGG CGGCGAGGCG ACGCGGGTGG TGGAAGACCT CTTCCGCGCG CTCACCGACA TCAACGCCGA GGGCCAGGCC ATCCGCCGGC CCTGCCCGCT CGCACGCCTG GTGGCGGTGA CCGGCGCCGA GGAGTCCGTA CTGCGCTGCG TGGTGGACAC TTTCCGTGCC GACGGCGTGT CCTTCCTCGA GCCCTACGGG CACGAGGCGC TCCCCGCCGA CGAGCTCATC GACATCAGCC ATGAGGCCCT GATCCGCTGC TGGCGGCGCA TCGCCGAGCC GCGCGAGGGC TGGCTGGCGC GCGAGTTCCG CAACGGCCTG GTGTGGCGCG CGCTGCTGGT GCAGGCCGAC AGCTTCGAGC GCGACCCGGG CAACGTGCTC GGCGCCACCA CCACCGACGA GCGCGAACGC TGGCTGCGCC GGCGCAACGC CGCCTGGGCG GAACGCTACG GTGGCGGCTG GGAGCGCGTG CAGCGCCTGA TCGCCGCCAG CGTCGAGGCG CGCGCCCTGC GCGCGGCCGA GCGCGAGGTC GCCGAAAAGC AGCGCCAGGA GGCACGCCGC CTGCGCCGGC GCAGCCGCGC GCTCGCCGTG GGCCTCAGCG TGCTCGCCTT CACCAGCGCG CTTGGCGGAC TCGCCTGGCA GCAGTCGCGG ATCGCCGAGG CGGCGCGCCG GTTCGCCGAA GGCGAGCGCG AGCTCGCCCT CACCGCACGC GACGAGGTGG TGCGCCAGCT CGAGGCCCTG GTCGAGGCGC GCGAGGCCGA GGCCGCCGCA CGCGCTGCGG CCGAGGCGGT GGCGCAGGAG 
GCGCGGCAAT CCGCCACCGC GCTCGCCAGC GTGGTCGACG AACTCGAACG CGCCACCACC GCCGCGAACG CCCCTGCGAG CGGCGACGGA CAGGCTGTGC AGCGCAGCCT CACCGAGGCC AAGTCGGAGC TGCGCGCCCA GGTGAGCAAT CTCTCCAATC TCTCGGCGCA GCAGGCCGCG CCGCCGGTCT CCGCGGCGCC CGCTCCGGCG TCCGGCGGCG TGGCACCGCG GCTGTGGATC TACATCGCCG AGGAGGGCCA GCGCCCGACC GCCGAGGCGC TGGAGGCGCG CCTGCGCAGT GTGCGCATCG GCGACGCCGC GCTCGAGCTC CCCGGCATCG TGCTGGTCAA GGCCGCCCCC GCGCGCAGCA TGCTGCGCTG CTTCCGTACC GAGGAATGCC GCCAGGACGG CGAGGCCCTC GCGCGTGCGC TCGCCGCGCT GCTGCAGTCG CCCACGCCGA GCCTGGAGGA TTTCAGCGCG CTCTACGGCA GCAGCGGTTC GGTCCGCGCA CGCCATTACG AGCTCTGGTT CGCCCCCGGG GCGATCGTGC TGTCGGCACG CTAG
|
Protein sequence | MNPGAPAPAD DAFDAVFAIR LPPRPYPGLR PFEQHEWPIF FGRERMTDEI VDRLLGHRLL VVHGDSGCGK SSLVRAGVLP RLEQESARGG VRWRTCTTLP RAAPLWNLAR ALAALDTPGG DGHGLELRTI AFRRALNFGR EAPAALAELL GCGPRSQVCI LIDQFEELFE HARRHGAEEA TLLAACLVGL LDAPPAGLYA VLTMRSEFLG ACSRYEGFAE AVNRTQYLLP RMEHDDLMRA IREPAVLYDG EVTRELAERL IVDGGGGQDQ LPLIQHGLML LHDERARAAG LALAGPTPGA PAWRLGVEHY HAEHGLAGLL SAHADAVQAR AEQQCLGGEA TRVVEDLFRA LTDINAEGQA IRRPCPLARL VAVTGAEESV LRCVVDTFRA DGVSFLEPYG HEALPADELI DISHEALIRC WRRIAEPREG WLAREFRNGL VWRALLVQAD SFERDPGNVL GATTTDERER WLRRRNAAWA ERYGGGWERV QRLIAASVEA RALRAAEREV AEKQRQEARR LRRRSRALAV GLSVLAFTSA LGGLAWQQSR IAEAARRFAE GERELALTAR DEVVRQLEAL VEAREAEAAA RAAAEAVAQE ARQSATALAS VVDELERATT AANAPASGDG QAVQRSLTEA KSELRAQVSN LSNLSAQQAA PPVSAAPAPA SGGVAPRLWI YIAEEGQRPT AEALEARLRS VRIGDAALEL PGIVLVKAAP ARSMLRCFRT EECRQDGEAL ARALAALLQS PTPSLEDFSA LYGSSGSVRA RHYELWFAPG AIVLSAR
|
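As a spot check on the two sequences, the following sketch translates the first three 10-base blocks of the gene sequence and compares the result with the start of the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which NCBI translation table 11 shares with table 1 (the two differ only in permitted start codons). The code also assumes the gene sequence is shown in coding orientation, which its leading ATG and trailing TAG suggest even though the strand is listed as "-".

```python
from itertools import product

# Standard amino acid assignments in NCBI codon order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; stop codons appear as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGAACCCCG GCGCGCCCGC CCCCGCCGAC"
protein_prefix = "MNPGAPAPAD"

print(translate(gene_prefix) == protein_prefix)
print(round(gc_fraction(gene_prefix), 2))
```

Passing the full gene sequence to `gc_fraction` in the same way would allow a comparison with the listed GC content of 75%.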