Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3622 |
Symbol | |
ID | 7873127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3978024 |
End bp | 3979988 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700562 |
Product | hypothetical protein |
Protein accession | YP_002890592 |
Protein GI | 237654278 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.300251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC TCACCGAACA GTTTTCCGCC TACAGCCGCT GGCGCGGCCA CGCCAGCGAC GCGGTGGCCC GCCTGCGCGC GTGGCTGGCG CGCAACGACG TGGGCAACGC CCAGGCGGAC ATGCGCCTGC AATACCTGCT CGACCGCCTG CGCGACGACA AGCTCACGGT GGCCTTCGTC GCCGAGTTCT CGCGTGGCAA ATCCGAGCTC ATCAACGCGA TTTTCTTCGC CGACCACGGC GACCGCATCC TGCCCTCCAG CGCGGGCCGC ACCACGATGT GCCCGACCGA GCTGCAGTGG CAGAAGGGGG CGCAGCCCGA GCTGCGTCTG CTGCCGATCC GCAGCCGCGC GCGCCAGACG CCGGTGAGCG AGCTCAAGCG CTATCCAGAA GAATGGACCG TGATCGCGCT CGACGCGGTC GACGCCCCCG GCCTGCAGGC CGCGCTCGCG CGCGTGGGCG AGACCGAGCG GGTCGAGGTC GCCGAAGCGC AGGCGCTCGG CTTCCGCGTC GATGCCGCAG GGGACGACGG CATCCGTCCC GATGCCGAGG GTCGCATCGA GATCCCGCGC TGGCGCCATG CCGTGATCCA GTTCCCGCAT CCCCTGCTCG AGCAGGGCCT GGTGGTGCTC GACACGCCGG GCCTCAATGC GATCGGCGCC GAGCCCGAGC TCACCCTGTC GATGCTGCCG AATGCGCACG CGGTGCTGTT CATCCTCGCC GCCGACACCG GCGTCACCCA GAGCGATCTC GCGGTGTGGC GCAACTACGT GCAGGCTGGC AGCGAGCGCC AGCGCGGGCG CCTGGTCGTG CTCAACAAGA TCGACGGGCT GTGGGACGGG CTGCGCGACG AAGCGCGCAT CGAAGCCGAG ATCGGCCGCC AGGTCGACTC GGTCGCCGCG ACCCTCGGCG TCGAGCCCGC CGCGGTGTTC CCGGTGTCGG CGCAGAAGGG CCTGGTGGCG CGCATCCAGG GCGACGAGGC CCTGCTCGCG CGCAGCCGTG TGCCACAGCT CGAGCGTGCG CTCGCCGAGG GCCTGCTGCC CGCCAAGCAG GAGATCGTGC GCGAGGACAT CCTCGCCGAG TCCGCCGAGG TCGCCGCCGC CACGCGCAGC CTGCTCGAGG CGCGCCTGGC CGGCTTGCGC GAACAGCTCC AGGAACTTAC CGACCTGCGC GGCAAGAACC AGAGCGTCAT CGAATACATG ATGCGCAAGA TCCGCAGCGA GAAGGAGGAG TTCGAGCAGG GTCTGCAGAA GTACTACGCG GTGCGTACGG TGTTCACCGA CCTCGCCAAC AACCTGTTCG CGCACATGGG CCTGGACGCG GTGCGCGACG AGACGCGGCG CACGCGCGAG GCGATGCTCG AATCCAGCTT CACGCGCGGG CTGCGCAACG CCATGCAGGG CTATTTCCGC AGCCTGCGTG GCAACCTGCA GCGCTCGACC GAGGAGATCG GCGAGATCAC CCGCATGCTC GACGCCATGT ACAAGCGCTT CAGCGTCGAG CACGGCCTCA AGCTCACCGC ACCCGAAGGC TTCTCGACGC TGCGCTACGA GAAGGAGCTC GAGCGCCTGG AGACGGCCTT CAACCGCCAG ATCAACACCG CGCTGACCCT GGTCACCACC GAGAAGCACG CGCTCACGCA GAAGTTCTTC GAGACCGTCG CGGTGCAGGC GCGCCAGACC TTCGAGCTCG CCAACCGCGA CGTCGAGCAG TGGCTGCGCG CGGTGATGTC GCCGCTGGAG ACCCAGGTGC GCGAGTACCA GATCCAGCTC AAGCGCCGGC TCGAGAGCGT CAAGCGCATC CACGAGGCCA CCGACACGCT GGAGAGCCGC GTGGAGGAGT TGATGCAGGG CGAGGAGGCG CTGCGCGCGC TGCTGGACGA GCTCGACAAC CTCGAGGCCG CGGTCGCCGA CGCGCTCGAC GTGCCCGCCA ACACGCCTGT GACGGGCGTG CGTGCGGCCG CCTGA
|
Protein sequence | MTDLTEQFSA YSRWRGHASD AVARLRAWLA RNDVGNAQAD MRLQYLLDRL RDDKLTVAFV AEFSRGKSEL INAIFFADHG DRILPSSAGR TTMCPTELQW QKGAQPELRL LPIRSRARQT PVSELKRYPE EWTVIALDAV DAPGLQAALA RVGETERVEV AEAQALGFRV DAAGDDGIRP DAEGRIEIPR WRHAVIQFPH PLLEQGLVVL DTPGLNAIGA EPELTLSMLP NAHAVLFILA ADTGVTQSDL AVWRNYVQAG SERQRGRLVV LNKIDGLWDG LRDEARIEAE IGRQVDSVAA TLGVEPAAVF PVSAQKGLVA RIQGDEALLA RSRVPQLERA LAEGLLPAKQ EIVREDILAE SAEVAAATRS LLEARLAGLR EQLQELTDLR GKNQSVIEYM MRKIRSEKEE FEQGLQKYYA VRTVFTDLAN NLFAHMGLDA VRDETRRTRE AMLESSFTRG LRNAMQGYFR SLRGNLQRST EEIGEITRML DAMYKRFSVE HGLKLTAPEG FSTLRYEKEL ERLETAFNRQ INTALTLVTT EKHALTQKFF ETVAVQARQT FELANRDVEQ WLRAVMSPLE TQVREYQIQL KRRLESVKRI HEATDTLESR VEELMQGEEA LRALLDELDN LEAAVADALD VPANTPVTGV RAAA
|
| |