Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0948 |
Symbol | |
ID | 7085051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1038420 |
End bp | 1041308 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697970 |
Product | hypothetical protein |
Protein accession | YP_002354610 |
Protein GI | 217969376 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.711504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTGGT CGCTGCCGTG GTCACGCAAG TCCGACGCAT CGCCTGTTGA CGCCGCGGAT ACGACCGATG ATGCCTGGGC ACGGCACGTG ACGGCCTTGG CGGCACAGGG TGTCGCCGAG CCGGGCAGCG CGCTCGGCCG GGGCCGGCGC AGACCGGCCA CCCAGGCCGA CCACGATGCG CTCTATGGCG TCGCGCCGTC GTTCGCGGAC TTGCTGCCCT GGGTCGAGTA CCTGCCCGGC AGCAAGTGCA TGTTGCTGGA AGACGGCCAG TCGGTGGCGG CCTTCTTCGA ACTGGCGCCG GTCGGCACCG AGGGCCGCGA GATGGCCTGG CTGTGGCAGG CGCGCGATGC GCTGGAGAAC GCCCTGCAGG ACTCCTTCGA CGAGTTGGAC GACAACCCCT GGGTAGTGCA GCTCTACGCC CAGGACGAGG CCGACTGGGA CAACTATCTG CGCTCCCTGG CGAACTATCT GCAGCCGCGT GCGCAGGGCA GCGCGTTCAG CGACTTCTAC CTGCGCTTCT TCGCCCATCA CCTGCGGGCC ATCGCCAAGC CGGGTGGCCT GTTTGAGGAC ACCACGGTGA CGCGCCTGCC GTGGCGCGGC CAGGTGCGGC GCGTGCGCAT GGTGGTCTAC CGCCGCACGT CCGCGGCCAC GACCCCGCGG CGTGGCCAGT CGCCCGAGCA GGCGCTGACC ACGATCTGCG ACCGCCTCGC CGGCGGGCTG GCGAATGCCG GCGTGAAGGC TCGGCGTCTC GGCGCGGCGG ACATCCACGC CTGGCTGCTG CGCTGGTTCA ACCCGAATCC GACCTTGCTC GGCACCACTG CCGAAGATCG AGAACGCTTC TACGCGCTGA CCCGCTACCC GGAAGAGCAG GAGGAGGGCG AGATCGAACT CGCCAGCGGC ACCGACTTCG CGCAGCGCCT GTTCTTCGGC CAGCCCCGTT CGGACGTGCC CAACGGCCTG TGGTTCTTCG ACGGCATGTC GCATCGGGTG ATCGTGATGG ATCGCCTGCG CACGCCACCC GTGACGGGCC ATCTGACGGG CGAGACGCGC AAAGGCGGCG ATGCCATGAA CGCGCTGTTC GACCAGATGC CCGAAGACAC GGTGATGTGC CTGACGCTGG TCGCCACGCC CCAGGACGTG CTGGAGGCAC ACCTCAACCA CCTCGCCAGG AAGGCCGTCG GCGAGACCCT GGCCTCGGAG CAGACCCGGC AGGACGTGCA GCAGGCACGC GGCCTGATCG GCAGCGCGCA CAAGCTCTAC CGCGGCGCGC TGGCGTTCTA CCTGCGCGGC CGCGACCTGG CCCAGCTCGA TGCGCGCGGC CTGCAGCTCG TCAACGTGAT GCTCAACGCC GGCCTGCAAC CGGTACGCGA AGAGGACGAG GTGGCGCCGC TGAACAGCTA TTTGCGCTGG CTGCCGTGCG TGTTCGATCC GGCTTCCGAC AAGCGCCAGT GGTACACACA GCTCATGTTC GCGCAACACG TGGCGAACCT GGCGCCGGTC TGGGGCCGCA GTCAGGGCAC GGGGCATCCG GGCATCACGT TCTTCAACCG CGGCGGGGGC CCGATCACCT TCGATCCGTT GAACCGCCTC GACCGGCAGA TGAACGCGCA CCTGTTCCTG TTCGGCCCCA CGGGTTCGGG CAAGAGCGCG ACGCTCAACA ACATCCTGAA CCAGGTCACG GCGATCTATC GGCCGCGCCT GTTCATCGTC GAGGCGGGCA ACAGCTTCGG CCTGTTTGGC GACTTCGCGG CACGGCTGGG TCTCACCGTG CATCGGGTGA AACTTGCACC CGGCGCGGGC GTCAGTCTGG CTCCGTTCGC CGACGCCTGG CGCCTGGTCG ATACGCCGAG CCAGGTACAG ACGCTGGACG CCGATGCGCT CGACGAAGAC CAGACCGATG CCGGCATGGC CGTGGAGGGC GACGAGCAGC GCGACGTGCT CGGCGAGCTG GAGATCACTG CACGACTGAT GATCACCGGC GGCGAGGACA AGGAAGAAGC GCGCATGACG CGCGCCGACC GCAGCCTGAT CCGCCAGTGC ATTCTCGATG CGGCACAGCG CTGCGTAGCG GAGAAGCGCA CGGTACTGAC CCGCGATGTG CGCGACGCGC TGCGCGAGCG CGCCCGCGAT GCGACGCTGC CGGAGATGCG GCGCGCACGG CTGCTGGAGA TGGCCGACGC CATGGATATG TTCTGCCAGG GCGTGGACGG CGAGATGTTC GACCGGTCCG GCACGCCGTG GCCCGAGGCG GACATCACCA TCGTGGATCT CGCCACCTTC GCGCGCGAGG GCTACAACGC CCAACTCTCG ATTGCCTACA TCTCGCTCAT CAACACCGTC AACAACATCG CCGAGCGCGA CCAGTTCCTG GGCCGCCCGA TCATCAACGT GACGGACGAA GGCCACATCA TCACGAAGAA CCTGCTGCTT GCGCCCTACG TGGTCAAGAT CACCAAGATG TGGCGCAAGC TCGGCGCCTG GTTCTGGCTC GCCACGCAGA ACCTGGACGA CCTGCCGAAA GCGGCCGAGC CCATGCTCAA CATGATCGAG TGGTGGATCT GCCTGTCGAT GCCGCCCGAT GAAGTCGAGA AGATCGCGCG CTTCCGCGAA CTCAACGCTT CGCAGAAGGC GCTGATGCTC TCGGCACGCA AGGAGGCCGG CAAGTTCAGC GAGGGCGTCA TCCTGTCCAA GTCGATGGAA GTGCTGTTCC GCGCTGTGCC GCCCAGCCTC TACCTGGCGA TGGCGATGAC CGAGCCCGAG GAGAAGGCCG AACGCTTCCA GTTGATGCAG CAGCACGGCA TCAGCGAGCT GGATGCCGCC TTCCGCGTGG CCGAGAAGAT CGACCGCGCG CGGGGCATCG AACCGCTGGC GCTGGACACG TTGGCCTGA
|
Protein sequence | MAWSLPWSRK SDASPVDAAD TTDDAWARHV TALAAQGVAE PGSALGRGRR RPATQADHDA LYGVAPSFAD LLPWVEYLPG SKCMLLEDGQ SVAAFFELAP VGTEGREMAW LWQARDALEN ALQDSFDELD DNPWVVQLYA QDEADWDNYL RSLANYLQPR AQGSAFSDFY LRFFAHHLRA IAKPGGLFED TTVTRLPWRG QVRRVRMVVY RRTSAATTPR RGQSPEQALT TICDRLAGGL ANAGVKARRL GAADIHAWLL RWFNPNPTLL GTTAEDRERF YALTRYPEEQ EEGEIELASG TDFAQRLFFG QPRSDVPNGL WFFDGMSHRV IVMDRLRTPP VTGHLTGETR KGGDAMNALF DQMPEDTVMC LTLVATPQDV LEAHLNHLAR KAVGETLASE QTRQDVQQAR GLIGSAHKLY RGALAFYLRG RDLAQLDARG LQLVNVMLNA GLQPVREEDE VAPLNSYLRW LPCVFDPASD KRQWYTQLMF AQHVANLAPV WGRSQGTGHP GITFFNRGGG PITFDPLNRL DRQMNAHLFL FGPTGSGKSA TLNNILNQVT AIYRPRLFIV EAGNSFGLFG DFAARLGLTV HRVKLAPGAG VSLAPFADAW RLVDTPSQVQ TLDADALDED QTDAGMAVEG DEQRDVLGEL EITARLMITG GEDKEEARMT RADRSLIRQC ILDAAQRCVA EKRTVLTRDV RDALRERARD ATLPEMRRAR LLEMADAMDM FCQGVDGEMF DRSGTPWPEA DITIVDLATF AREGYNAQLS IAYISLINTV NNIAERDQFL GRPIINVTDE GHIITKNLLL APYVVKITKM WRKLGAWFWL ATQNLDDLPK AAEPMLNMIE WWICLSMPPD EVEKIARFRE LNASQKALML SARKEAGKFS EGVILSKSME VLFRAVPPSL YLAMAMTEPE EKAERFQLMQ QHGISELDAA FRVAEKIDRA RGIEPLALDT LA
|
| |