Gene Information |
Locus tag | Tmz1t_0783 |
Symbol | |
ID | 7084175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 867786 |
End bp | 868748 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697807 |
Product | protein TolA |
Protein accession | YP_002354448 |
Protein GI | 217969214 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain; [TIGR02794] TolA protein |
|
|
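The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the last codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 867786
end_bp = 868748

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# The final codon is assumed to be a stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 963 0 320
```

The result matches the Gene Length (963 bp) and Protein Length (320 aa) fields.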
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0122921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACC GCCCCGCACC GCACCGCCGC GAGCCGCCTG GCAAGTGGCA GTCCCTCGGG TTGACCGTGG CGGTCCACCT GGGATTGTTC CTGTTCCTGT TCTTCGGCAT CCGCTGGCAG AGCCCGCCGC CCGCGGCGCT CGAGGTCGGG CTGGCTTCGG CACCGGAGCG CAGCGCGCCA GCCGCAGCGC GCCCGCAGCC GGCGCCGGAG CCCCGGCCGG AGCCGAAACC CGAACCGAAG CCCGAGCCCA AACCCGAACC GAAGCCCGAG CCCAAACCGG AACCGAAGCC GGAACCGAAG CCGGAACCCA GACCTGAACC GCCCAAGCCC CAGGCCAAGC CCGAGATCGT CGCCAAGGCG CCCGAGAAGA AGCCGGAGCC GCCCAAGCCC GAGCCACCGA AACCCGAACC CAAACCACAA CCGAAGCCGG AGCCCAAGCC TGAACCGAAG CCCGAGCCCA AGCCCCAACC CAAGCCGGAG CCGAAACCCG AGCCCAAGCC GCAGGCCAAG CCCGAACCCA AGCCCCAGCC CAAGCCGGAC ACCAAGCCGA TCGACGACTA CATGGCGCAG CGCCTCGCGC AGGAGACCCA GCGCGCCGAG CAGGCCCGCC TGTCCAGCCT GATGGCGCAG GAAGGTGCGC GCGCGGCCGC GGCCGGTCCG AGCCGCCAGG GTCCGCCCGG TGGCGACATC GACAAATACC GCGCGGCGAT CGCTGCCAAG GTGCGCGGCA ACCTGTTGCG CCCGCCCGGC CTGTCCGGCA ACCCGGAGGC GGTGTTCGAG GTCGACCAGC TGCCGAGTGG CGAAGTGCTC AATGTCCGCC TCAAGCGCTC GTCCGGCGTG CCCGCCCTCG ACGAGGCCAT CGAGCGCGCG ATCCGGCGCT CGAGCCCGCT GCCGCTGCCG GACAACAAGA ACCAGTTCGA GCGTACCCTC GAACTGAAAT TCCGTCCGCT GGCGGACGAT TGA
|
Protein sequence | MRDRPAPHRR EPPGKWQSLG LTVAVHLGLF LFLFFGIRWQ SPPPAALEVG LASAPERSAP AAARPQPAPE PRPEPKPEPK PEPKPEPKPE PKPEPKPEPK PEPRPEPPKP QAKPEIVAKA PEKKPEPPKP EPPKPEPKPQ PKPEPKPEPK PEPKPQPKPE PKPEPKPQAK PEPKPQPKPD TKPIDDYMAQ RLAQETQRAE QARLSSLMAQ EGARAAAAGP SRQGPPGGDI DKYRAAIAAK VRGNLLRPPG LSGNPEAVFE VDQLPSGEVL NVRLKRSSGV PALDEAIERA IRRSSPLPLP DNKNQFERTL ELKFRPLADD
|
| |
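The gene and protein sequences above can be checked against each other with a few lines of code. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is applied even though the strand is "-".

```python
# Codon table in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons; stop codons are shown as '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGAGGGACC GCCCCGCACC GCACCGCCGC"
print(translate(first_groups))  # MRDRPAPHRR
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.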