Gene Information |
Locus tag | Tmz1t_0879 |
Symbol | |
ID | 7084737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 970919 |
End bp | 971908 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643697902 |
Product | KpsF/GutQ family protein |
Protein accession | YP_002354542 |
Protein GI | 217969308 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
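The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 970919
END_BP = 971908


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 990 329
```

Under these assumptions the result agrees with the Gene Length (990 bp) and Protein Length (329 aa) fields listed in the record.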
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00879092 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCAA TTTCCGCTTC GGGCGAAAGC CGAGCCGTCG CCCTCGCCCG CCGCGTGCTG CGCATCGAGG CAGACGCCGT GGCCGCGCTC GGGGCGCGCA TCGGCGAGGA ATTCGAGCGC GCGGTCGAGA TCATCCTCGC GCGCCATGGC CGGGTGATCG TCACCGGCGT TGGCAAGTCG GGCCACATCG GCCGCAAGCT CGCCGCCACG CTCGCCAGCA CCGGCACCCC GGCCTACTTC GTGCATGCCG CCGAGGCCGC ACACGGCGAT CTCGGCATGA TCACGCCGGA AGACGTGGTG ATCGCACTGT CGAACTCCGG CTCCAGCGAG GAGCTGCTCA CCATCGTGCC CCTGGTCAAG CGCCAGGGCG CCCGCCTGAT CGCGATGACC GGCAAGCCGG ACTCGCCGCT CGCACGCGAG GCCGATGCGC ATCTGGACGC CGGCGTGGCC GAGGAGGCCT GCCCGCTCAA CCTCGCGCCC ACCGCGAGCA CCACGGCGGC GCTCGCGCTC GGCGACGCGC TCTCGGTCGC GCTGCTCGAC GCCCGCGGCT TCGCCGCCGA GGATTTCGCC CGCTCCCACC CCGGCGGCGC GCTCGGCCGC CGGCTGCTGA CCCACGTCGG CGACGTCATG CGTCCCGCGC CGGCGGTCCC GCGCGTGGGC AGCGACGCCC CCCTCACCCA GGCCTTGCTG GCGATGACCG CCGGCGGCAT GGGCATGACT GCGGTGGTCG ACGCCGACGA GGTGCCCGTG GGCATCTTCA CCGACGGCGA CCTGCGGCGC GCGCTGGAGA AGGGCTGCGA CGTGCGCAGC GCGCGCGTCA GCGAGGTCAT GACGCGCAGT CCGCGCAGCA TCGCGCCCGG CGCGCTCGCG GCCGAGGCCG CCGCGACGAT GGAGAACATG CGCATCAGCC AGCTCCTGGT GCTCGACGAC GCCGGCCGGC TCGCCGGTGC GCTCACCACC CACGACCTGA TGCTCGCGAA GGTCATCTGA
|
Protein sequence | MDPISASGES RAVALARRVL RIEADAVAAL GARIGEEFER AVEIILARHG RVIVTGVGKS GHIGRKLAAT LASTGTPAYF VHAAEAAHGD LGMITPEDVV IALSNSGSSE ELLTIVPLVK RQGARLIAMT GKPDSPLARE ADAHLDAGVA EEACPLNLAP TASTTAALAL GDALSVALLD ARGFAAEDFA RSHPGGALGR RLLTHVGDVM RPAPAVPRVG SDAPLTQALL AMTAGGMGMT AVVDADEVPV GIFTDGDLRR ALEKGCDVRS ARVSEVMTRS PRSIAPGALA AEAAATMENM RISQLLVLDD AGRLAGALTT HDLMLAKVI