Gene Information |
Locus tag | Tmz1t_0886 |
Symbol | |
ID | 7084744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 979569 |
End bp | 981284 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643697909 |
Product | type II secretion system protein E |
Protein accession | YP_002354549 |
Protein GI | 217969315 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
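The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(979569, 981284)
print(length_bp)                  # 1716
print(protein_length(length_bp))  # 571
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.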
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.135966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCTC CGCTCCCCAT CGGCCAGATC CTCATCGCCG CCGGCCTGAT CGGCGAGGAC CAGCTCCGCA TCGCGCTCCA CGAGCAACGC GGCCGCGCCC GGCCGCTCGG CCGCGTGCTG GTCGAACTCG GCTTCGTCAG CGAGGCCGCG CTGCGCGAGG CGCTCGCCGC ACGCAGCGGC CTGCCCTGCG TCGACCTGGC CAGCGCGCTC GCCGACCCCG ACGCGATCGC GCGCGTGCCG CAGGCGCTCG CAAGGCGCCA CCGCCTGCTG CCGCTCCAGT ACGACGCCGC CCGCCACCGC CTGATCGTCG CGATGGCCGA CGCCCACGAC ATCGTCGCCC TCGACCGTCT GCGTGCCGAG CTCGGGCCGG ACGTGCACGT CGAACTGCGC CTGGCCGGCG ACAACGAACT CGGCCGCGCG ATCGAACAGC ACTACGGTCA GGCGAGCTCG ATCGAGGACA TGGTGCGCGA GCTCGAACGC CGCGCCGGAC AGCCCATCGC TGGCGCGCGC GACCCGCAAC TCGTCGTGCG CCTGGTCGAC GCCCTGCTCG CCGAGGCCGC CGCGCGCGGC GCCTCCGACC TGCACCTGGA GCCCGAGGCC GGCTTCCTGC GCGTGCGCCA CCGCATCGAC GGCACGCTGC GCCAGGTGCG CGCGATGCAC AAGTCGTGCT GGGCGGAGCT CGCCGTGCGC ATCAAGGTGC TCGCCGGCAT GGACATCGCC GAGTCGCGCA GCCCGCAGGA CGGGCGCATC GGCCTTGCGC TCGGCGGCCG GCCGATCGAC TTCCGCGTCG CCACCCAGCC CACCTTGCAC GGCGAGAACA TCGTGCTGCG CATCCTCGAC CGCGACAAGG GCATCGTCCC GCTCGACGCG CTCGGCCTCG ACCCCGCGCA ACGCGCCGCC CTCGACCGCA TCCTCGCCCG CCCCGAGGGC CTGATCCTGG TGGTCGGTCC CACCGGCAGC GGCAAGACCA CGACGCTGTA CTCCATGCTC GGCCACCTCA ACACCGAGGC GGTCAACATC ATGACTCTGG AAGATCCGGT CGAGTATCCG CTCGCGCTGA TCCGCCAGAC CCAGGTCGGC GACGCCTCAC GGCTCGACTT CGCCGACGGC GTGCGCGCGC TGCTGCGCCA GGACCCCGAC GTGATCCTGA TCGGCGAGAT CCGCGACGCC GACACCGCCA CCATGGCCTT GCGCGCCGCA CTCACCGGCC ACCAGGTGTT CGCCACCCTG CACGCCAACT CGATCTTCGG CGCTCTGCCG CGCCTGCACG AGATCGGCAT CGGCGCCGAA CTGCTCGCCG GCAACCTGTG CGGCATCCTC GCCCAGCGCC TGGTGCGCCG GCTGTGCCCC ACCTGCCGCG AGCCGGTGCC GGACGACGCC CCCGAGCGTC ACCTGCCCGG CCTCGCCGCC GATGCCCCGC CAGCGCCGAG CTGGCGCGCG CGCGGCTGCC CCGAATGCGA CTTCCGCGGC TATCGCGGCC GTCTCGCGAT CATGGAGATC CTGCCCTTCG ACGCCGAGCT CGACGAGCTC GTCGCACGCC GCGCCAGCCC GGGCGAGCTG CGTGCAGCCG CACGCACGCG CGGCCATCGC AGTCTGGCCG AGGACGGACT GCGCCGGGTG CTCGACGGCA ACACCAGCCT CGCCGAGCTC GCCCGCGTGG TCGACCTCTC CGCGCTCGCG GCTACGGCGA AGACCGCGCA CGGAGGTGAG GCATGA
|
Protein sequence | MNAPLPIGQI LIAAGLIGED QLRIALHEQR GRARPLGRVL VELGFVSEAA LREALAARSG LPCVDLASAL ADPDAIARVP QALARRHRLL PLQYDAARHR LIVAMADAHD IVALDRLRAE LGPDVHVELR LAGDNELGRA IEQHYGQASS IEDMVRELER RAGQPIAGAR DPQLVVRLVD ALLAEAAARG ASDLHLEPEA GFLRVRHRID GTLRQVRAMH KSCWAELAVR IKVLAGMDIA ESRSPQDGRI GLALGGRPID FRVATQPTLH GENIVLRILD RDKGIVPLDA LGLDPAQRAA LDRILARPEG LILVVGPTGS GKTTTLYSML GHLNTEAVNI MTLEDPVEYP LALIRQTQVG DASRLDFADG VRALLRQDPD VILIGEIRDA DTATMALRAA LTGHQVFATL HANSIFGALP RLHEIGIGAE LLAGNLCGIL AQRLVRRLCP TCREPVPDDA PERHLPGLAA DAPPAPSWRA RGCPECDFRG YRGRLAIMEI LPFDAELDEL VARRASPGEL RAAARTRGHR SLAEDGLRRV LDGNTSLAEL ARVVDLSALA ATAKTAHGGE A
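The gene sequence is shown in blocks of ten bases and translates codon by codon into the protein sequence. The sketch below, in Python, strips the block spacing and translates with a codon table built from the standard amino-acid string; translation table 11 assigns the same residue to each codon as the standard code and differs only in which codons may act as starts. The function names are hypothetical, and only the first three blocks and the last four blocks of the gene sequence (which begin on a codon boundary) are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


head = "ATGAACGCTC CGCTCCCCAT CGGCCAGATC"
tail = "GCTACGGCGA AGACCGCGCA CGGAGGTGAG GCATGA"
print(translate(head))  # MNAPLPIGQI
print(translate(tail))  # ATAKTAHGGEA
```

The two outputs match the first and last residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.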