Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2693 |
Symbol | |
ID | 7873435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2916346 |
End bp | 2917785 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699616 |
Product | type II and III secretion system protein |
Protein accession | YP_002889672 |
Protein GI | 237653358 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000595981 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGGA CTGACCCGAT CTCCCCGCCG GCGCGCCGGC GGGCGCTCCC TCCCCTCGTG TGCGCAATCC TGATCCTGGG CCTGCCCGGC CCGAGCCGCG CCGCTCCGCC CGCAAGCGCC GCGCCGCCGG AACCTGGGCG CGCAGCGCTC GCGGTCGTCC ACGAGATCCA GATGTTCGCC GGCGAAACCC GGGTGCTGCC GCAGCGCGAC GCGCAGCGCC TCGCCGTCGG CGACGCCAAG GTGCTCTCCG CGGCCGTGCT CGACGACCGC GAGATCCTGT TGATCGCCAA CGGCCCGGGC GACACCATGC TGCAGGTGTG GACCCGCTCG GGCCGCAGCC AGCGCATCAA GATCACCGTC CGCCAGACCG ACACCGCCCG CATCGCACGC GACCTGCAGG GCTTCCTGCA CGACGTCCCG CGGCTGCGCA CGCGCACCAT CGGCGACAAC GTGGTCATCG AGGGCGAGGG CCTGAGCGAC GCCGAACGCG ACAAGGTCAC CGAGCTCGCC AAGCGTTTCC CGCAGCTCAT CGACTTCACC AGCCGCGTCG GGCTCGACCG CATGTTCGCC TTCGACGTGC GCTTCGTGGA GATCAGCCGC AGCGGCCTGC GCGACCTCGG CATCGACTGG ACCACGCAGG GCAAGCCGCT GCTCGGGATC GGCGTGATCG GCGACCTGTA CCACGACAAC CGCACCGGGG CGACCGTGAC GCGCACGCTC GGCAGCGGCG AACTGAGCTC CGAGGTCGAG ATCGAGGCGA ACCGCGTCTC GCCCTTCGCC GCCAACGTGG CCCTGGTCGG CAGCTTCTTC GGCCGCCTCA ACCTGCTCGC GCAGAAGGGC GACGCGGTCA TCCTCGCCTC GCCGCGACTC TCGGCGCGCA ACCGCGGCGA AGCGAGCTTC CTCGCCGGCG GTGAAATCCC CGTCCCGGTC GCCTCCGCCA CCGGGACGCC GAGCGTGACG TTCAAGGAAT ACGGCATCCG CCTCAACATC CTCGAACCGG TCGCCGACGC CGCCGGAACG ATCCGCGCGC GCATCCGCAC CGAAGTCAGC TCGCTCGACC GCAGCGTCGC CGCCCAGGGC GTTCCCGGCC TGCTCTCGCG CCGAACCGAG ACCGAGTTCA ACCTGCGCAA CGGCGAGACC CTGGTGCTCT CCGGGGTGCT GCAGCGCGAA CAGCAGAGCG ACGAGAATGC GCTCCCGCAT TTGGGTGAGC TCCCGGTAAT CGGTCGACTG TTCCGGTCCA CCCGGTTCAA CAGCCGCGAA AGCGAACTCG TGGTCTTCGT CACCCCCTGG CTGCTCGAGC CCGGCGAAGG CCTTCTGGAT CCGCGCGCGC AACTGCTCGA ACGCCGTGCC GAGCAGGCCC TCGCCCCCGC GCCCGAACCC TCGCCGGCGC TGCGCGAGCC CGCGGATCCC GCGTGGAACG CCCTCTACGG CGGCAACTGA
|
Protein sequence | MTRTDPISPP ARRRALPPLV CAILILGLPG PSRAAPPASA APPEPGRAAL AVVHEIQMFA GETRVLPQRD AQRLAVGDAK VLSAAVLDDR EILLIANGPG DTMLQVWTRS GRSQRIKITV RQTDTARIAR DLQGFLHDVP RLRTRTIGDN VVIEGEGLSD AERDKVTELA KRFPQLIDFT SRVGLDRMFA FDVRFVEISR SGLRDLGIDW TTQGKPLLGI GVIGDLYHDN RTGATVTRTL GSGELSSEVE IEANRVSPFA ANVALVGSFF GRLNLLAQKG DAVILASPRL SARNRGEASF LAGGEIPVPV ASATGTPSVT FKEYGIRLNI LEPVADAAGT IRARIRTEVS SLDRSVAAQG VPGLLSRRTE TEFNLRNGET LVLSGVLQRE QQSDENALPH LGELPVIGRL FRSTRFNSRE SELVVFVTPW LLEPGEGLLD PRAQLLERRA EQALAPAPEP SPALREPADP AWNALYGGN
|
| |