Gene Information |
Locus tag | Tmz1t_0887 |
Symbol | |
ID | 7084745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 981281 |
End bp | 983158 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697910 |
Product | type II and III secretion system protein |
Protein accession | YP_002354550 |
Protein GI | 217969316 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02519] pilus (MSHA type) biogenesis protein MshL |
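The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Sketch only: assumes 1-based, inclusive coordinates and a terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon terminates translation and encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above (Tmz1t_0887).
length_bp = gene_length(981281, 983158)
print(length_bp)                  # 1878
print(protein_length(length_bp))  # 625
```

Both results agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields. The strand field does not affect this calculation, since the length of a feature is the same on either strand.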
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0606978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATGCA CATCGACCGA CCTGCGCAGG CTGCTCGCGA CCGCGGCGCT TCTGGCGACC GCTGCATGTG CGCCGCTCGG CCGCCCGCCC GCACTCGAGG GGCATCTCGA CAAGCGCACC GCGATCGAGC GCACCGCGCC TGTCGGCACG GCGACAAGAA GCCTCGCGCC GCCGATCGCG GAGTCGCCTT CTTCCAGCCG AGGTGCTGCA GCGAGCGCGG CGACGGGCGC GCCTCCGCCC GCTCCGGCCC GCGGGCGAGC AGCTCCCGCG GTGCGCGGAT ACTCGATCGA CGTTTTCGAC ATCCCGGCCG AGCAGCTGCT GCTCGCCATC GGCCGCGACG CCGGGCTGGA GCTCGACCTC CATGCCGGAA TCACCGGAAA CGTCAGCCTG CAGGCCCACG ACCGCCCGCT CGGCGAGCTG CTGGAGCGCA TCGCCCGCCA GGTCGACCTG CGCTATGAGC TGCGCGGCCG CCACCTGCGC GTGCGCCCCG ACACCCCCTA CATCCGCAGC TATGCGGTGG ACTATGTGAA CCTGAGCCGC GACATGCACG GCACCGTCGC CACCAGCACG CAGATCGCCA CCTCGGGCAG CGCCGGCGGC GCCGGTCAGC GCGCGTCTTC GACAGGCGGC GCTGGCAACG CCTCGATCAC CCAGATCGAG ACGCGCTCGG CCAACGACTT CTGGAACACG CTCGAGCACA ACCTCGCCGC CATCCTCGCC GGGCCGGGGC GCGCGCCCGA CTGCGGGGGC GAGCGCCGCG GCGAGTCCCC CGCCGGCTGC CGCAACGCCG ACCTGATGCT CAACCGCGAG AGCGGCATCG TCATGGCGCG CGCCACCCAG CGCCAGCACG AGCAGCTGGA GGCCTTCCTC GCGGGGGTGC AGCGCGCGGC GCGGCGCCAG GTGATGATCG AAGCCACGAT CGTCGAGGTC ACCCTCGCCG ACGGCTACCG CCAGGGCATC GACTGGACCC GGCTCGGCGT GCCGAGCTGG AGCGTGCGCC CGCGCGGCAG CGGCGACGCC ACATCCGGCC AGCCCACGCT GAGCTACCTG TCGGCCAACT CCGACATCCG TCTCGAACTG CTGGAGACCT TCGGCACCGT CAAGGTGCTC TCCAGCCCGC GCCTGTCGGT GCTCAACAAC CAGACCGCGA TGCTCAAGGT CGTCGAGGAG GTCGTCTACT TCCTCGTAGA CAGCACCACC ACCGAGTACG ACGACCGCAA GCAGGCCCGC ATCTCCGCGA CCACCACGCC GCAATCGGTC TCGGTCGGCA TGGTCATGTC GGTCACGCCA CAGATCTCCG CCGACGGCGA CATCACCCTC AACGTGCGGC CGACCATCTC GGGCATCTCC GGCTTCAAGG ACGACCCCAA CCCCTCGCTC GGCGACATCC CCAACCGCGT GCCGCAGATC CGCACACGCG AAATCGAGTC CATGCTGCGC CTGCGCAGCG GCGAGGTCGC CGTCCTCGGC GGGCTGATGG AGGACCGCAT CGACCACGAC ACCGGCCGCA TCCCGCTGTT CGGCGACCTG CCCGTGGTCG GCGAACTATT CACCCGCCGC GACAACAACG TGCGCCGCAC CGAGCTGCTC ATCTTCCTGC GCCCCCTGCT GATCGACGAG CCGGGCATGC GCGGCGGCTA CGCCGAATAC GCCGGCCACC TGCCCGACGA CGACTTCCTC CTCGACTCGC CCTCGCCCGG CCAGCGCAAC TTCCCCCACC TGCCGCTGCG CCCGCTGCTC GCCCCCGCAG GCGCGGTCTC CGCCGCCGAA GGCTTTTCGG AAGCGGCCTA TCCCGACCCT 
GCCGATCCCC GCTCTCCTGC CGCCTCCTCC ACGTCCGCCA GCGCCGCCAA CGCGGATGCC GGCACGAGCC CGCCATGA
|
Protein sequence | MTCTSTDLRR LLATAALLAT AACAPLGRPP ALEGHLDKRT AIERTAPVGT ATRSLAPPIA ESPSSSRGAA ASAATGAPPP APARGRAAPA VRGYSIDVFD IPAEQLLLAI GRDAGLELDL HAGITGNVSL QAHDRPLGEL LERIARQVDL RYELRGRHLR VRPDTPYIRS YAVDYVNLSR DMHGTVATST QIATSGSAGG AGQRASSTGG AGNASITQIE TRSANDFWNT LEHNLAAILA GPGRAPDCGG ERRGESPAGC RNADLMLNRE SGIVMARATQ RQHEQLEAFL AGVQRAARRQ VMIEATIVEV TLADGYRQGI DWTRLGVPSW SVRPRGSGDA TSGQPTLSYL SANSDIRLEL LETFGTVKVL SSPRLSVLNN QTAMLKVVEE VVYFLVDSTT TEYDDRKQAR ISATTTPQSV SVGMVMSVTP QISADGDITL NVRPTISGIS GFKDDPNPSL GDIPNRVPQI RTREIESMLR LRSGEVAVLG GLMEDRIDHD TGRIPLFGDL PVVGELFTRR DNNVRRTELL IFLRPLLIDE PGMRGGYAEY AGHLPDDDFL LDSPSPGQRN FPHLPLRPLL APAGAVSAAE GFSEAAYPDP ADPRSPAASS TSASAANADA GTSPP
|
| |
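The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration under stated assumptions, not the database's own method: it assumes the gene sequence is given in coding orientation (it begins with ATG even though the strand is listed as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the set of permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Sketch only: pure-Python translation using the standard codon assignments,
# which translation table 11 shares for all 64 codons.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence shown above.
print(translate("ATGACATGCA CATCGACCGA CCTGCGCAGG"))  # MTCTSTDLRR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.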