Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2696 |
Symbol | |
ID | 7873438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2920085 |
End bp | 2920990 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699619 |
Product | type II secretion system protein |
Protein accession | YP_002889675 |
Protein GI | 237653361 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2064] Flp pilus assembly protein TadC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0581937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCCG CGCACTTGCT TGTGCTCGAC GAACGCACGG TGATCGCGGC GGTGGCCCTG CTCGCCGGTT TCGGCAGCGC CGGCATCGTC GCCGCGCTGG CCCGCAGCGT CAGCGCCGTC CGGCCCGAGG ACCGCACCTG GCTCGACACG CCACCGCGCT TCCTGCGCCT GCTGTGGTGG CCCACGCGCT GGATCGCGCA CCTGGTCGCA CCGCTGCTGC CGGCGCGGCT GCAGGCACGG CTGGCCGTGC GACTGCGCCT TGCCGGACTC GACTACACGC TCTCTCCGGC GCAGCTGCTC GCACACCGGG TCGCCGTCGG GCTGTGCGGG CTGGCCGCCG GCCTCGGGTG TGCCCACGCA TGGGCACTTC CTCCGGCGCT TCCCGCGCTG GCGGGCGCCA CCGCCGGCAC ACTGATCGCG TGGTCCTGGC TGGACGATCG CATCCGCTGC CGACGCCGGC TGATGCTCAA GCAGCTCCCC TTCGTGCTCG ACCTCATCAC CCTGTGCGTC GAGGCCGGCC TCAACCTCAC CGGGGCACTC CAGCAGGCGG CCGCGAAAGG CCCTGCCGGA CCGCTCGGCG AAGAACTGCA CCGCGTGCTG CGCGACGTGC GTGCGGGCAA GTCGCGCGCC GACGCGCTGC GCGGTTTCGC CGACCGCATC GGCGAACCGG CGATCGCGAA CCTGGTCTCC ACGGTCATTC AGGCCGAGAA CATGGGCATG AGCCTGGGCC CGATGCTGCG CGCGCAGGGC GAGCAACGCC GCGCGGAGCG CTTCGCCCGC GCCGAGAAGG CGGCGATGGA AGCGCCCGTG AAGATGCTGC TCCCACTGAT CGCCTGCATC TTCCCCTGCA CCTTCATCGT GCTCGGCTTT CCGATCGCGG TGAAGTTCAT GACCATGGGG CTGTGA
|
Protein sequence | MSSAHLLVLD ERTVIAAVAL LAGFGSAGIV AALARSVSAV RPEDRTWLDT PPRFLRLLWW PTRWIAHLVA PLLPARLQAR LAVRLRLAGL DYTLSPAQLL AHRVAVGLCG LAAGLGCAHA WALPPALPAL AGATAGTLIA WSWLDDRIRC RRRLMLKQLP FVLDLITLCV EAGLNLTGAL QQAAAKGPAG PLGEELHRVL RDVRAGKSRA DALRGFADRI GEPAIANLVS TVIQAENMGM SLGPMLRAQG EQRRAERFAR AEKAAMEAPV KMLLPLIACI FPCTFIVLGF PIAVKFMTMG L
|
| |