Gene Information |
Locus tag | Tmz1t_2694 |
Symbol | |
ID | 7873436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2917798 |
End bp | 2919222 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643699617 |
Product | type II secretion system protein E |
Protein accession | YP_002889673 |
Protein GI | 237653359 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
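The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2917798, 2919222)
print(length, protein_length(length))  # 1425 474
```

Under these assumptions the computed values agree with the Gene Length (1425 bp) and Protein Length (474 aa) fields.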
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00162868 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTCC GCACCGCACG CAAACCGCCC GAAGACTCGT CCACCACCCT GGCCGCAGCG GCGGATACGC CGGCCGCAAC GCCCGCGCCC GTACCCGAGC CTCCGTTCCG CCCGGGCTTC GAACCACTCC CGGCAGCTGC CGAAGCAAAC GCGGATTTCA CCTGGCGCAA ACGCATCCAC GAACGCCTGC TCGACACCAT CGACCTGCGC CGGCGCGATC TCAACCGCAT GTCGGACGAC GAGTTGCGCG CCGAGACCAC CGCGCTCGTA CGCGAGATCA TCGCCGCCGA GGCCGCGCTT CCAGGCGGGC TCGACCGCGA ACAGCTGTGC AGCGAGGTGC TCGACGAGGC CATCGGCCTC GGACCGCTCG AGACCCTGCT CACCGACGAG TCGGTGAGCG AGATCATGGT CAATCGCTTC GACCAGATCT TCGTCGAACG TGGCGGCCGC ATCGCGCCGC ACCCCACCAC CTTCACCAGC GACCGCGCGG TGCTCGGCGT CATCGAGCGC ATCGTCGCGC CGCTCGGCCG CCGCATCGAC GAATCCTCGC CGATGGTCGA TGCCCGGCTG CGCGACGGCT CGCGGGTCAA TGCGATCATC CCGCCGCTCG CCCTCAAGGG CCCGACGCTC ACCATCCGCA AGTTCGCGCG TCGGGCGCTC GAGGTCGCTG ACCTCGTCCG CATGGGTTCG CTCTCGCACG AGATGGCCGC ATTCCTGCGC ACCTGCGTCG AGCAGCGGCG CAACATCGTG GTCTCCGGCG GCACCGGCTC GGGCAAGACC ACCTTCCTCA ACCTGCTGTC GAACTTCATC CCCGACGGCG AACGCATCAT CACCATCGAG GACGCCGCCG AGCTGCGCCT GCGCCACAGC CACCTGGTCA GCCTGGAGGC GCGCCCGTCC AACCTGGAAG GACGCGGCGC GATCAGCATC CGCGACCTCG TGCGCAACGC GTTGCGCATG CGCCCCGACC GCATCGTGGT GGGCGAATGC CGCGGCGGCG AGGCGCTCGA CATGCTGCAG GCGATGAACA CCGGCCACGA GGGCTCGATG ACGACCCTGC ACGCGAACAG CCCGCGCGAT GCGCTCGCCC GCCTCGAGAC GCTCGTCCTG ATGGCCGGCA TGGATCTCCC GCTGGCCGCA ATCCGCGAGC AGATCGCCAG CGCGGTGGAC ATCATCGTCC AGCAGACCCG CTTCGCCTGC GGCGCGCGCC GGCTCACCAG CATCACCGAG CTCACCGGCA TGGAGGGCGG ACGCATCCAG CTGCAGGAAC TCTTCCGCTT CGAGCGCCGC ACCGCCGATT CCCCCACCGC CGACCCGAGC GGCGGCCACT TCACTGGCTG CGATGCCGTG CCGGGCTTCT ACGACGAACT GCGTCGCCAA GGGGTCGCGC TCGACCTGCA CCTGTTCGAC CGGAAGCGCT CATGA
|
Protein sequence | MFFRTARKPP EDSSTTLAAA ADTPAATPAP VPEPPFRPGF EPLPAAAEAN ADFTWRKRIH ERLLDTIDLR RRDLNRMSDD ELRAETTALV REIIAAEAAL PGGLDREQLC SEVLDEAIGL GPLETLLTDE SVSEIMVNRF DQIFVERGGR IAPHPTTFTS DRAVLGVIER IVAPLGRRID ESSPMVDARL RDGSRVNAII PPLALKGPTL TIRKFARRAL EVADLVRMGS LSHEMAAFLR TCVEQRRNIV VSGGTGSGKT TFLNLLSNFI PDGERIITIE DAAELRLRHS HLVSLEARPS NLEGRGAISI RDLVRNALRM RPDRIVVGEC RGGEALDMLQ AMNTGHEGSM TTLHANSPRD ALARLETLVL MAGMDLPLAA IREQIASAVD IIVQQTRFAC GARRLTSITE LTGMEGGRIQ LQELFRFERR TADSPTADPS GGHFTGCDAV PGFYDELRRQ GVALDLHLFD RKRS
|
| |
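The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a sketch under stated assumptions, not the method the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG), and it strips the 10-base spacing used in the listing. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGTTTTTCC GCACCGCACG CAAACCGCCC"
print(translate(prefix))  # MFFRTARKPP
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.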