Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3624 |
Symbol | |
ID | 7873129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3981170 |
End bp | 3982885 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700564 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_002890594 |
Protein GI | 237654280 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCC AGCTCGCCCG CATCGCCCGC TCCCTCTGCC TGGCCTTCGC GCTCGGCGCC GGCGGGGTCG CGACGGCGGC CAGCGGCGAG GCCGGCGAGG GCAACCTGCC CGCGCAGGAA CTCACTCCGC GCACGCTGTA CCACTTCCTG CTCGCCGAGA TCGCCGGCGC GCGCGGCCAG ATCGGCCTGT CGGCACAGCT CTATCTCGAC CTCGCGCGCA GCACGCGCGA CCCGCGCATC GCCCGCCGCG CCACCGAGAT CGCGATGTAC TCGCGCAACC TCGTGATGGC GCGCGGCGCC GCGGAGATCT GGACCGAGGT CGCCCCCGAC TCGGACGAGG CGCGCCGGGT ACTCGCCGGG CTGAGCGGCG CCGGCCGCGG CGAGGACATC AACCTCGAGG CCATCCAGTT CCAGCTCGCG CGCGTGCTCG CGCAATCGAA CGGCCGCCTC GCGCAGAACC TGCTGAGCCT CGGCCATACC CTCGCCCGCG TACCCGACAA ACAGGCGGTG CGCGGCATCG TGATGCGCCT GACCGAACCC TACGTCGACA TGCCCGAGGC GCACATCGCC CGCGCGCAGG CGGCACAGGC GGTCGAGGAC AGGATGGGCG CGCTCGCTGC GGTGGATCGC GCCCTCGAGC TGCGCAGCGG CTGGGAACCC GCGGTGCTGC TGAAGGTGCA GATCCTCCAG CAGGCAGGCG CGCACACGGA GGCCCTGCGC GTGCTCGAGG CCGAGGCCGC GCGCGCGCCG GCGAGCCGGT CGCTGCGCCT GGCGAAGGCG CGTGCGCTGG TGAGCGCGCA GCGCTTCGGC GAGGCACGCG CGGCCTTCAA CCAGTTGCTC GAAGCCTCGC CGCAGGATCC CGAACTGCTC TATGCGGTGG GCCTGCTGTC GATGCAGCTC GAGGACTTCG CCGCGGCCGA GCTGCACTTC GCACGCGCGC TCGCCGCGGA GCACCCGCAA CCCGACCTCA TCCGCCTTCA CCTGGGCCAG ATCGCCGCCG ACCGCGGCGA GGGCGAGCGC GCACGCAAGT GGTTCGGCGA GATCGAGAGC GAGGACCTCC GTCCCGAGGC GGACATCCGC AGCGCCCTGA GCCTCGCGCA CGAAGGACGC ATCGAGGAGG CGCGCGCCCT GTTACGCAAC GACGTCGAGG ATCCCGACCT CGCCCGCCGC TACCTGCTCG CCGAGGCGCA GATCCTGCGC GACGCCGAGC GTCCGACCGA GGCCCTCGCG CTGCTCGATG CCGCCCTGCG CGAGAACCCC GAAGACACCG GCCTGCTCTA CGAGGCCGCC ATGCTCGCCG AGCGCATCGG TCGCATGGAC CTGCTCGAAG CGCGCCTGCG CCGCGTGCTC GAGCTGCAGC CCGATCATGC GCATGCGCTC AACGCGCTCG GTTATTCGCT CGCCGACCGC GGCCTGCGCC TGGACGAAGC CGAGGCGCTG ATCGCACGCG CGCATGCGCT CATGCCGCAA GACCCCTTCA TTCTCGACAG CCTCGGCTGG GTGCGCTTCC GCCGCGGCGA TCAGGTCGGC GCGCTCGTCC ACCTGGAGCG CGCCTATGGC ATGCGCAAGG ACGCCGAGAT CGCCGCCCAC CTCGGCGAGG TGCTATGGAC ACTCGGCCGT CGGGACGAGG CCCGACGGAT CTTCGCCGAG GCGCTCGCAG CCCACCCCGA CAACCGTCTG CTGACGGACA CCGGCCGCCG ACTGGGCATC CAGTGA
|
Protein sequence | MKTQLARIAR SLCLAFALGA GGVATAASGE AGEGNLPAQE LTPRTLYHFL LAEIAGARGQ IGLSAQLYLD LARSTRDPRI ARRATEIAMY SRNLVMARGA AEIWTEVAPD SDEARRVLAG LSGAGRGEDI NLEAIQFQLA RVLAQSNGRL AQNLLSLGHT LARVPDKQAV RGIVMRLTEP YVDMPEAHIA RAQAAQAVED RMGALAAVDR ALELRSGWEP AVLLKVQILQ QAGAHTEALR VLEAEAARAP ASRSLRLAKA RALVSAQRFG EARAAFNQLL EASPQDPELL YAVGLLSMQL EDFAAAELHF ARALAAEHPQ PDLIRLHLGQ IAADRGEGER ARKWFGEIES EDLRPEADIR SALSLAHEGR IEEARALLRN DVEDPDLARR YLLAEAQILR DAERPTEALA LLDAALRENP EDTGLLYEAA MLAERIGRMD LLEARLRRVL ELQPDHAHAL NALGYSLADR GLRLDEAEAL IARAHALMPQ DPFILDSLGW VRFRRGDQVG ALVHLERAYG MRKDAEIAAH LGEVLWTLGR RDEARRIFAE ALAAHPDNRL LTDTGRRLGI Q
|
| |