Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1628 |
Symbol | |
ID | 7084838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1823370 |
End bp | 1826144 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698648 |
Product | PEP-CTERM system TPR-repeat lipoprotein |
Protein accession | YP_002355279 |
Protein GI | 217970045 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.57702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTACCC GACCCGAACG TCGCGCAGTG CGCACGATGA CGCTCTGCGC CACGCTCCTG CTCGCCGCCT GTGGCGACAG CGGCAGCCCA CAACACTACG TCGCACTCGC CCGCGACCAC CTTGCCAACG GCGCCTATCG CGAAGCGACC ATCGAGCTGA ACAACGCACT CCAGAAGGAT CCGAAGAACC GCGAGGCACG CTGGCTGCTG GCGCAGGCGG CGCTCCAGCT CGGCGAAGCC GACAAGGCCG AACGCGATGC ACGCAAGGCG ATCGAATACG GGTTCTCGCG CACCGAAGCC CTACCCCTGC TGGCGCGGGC GATCCTGATG CAGCAGGCGC CCGACCGGGT TCTCACCGAA CTCTCCACCG CCCCGACGGA CGCGCCCGAC ACAATGCAAG CCGAGTATGC AAGTCTGCGT GGCACCGCGC TCCTGCTCAA GGGAGAACTC GACGCTGCCG AGCCCGAGTT CGGCAAGGCG CACAAGCTGG ACCCCGCCCT GCCGGAGGCC ATCGTCGGCC TCGCGCTGGC GCAGAGCTTG CGCAAGCAGT ACGACGAGGC GCGCAAGACG CTTGCACCCG CATTAGAGCG CACGCCGCCG GTCGCCGACG CGTGGTCGCT GCTCGGCGAC ATTGAAACCG AGCAGGAGCG CTTCGACGCC GCCGAGACCG CCTTCGGCCA GGCGATCCGG GCGCGTGCCC ATGTCACCCT CGAGCGCGCG AAGCGCGCCC TCGCCCGCGT GCGCCAAGGC AAGTTCGCAG AGGCCGAGGC GGACCTGAAC GCACTCGGAG CGCTTTCTCG CCATCCCTAT GCGCAGTACG TCACCGGCCT GTCCCATTTC CGCCAGCAGC GCCTGCGCGA GGCGGCTGAC GCATTCGAAC TTTCCTTGGC CGCCGATCCG AATTTCGCAC CCAACCGGGT CTATCTCGCC ATCACCCGGC TGATGCTCGG CCAGCAGGAA CAGGCGCTGG CGCACGCCGA ATTCATCCGT GCTGCCGCAC CGCAGGCTTC GGGCGCGAAC CTGCTCCTCG GCATCGCCCA GGCTGGTCAC GCCGACTACG GCCAGGCGCG CAAGACGCTC GAAGCCGCCT TGGCCTCCGA GCCGGACAAC GTCACCAATT TGCAGTTGCT GGCCACGCTG AGCCTGCTGC AGGGCGACAG CAAGACCGCA CTCTCCCATG CCCAGCGTCT CGCCACGCTG CGTCCGGACT CGACGGGCGC CATCAACCTT TTAATGATGG CGCAGTTGAT GTCGGGTGCG GAAGTGCCCG AGCCGACCGC CGGTCAGGTC GATGCGCTCC AGGCCGAGTT CATGCAGGCC CTCGAGGCCT TCCGCGACAA GCGCTTCGGC GAAGCGACAA AGCGCGCCGA GGCCCTGCGC GCAGCGCATC CCGAACAGAT CGGGCCGATC AACCTGCTCG CCGCCCTGTA TCTTTCCACC GGCCAGTGGC CCAAGGCGCG CAAGGAACTC GAGACCGTCC TGCAACGCCA GCCCGCGGAT GCGACCGCAC GCATCAACCT CGCCAAGCTC GAACTGCAAG ATCGCAACTT CCAGCGCATT AAGGAGCTCG TGAGCCCGCT CGTGCTCGCG TCGCCTTCGG CGGAAGCGCC CGCGCTCCTG CTTGTGGCCG CCGAGCATGG GCTGCAGAAC GACGTCGCCG CGGACCAGGT GCTTGAGCAG TTGGTCAAGA GCAACCCCTC GGCCACGCTT GCACGTGCCC TGGCCGCAGG CCGTGCGCTA CGAGGCAACA AGCCGGAACG CACGCTCGAA TACCTTGCCC AGTTCGACAA GGCACGCATC GAGTCCTCGC CGGCGCTCCT CGAACTGCGC GGGCGCGCTC ACCTGGCCCT CGCCCAGAAC GCCGAAGCCC TCGGCAACTT CGAGCGCTGG GCGCACTTGG CCCCCGAATC CGCTGCAGCG CACTTCCTGC ATGCCGAAAC CCTCGCTCGC CTGGGACGCA TCCGTGACGC CGAGGGCGCG CTGGTCCGGG CCGTCAAGCT CGACCCGACC AACCCCGAGG TGCGCATCGC CGAGGTACGC ATGCTGACCC GGACGGGACA ACTCGACAAG GCAAAGAGCG CTGCGCAGCG CCTGCGCAAG GATTTCGGCG ACCGCCCGGA CATCCTCGCC ACCACCGGCT GGCATGCATT GATGACCAGC GAGTTCGCCA TCGCGGCCGA CCACCTCGGG CGTGCATTCG CGCAGACGCC GAGCACAGCC CTCCTCCTCG AGAGGATGGC CGCGTTGTGG GGCATGGAGA AGCGGGATGA AGCCCTCGCG CTCATCCGGG ACTGGCTGGC CGAACACCCG CAGGACAGCG CAGCGCTGCT CCAGCTTGCC GGCGCATATC TGGAGCTCGG CCAGGACGAG GAGGCAGTCC GCATCTATCG CAAGGTGCTG GAGCTCGACC CGGCGCACGT CCCGTCGCTC AACAACGTCG CTTGGCTGTT GCGCCGCAGC CGGCCGGACG AAGCCCTTCA AACGGCCCGT CGCGCCCTCG AGCTTGCCCC CAAGGACCCG AACGTGCTCG ATACCGTCGG CATGTTGTAT CTGGACCGCA GGGACCTGAC GCAGGCAGGG TGGTACGTCG GCAAGGCGCA TGAGCAGAAC CCGCGCAACC ACCAGATCAG CCTGCACCTC GCCGAGGTCG CACACGCCAA GGGCAACACC GCAGAGGCGC TCAAGCTCAT CGACGGCGTG CTTGCAGAGG CGCGCGATCC TGCACTTCAC AAACAGGCCG AGAGCCTGCG TGCAACGCTC GAGAGCGGGC GCTGA
|
Protein sequence | MPTRPERRAV RTMTLCATLL LAACGDSGSP QHYVALARDH LANGAYREAT IELNNALQKD PKNREARWLL AQAALQLGEA DKAERDARKA IEYGFSRTEA LPLLARAILM QQAPDRVLTE LSTAPTDAPD TMQAEYASLR GTALLLKGEL DAAEPEFGKA HKLDPALPEA IVGLALAQSL RKQYDEARKT LAPALERTPP VADAWSLLGD IETEQERFDA AETAFGQAIR ARAHVTLERA KRALARVRQG KFAEAEADLN ALGALSRHPY AQYVTGLSHF RQQRLREAAD AFELSLAADP NFAPNRVYLA ITRLMLGQQE QALAHAEFIR AAAPQASGAN LLLGIAQAGH ADYGQARKTL EAALASEPDN VTNLQLLATL SLLQGDSKTA LSHAQRLATL RPDSTGAINL LMMAQLMSGA EVPEPTAGQV DALQAEFMQA LEAFRDKRFG EATKRAEALR AAHPEQIGPI NLLAALYLST GQWPKARKEL ETVLQRQPAD ATARINLAKL ELQDRNFQRI KELVSPLVLA SPSAEAPALL LVAAEHGLQN DVAADQVLEQ LVKSNPSATL ARALAAGRAL RGNKPERTLE YLAQFDKARI ESSPALLELR GRAHLALAQN AEALGNFERW AHLAPESAAA HFLHAETLAR LGRIRDAEGA LVRAVKLDPT NPEVRIAEVR MLTRTGQLDK AKSAAQRLRK DFGDRPDILA TTGWHALMTS EFAIAADHLG RAFAQTPSTA LLLERMAALW GMEKRDEALA LIRDWLAEHP QDSAALLQLA GAYLELGQDE EAVRIYRKVL ELDPAHVPSL NNVAWLLRRS RPDEALQTAR RALELAPKDP NVLDTVGMLY LDRRDLTQAG WYVGKAHEQN PRNHQISLHL AEVAHAKGNT AEALKLIDGV LAEARDPALH KQAESLRATL ESGR
|
| |