Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1622 |
Symbol | |
ID | 7084832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1816784 |
End bp | 1818025 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698642 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002355273 |
Protein GI | 217970039 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAGA TCGCCGTCGG CCGGCTCCGA CTGGCCTTTT CCTTGGCCGC CCGCAACCTC TTCAGGCAGC GTCGGCGTGC GTTGCTCGCG CTCGCGATCA TCAGCGGTGG GGTGATCACC TTCCTGCTGG CTGGCGGCTT CATCCACTGG TTGCTCGGCA ACATGCGCGA GGCGACGATC CACTCGCAGC TCGGCCATGC CCAGATCATC CGCCCGGGCT ATTTCCGCGA GGGGCTCGGC GATCCCTACC GCTATCTGCT GCCGCCCGGC ACCTCGGCCG TAGAGGCACT TTCGCTCAGG GGGCTGCGCA CGGTTTCACC GCGTCTGGCA TTCAACGGTC TGCTGTCGAG TGGCGACACG ACGGTGAGCT TCATGGGCGA GGGCATCGAT CCTGTGCGGG AGACGGCGAT CATGCGTTCG ATCCGGATCG TGCGGGGTCG CAACCTGCAG GTCGCCGACG AACAGGCGGC CATCATCGGG GCCGGGCTTG CCGCCAACGT GGGCGTCGAA CCGGGGGACC GCATCGTGCT GCTGGCGACG GGCGCGTCCG GCGGACTGGG CGCGGTCGAA CTCGAGGTCG CCGGCCTGTT CGCCACCACC GCGAAGGCCT ACGACGACGC CGCGCTGCGG GTGCCGATCG AGGTCGCCCG CAAGCTGATG GGGGTGGATG GCGCGACGAG TTGGGTGGTG CTGCTCGAGG ACACCGACGG CACGGCGTCG GCAGTGGCAA GCCTGCGCGC TGGACTCGCC GCCGAATCCT TCGAAGTCAT TCCCTGGCAC GATCTCGCCG ACTTTTACAA CAAGACGGTC GAACTGTTCA CCCGCCAAGT GGGGGTCGTG CGGCTGCTGA TCGCCTTTAT CGTGGTGCTG AGCATCTCTA ACACGTTGTC CATGGCGGTG TTCGAGCGCA CCAGCGAGAT CGGCACGGTG ATGGCGCTCG GCACCCGCCG ACGTGGCGTG CTCGCCATGT TCATCACCGA AGGCGCCCTG CTCGGCGTGC TCGGCGGCGT TCTCGGCGTC ACCCTGGGCA GCCTACTGGC GCTTGTCATT TCGTACGTTG GCATCCCGAT GCCGCCTCCG CCGGGCATGG ACATCGGCTT CACCGGGCGC ATCTCGGTGT CGCCTGCGCT CGCACTCGAT GCCTTCGTAC TGGCCTTCCT GACCACGCTG CTGGCGAGCG TCATGCCGGC GCTGCGGGCG TCGCGGATGA ACATCGTCGA CGCGCTGCGT CACCAGAGGT GA
|
Protein sequence | MQQIAVGRLR LAFSLAARNL FRQRRRALLA LAIISGGVIT FLLAGGFIHW LLGNMREATI HSQLGHAQII RPGYFREGLG DPYRYLLPPG TSAVEALSLR GLRTVSPRLA FNGLLSSGDT TVSFMGEGID PVRETAIMRS IRIVRGRNLQ VADEQAAIIG AGLAANVGVE PGDRIVLLAT GASGGLGAVE LEVAGLFATT AKAYDDAALR VPIEVARKLM GVDGATSWVV LLEDTDGTAS AVASLRAGLA AESFEVIPWH DLADFYNKTV ELFTRQVGVV RLLIAFIVVL SISNTLSMAV FERTSEIGTV MALGTRRRGV LAMFITEGAL LGVLGGVLGV TLGSLLALVI SYVGIPMPPP PGMDIGFTGR ISVSPALALD AFVLAFLTTL LASVMPALRA SRMNIVDALR HQR
|
| |