Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1917 |
Symbol | |
ID | 7085686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2162448 |
End bp | 2163653 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698942 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002355564 |
Protein GI | 217970330 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGCC TCGCCGGCCG CGACATCCTG CACGGCTGGG GCAAGTTCGT GTTCACCGGC GTCGGCCTGG GGCTGCTGAT CGGCGTCACG CTGGTGATGG CGGGCGTGTA TCGCGGCATG GTCGACGACG GCAAGGCCTT GCTCGACAAC AGCGGCGCCG ACCTCTGGGT GGTGCAGCGC GACACGCTCG GCCCCTATGC GGAGTCGTCC AGCGTCCACG ACGACCTCTA CCGCAGCCTG CTCGGCCAGC GTGGCGTGGC GGGCGCGGCG AACATCACCT ACCTGACCAT GCAGGTGCGC AAGGGCAGCG AGGACGTGCG CGCGATGGTG GTCGGCATCG CCGCCGGCCG GCCGGGGGCC ACGCCGGGCT GGCCGCCGTA TCTGGTCGCG GGGCGGCAGA TCACGCGCGG GCATTACGAG GCGGTGGCCG ACGTCGCCAG CGGCCTGCGC CTCGGCGAGC GCGTGGGCAT CCGGCGCAAC CAGTACACCG TGGTCGGGCT GACCCGGCGC ATGGTGTCCT CGGCCGGCGA CCCGATGGTG TTCATCCCGC TCAAGGATGC GCAGGAGGCT CAGTTCCTCA AGGACAACGA CGCCATCCTG CAGGGTCGCC GCCGCACTGC GGCCAACCCG GCGCTCAACC GCCCGGGCGT GCCCGGCCTG CTCGACGCGG TGCTCGCCTC GCAGAGCACC AACCCCTATG TGAATGCCGT GCTGCTCACG CTCGCGCCCG GCCACGCGCC CGACGAGGTC GCCGAATCCA TCCGGCGCTG GCAGCGCCTC ACCGTGTACA CCCGGGCGCA GATGGAGGGC ATCCTCGTCG GCAAGCTGAT CGCCACCTCC GCGCGCCAGA TCGGCATGTT CCTGGTCATC CTCGCGGTGG TGAGCGCGGC CATCGTCGCC TTCATCATCT ACACGCTGAC CATGGACAAG ATCCGCGAGA TCGCGGTGCT CAAGCTCATC GGCACGCGCA ACCGGACCAT TGCCGCGATG ATCCTGCAGC AGGCGGTGGC GCTCGGGCTG ATCGGCTTCG TCGTCGGCAA GATCGCCGCC ACCTGGTCGG CGCCCTTCTT CCCCAAGTAC GTGCTGCTGG TGCCCGCCGA CACGGTGGCG GGCTTCGCGG CGGTGATGGC GATCTGCGTG CTCGCCAGCC TGGTGTCGAT CCGGCTCGCG CTGCGCGTCG ATCCGGCCGA AGCGATCGGA GGCTGA
|
Protein sequence | MISLAGRDIL HGWGKFVFTG VGLGLLIGVT LVMAGVYRGM VDDGKALLDN SGADLWVVQR DTLGPYAESS SVHDDLYRSL LGQRGVAGAA NITYLTMQVR KGSEDVRAMV VGIAAGRPGA TPGWPPYLVA GRQITRGHYE AVADVASGLR LGERVGIRRN QYTVVGLTRR MVSSAGDPMV FIPLKDAQEA QFLKDNDAIL QGRRRTAANP ALNRPGVPGL LDAVLASQST NPYVNAVLLT LAPGHAPDEV AESIRRWQRL TVYTRAQMEG ILVGKLIATS ARQIGMFLVI LAVVSAAIVA FIIYTLTMDK IREIAVLKLI GTRNRTIAAM ILQQAVALGL IGFVVGKIAA TWSAPFFPKY VLLVPADTVA GFAAVMAICV LASLVSIRLA LRVDPAEAIG G
|
| |