Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3478 |
Symbol | |
ID | 7872984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3812186 |
End bp | 3813394 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700418 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002890449 |
Protein GI | 237654135 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCG CCGACACCCT GCGCTTCGCC ACCCGCGCCG CCACCGCGTG GCCGCTGCGC ACCGCGTTGA TGATGCTGGC GATGTCGATC GGCGTCGCCG CGGTGGTGGT ACTCACCGCG CTCGGCGACG GCGCGCGGCG CTACGTGGTG AACGAGTTCT CCGCGCTCGG TGCCAGCCTG CTGATCGTGC TGCCCGGACG CACCGAGACC GGTGGCGTCA ACGCGGGCAG CTTCGTCAGC AGCACGCCGC GCGAGCTCAC CAACGCCGAC GCCGCCGCAC TGTTGCGCTT GCCGCTGGTG CAGCGCGTGG CGCCGCTCGC GGTGGGCAAC TCCGAGATCG CCGTCGGCGG CCGGCTGCGC GAGGTCACGG TCGTCGGCAC CAGCGCCGAC TTCCTCGAGC TGCGCGGTCT GAAGATCGGC CACGGCGGCT TCCTGCCGCG CGAGGACTTC AACCGCGCCT CGGCGGTGGC TGTGATCGGC GACGCGCTGC GCAGCGAACT CTTCCCCGGA CAGAGCGCGG TGGGCCGCAT GATCCGTGTC GGCGACACCC GGCTGCGCGT GATCGGCGTG CTCACGCCAT CCGGGCGCGG GCTGGGCATG ACCACCGACG AGCTGGTGCT GGTGCCGGTG GCGACCGCGC AGGCGATGTT CGACACCAGC GGGCTGTTCC GCATCTTCGT CGAGGCGCGC GGGCGCGAGG CCTTGCCCGC GACGCAGCGC CAGATCGAGG AGCGCCTGCG CGCGCGCCGC GACGACGAGC TCGACTTCAC CGTGATCACC CAGGACGCCG TTCTCGGCAC CTTCGACCGC ATCCTCGGGG CGCTCACCCT GGGCGTGGCC GGCATCGCCG CGATCAGCCT GGCAGTCGCG GGCATCCTGG TGATGAACGT GATGCTGGTC GCGGTCACCC AGCGCACCGC CGAAATCGGC CTGCTCAAGG CGCTCGGCGC GCGCGCCGGC ACCATCCGCG CCGCCTTCCT CGCCGAGGCC GCGCTGCTCT CCGTCGCCGG CGCGCTCGCC GGCTTCGCGC TCGGCCACGC CGGCGCCTGG GGCGTGCGCC TGGCTTTCCC GCAGCTGCCG GCGTGGCCGC CCGACTGGGC GGTGATCGCC GCGCTCGCCA CCGCGCTCGG CACCGGGGTG CTGTTCGGCG TGCTGCCCGC GCGCCGCGCC GCCCGGCTCG ATCCGGTGCA GGCCTTGTCG AAGCGGTAG
|
Protein sequence | MSPADTLRFA TRAATAWPLR TALMMLAMSI GVAAVVVLTA LGDGARRYVV NEFSALGASL LIVLPGRTET GGVNAGSFVS STPRELTNAD AAALLRLPLV QRVAPLAVGN SEIAVGGRLR EVTVVGTSAD FLELRGLKIG HGGFLPREDF NRASAVAVIG DALRSELFPG QSAVGRMIRV GDTRLRVIGV LTPSGRGLGM TTDELVLVPV ATAQAMFDTS GLFRIFVEAR GREALPATQR QIEERLRARR DDELDFTVIT QDAVLGTFDR ILGALTLGVA GIAAISLAVA GILVMNVMLV AVTQRTAEIG LLKALGARAG TIRAAFLAEA ALLSVAGALA GFALGHAGAW GVRLAFPQLP AWPPDWAVIA ALATALGTGV LFGVLPARRA ARLDPVQALS KR
|
| |