Gene Information |
Locus tag | Tmz1t_3493 |
Symbol | |
ID | 7872999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3831967 |
End bp | 3833262 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700433 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002890464 |
Protein GI | 237654150 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
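The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the database record: the function name `cds_lengths` is hypothetical, and the protein-length formula assumes a complete, unspliced CDS whose last codon is a stop codon (the record lists the gene as not spliced and not a pseudogene).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a complete CDS that
    ends with one stop codon, which is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1          # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
print(cds_lengths(3831967, 3833262))  # (1296, 431)
```

The result matches the Gene Length (1296 bp) and Protein Length (431 aa) fields. The strand field does not affect these lengths; it only determines which strand of the replicon is read.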
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCG TCTTCATGCT GTCGCTGCGC AGCATCGCCA ACCGCAGGCT GTCGGCCGCG CTCACCGTGG TCGCGGTGGC GCTGGCAGTG GCGCTGCTGC TCGGGGTCGA GCGCCTGCGC AACGATGCGC GCGCGGGCTT CGCGCAGACG ATTTCCGCCA CCGACCTGGT GGTGGGCGCG CGCAGCGGCC CGGTGCAGCT GCTGCTGTAC TCGGTCTTCC ACATCGGCGA CGCCACCGCC AACCTGTCGT GGGCCAGCAT CGAGCACGTC GCCGCGCTGC CCCAGGTGAA GTGGCTGGTA CCGCTCGCGC TCGGCGACTC GCACCGCGGT CACCGCGTGG TCGGCACCAC CACGGACTTC TTCGCGCAAT ACCGCCATGG CGAGGGGCGC AGTCTGGTCT TCGCGGCCGG CGGCCCGTTC GCCGGCGCAA CCGCGCGCGT CGAGGACCTC TTCCAGACCG TGATCGGCGC CGAGGTCGCC GCCCGCCATG GTTACCGCCT CGGCGAGCGC ATCGTGCTCA GCCACGGCGG CGGCGCGGGG GAGGGCGGAG CGCGCTTCGC CGAGCACGCC GACAAGCCCT TCACCGTGGT CGGCATCCTC GCGCCCAGCG CCACGCCGCT CGACCGCGCC GTGCTGGTGA GCCTGGAGGG CCTGGAGGCG ATTCATGTCG ATTGGCATGG CGGCGCGCCC ATCCCGGGCC TGAAGATCAC GCCCGAGCAG GTGCGCAAGT TCGACCTGCG CCCCAAATCC GTCACGGCCG CGCTCGTCGG GCTGCACAGC CGCAGCGCGG TGTTCCGCGT GCAGCGCCAG ATCAACGCCT ACGCCGGCGA ACCGCTCACC GCCATCCTGC CCGGTGCGAC CCTGCAGCAG CTGTGGGACC TCGTCGGCAT CGCCGAGCGC GCGCTGCTCG CGGTCTCGGC CCTGGTGGTC GTGGTCGGCC TGACGGGCAT GGTGGCGGTG GTGCTGGCAA GCCTGGGCGA GCGCCGCCGC GAGCTCGCCA TCCTGCGTGC GCTGGGTGCC AGCCCGCGCG AGGTCTTCGC GCTGCTCGCG CTTGAGAGCC TGCTGCTCGC CACTGCGGGC ATCGCGCTCG GCCTCGGGCT GCTGTATGGT GCGGGCGCCG CGCTCGCCCC CTGGCTCGCC GCGCAGCATG GTCTGCAGCC GAGCCTGGGC TGGCCCGCGG CGGGCGAATG GCGCCTGCTC GGCGCGGTGC TCGCCGCCAG CCTGGTCGCC AGCCTGCTGC CGGCGCTGCG CGCCTACCGC CAGTCGCTCG CCGACGGCAT GACCGTGCGG ACCTGA
|
Protein sequence | MSPVFMLSLR SIANRRLSAA LTVVAVALAV ALLLGVERLR NDARAGFAQT ISATDLVVGA RSGPVQLLLY SVFHIGDATA NLSWASIEHV AALPQVKWLV PLALGDSHRG HRVVGTTTDF FAQYRHGEGR SLVFAAGGPF AGATARVEDL FQTVIGAEVA ARHGYRLGER IVLSHGGGAG EGGARFAEHA DKPFTVVGIL APSATPLDRA VLVSLEGLEA IHVDWHGGAP IPGLKITPEQ VRKFDLRPKS VTAALVGLHS RSAVFRVQRQ INAYAGEPLT AILPGATLQQ LWDLVGIAER ALLAVSALVV VVGLTGMVAV VLASLGERRR ELAILRALGA SPREVFALLA LESLLLATAG IALGLGLLYG AGAALAPWLA AQHGLQPSLG WPAAGEWRLL GAVLAASLVA SLLPALRAYR QSLADGMTVR T
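The GC content and protein sequence fields can be recomputed from the gene sequence. The sketch below is illustrative only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M;
    this gene starts with ATG, so that case does not arise here.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAGCCCCG TCTTCATGCT GTCGCTGCGC"))  # MSPVFMLSLR
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence field (with block spaces removed), and `gc_content` on the same input can be compared with the GC content field.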