Gene Information |
Locus tag | Tmz1t_3479 |
Symbol | |
ID | 7872985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3813400 |
End bp | 3814599 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700419 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002890450 |
Protein GI | 237654136 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
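The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Tmz1t_3479.
length = gene_length_bp(3813400, 3814599)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.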
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTGGC GCGACGCCCT CCACCTCGCC CTGCGTGCGA TCACCGCGCA TCGGCTGCGC AGCCTCCTGA CCCTGCTCGG CATCGGCGTG GGCATCGCCG CGGTGATCCT GCTCACCTCG ATCGGCGAGG GTCTGCACCG CTTCGTGCTC GCCGAGTTCG GCCAGTTCGG CACCAACGTG ATCAACCTCC ACCCGGGCCG CCAGGGCGCA CGCGGCGGCC CGCCCGGCCT GCCGTCCACC GCGCGCGACC TCACCCTGGA CGACGCGGAC GCGCTCGCCC GCCTGCCCCA TGTGCGCCAC GTCACCGGTT CGGTGAGCGG CAACGCCGAG GTGCGCGCGC AAGGCCGGGT GCGGCGCAGC ACCGTGCTCG GCGTGGGACC GCAGATGCAG GAGGTGTACT CGATGCGGGT ACGCGTGGGG CAGTTCCTGC CGCCCGACGA GGCGGCGAGC GCGCGCGCCT TCGTGGTGCT GGGCCCCAAG CTCGCACGCG AGCTCTTCGG CAGCACCATC GCGCTCGGCG AGCACGTCGA GATCGGTGCC GAGCGCTTCC GCGTGGTCGG CGTCATGGAA GAGAAGGGGC AGTTCCTCGG TATCGATCTG GACGACGCCG CCTACATCCC GGTGGTGCGC GGCATGGCGC TGTACCAGCG CGACGGCCTC ATGGAGATCG CGTTGACCTA CGACCCCGAG GCCCCCGCCG CGCGCGTGGC GGAGGCGGTG AAGAAGCGGA TGATCGCGCG CCACGGACGC GAGGACTTCA CCGTGCTGAC GCAGGAGGAC ATGCTGGCGA CGCTGTCGAA CATCCTCGAT CTGCTCACCG CGGCGGTGGG TGCGCTCGGG GCGATCTCGC TGCTGGTGGG CGGAGTGGGC ATCGTCACCA TCATGAGCAT CGCGGTCACC GAGCGCACCG GAGAGATCGG CCTGCTGGTG GCCCTGGGCG CACGTCGGCG CACCATCCTC GCGCTCTTCC TCGGCGAGGC GGTGGTGCTG GCGGGCATCG GCGGCCTGCT CGGCCTGCTC GTCGGCGCAG GCCTTGCCCA GCTGGTCGGG CTGCTGGTGC CGGCGATGCC GGTGGCCACG CCCTGGCGAT ACGCGCTCGC CGCCGAGGGG GTCGCCATCG TGGTCGGGCT CGCCGCCGGG GTCCTGCCCG CACGCAGGGC GGCACGGCTC GACGCGGTCG AGGCCTTGCG CGCGGAGTGA
|
Protein sequence | MRWRDALHLA LRAITAHRLR SLLTLLGIGV GIAAVILLTS IGEGLHRFVL AEFGQFGTNV INLHPGRQGA RGGPPGLPST ARDLTLDDAD ALARLPHVRH VTGSVSGNAE VRAQGRVRRS TVLGVGPQMQ EVYSMRVRVG QFLPPDEAAS ARAFVVLGPK LARELFGSTI ALGEHVEIGA ERFRVVGVME EKGQFLGIDL DDAAYIPVVR GMALYQRDGL MEIALTYDPE APAARVAEAV KKRMIARHGR EDFTVLTQED MLATLSNILD LLTAAVGALG AISLLVGGVG IVTIMSIAVT ERTGEIGLLV ALGARRRTIL ALFLGEAVVL AGIGGLLGLL VGAGLAQLVG LLVPAMPVAT PWRYALAAEG VAIVVGLAAG VLPARRAARL DAVEALRAE
|
| |
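The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for removing that layout and computing a GC fraction follows; the helper names are hypothetical, and the way the record rounds its GC content figure is not stated, so no attempt is made to reproduce the reported percentage.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
first_blocks = "ATGCGCTGGC GCGACGCCCT"
print(len(clean_sequence(first_blocks)))
print(gc_fraction(first_blocks))
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared against the GC content field.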