Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3488 |
Symbol | |
ID | 7872994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3826354 |
End bp | 3827652 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700428 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002890459 |
Protein GI | 237654145 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTGTCA ACGAACTGCT CGTCATCCTG CTGCTGATCG CAGCGAGCGC CTTCTTTTCC ATGTCCGAGA TCTCCCTCGC GGCTGCGCGC AAGATCAAGC TGCGCGTCAT GGCCGAGGCA GGGCATCTCA ACGCCCGGCG CGTGCTTGCG CTGCAGGACA GTCCGGGCCA CTTCTTCACC GTGGTGCAGA TCGGCCTCAA CGCCGTCGCC ATTCTCGGCG GCGTAGTCGG CGAACAGGCG CTTTCTCCCT ACGTCACCGA GCTGCTGCGC CGGGCCTACG CCGGCCCGAT GCTCGACACC ATCGCCTTCG TCGTCTCCTT CGTTTTCGTC ACCTCGCTGT TCGTGCTTTT TGCCGACCTC ATGCCCAAGC GCCTGGCCAT GGTCCAGCCC GAACGCATCG CGGTGGCGGT GGTGCAGCCG ATGCAGGTGT GCATGTGGCT GTTCGCGCCG CTGGTGTGGG TCTTCAACGG CGCGGCCAAC CTCATCTTCC GCTGGTTCAA GTTGCCGAGC GTGCGCATCG AGGACATCAC CGCCGACGAC ATCATGGCCA TGGCTGACGC CGGTGCCCAG GCCGGCGCGC TGCTGCGCCA GGAGCAGCAC CTGATCAGCA ACGTGTTCGA GCTCGACTCG CGCATCGTGC CCTCGGCGAT GACCTCGCGC GAGAACATCG TCTTCCTCAC CCTGTCGGAG TCCGAGGAGA GCATCCGGCG CAAGATCGCC GCCCACCCGC ACGGGAAGTT TCCGGTGTGC GAGGACGGCA TCGACAGCGT GATCGGCTAT GTGGACTCGA AGGACATCCT GCCGCGCATC GTGCAGGGCC AGGATCTGTC CTTGCGCACC CAGCCGATCG TGCGCAAGGT GCTGATGCTG CCCGACACGC TCACGCTCTT CGAGGCGCTC GAGCGCTTCC GCGACGCCAA GGAGGATTTC GCGCTCATCC TCAACGAATA CGCGCTGGTG GTGGGCCTGC TGTCGCTGCA GGACGTGATG AACACGGTGA TGGGCGATCT CGTCAGCCCC TTCCAGGAAG AGCTCATCGT GCGCCGAGAC GACAACTCCT GGCTCATCGA CGGCGCCACG CCGATCGAGG ACGTCATGCA GGCGCTCGAG ATCGAGGTCT TCGAGGGCTT CCAGAACTAC GAGACCGTCG CCGGTTTCCT GATGTACCGC CTGCGCAAGG TCCCCAAGCG CACCGACTTC GTGACCTACC TCGGCTACAA GTTCGAGGTG GTCGACATCG ACAACTACCG CATCGACCAA GTGCTGGTCA CCCGCGAGAC CCCGGTCGGC GCCGTGTAA
|
Protein sequence | MPVNELLVIL LLIAASAFFS MSEISLAAAR KIKLRVMAEA GHLNARRVLA LQDSPGHFFT VVQIGLNAVA ILGGVVGEQA LSPYVTELLR RAYAGPMLDT IAFVVSFVFV TSLFVLFADL MPKRLAMVQP ERIAVAVVQP MQVCMWLFAP LVWVFNGAAN LIFRWFKLPS VRIEDITADD IMAMADAGAQ AGALLRQEQH LISNVFELDS RIVPSAMTSR ENIVFLTLSE SEESIRRKIA AHPHGKFPVC EDGIDSVIGY VDSKDILPRI VQGQDLSLRT QPIVRKVLML PDTLTLFEAL ERFRDAKEDF ALILNEYALV VGLLSLQDVM NTVMGDLVSP FQEELIVRRD DNSWLIDGAT PIEDVMQALE IEVFEGFQNY ETVAGFLMYR LRKVPKRTDF VTYLGYKFEV VDIDNYRIDQ VLVTRETPVG AV
|
| |