Gene Information |
Locus tag | Tmz1t_0011 |
Symbol | |
ID | 7085109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 17927 |
End bp | 19006 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697061 |
Product | hypothetical protein |
Protein accession | YP_002353710 |
Protein GI | 217968476 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT TGTCCGATAC CCCTCAGAGC GAGGGCGCCT GCCCGTTGCC GAAGCGCGTC GAGTTTGCGC AGTGGCTTCT GCGCTTCGTG GATCCTGACC GCAGCTCGGA CTACCGCGAC GCGTTCTGGG ACTTACTGCG TGCACTGCTG ACCCAGTACC AGGGCGATGG CAAGGACGGC ATCGAGCCGA GTTCCCTCAG CACCCAGGAT CTTTGGCGCT GGGCGCTCGA TGGACGCGGT AAGCCATTCG ATGACCTGAC CAGCGAAAAG GCACGCAAGC GGGTGAATCC ACACTGGTCT GCGTTGGAGC AGTCGTTCGC CGAGCTGCAG GGACGCTTCA CCGACGCAGC GCGGAAGGCG GGCTTCACCG GGCTTCTGTG GCCGACCAAG GACAAGTCGG AGGGTGGGCG GTCGTCGACC TATCGTCTCG AGTGGCGTCC CTTCGGTGTG AGCACCGCGC TCCCTACCCT GCCAGCGGAC TACGAAAAGC AGCCCTTTGT GCTTCGGTAC CGTGAGGACC GGTCGAGTCT GCGGTTCTCG CTCCTCGGGC GACTGTTTCT GTGGCCGCTG CTGTTCCGCC GCCACGGCGA TGCGAGCATG CGCATCGATG GCGTGCGGCG CTATGTCCCG GCGGTCGGTC TGGGGTTGAT GATCCTGCTC GCGGGGCTGC CGGTCGTCAT TGTATTGTTG CAACTGGGCA AGGGCGTGCT CGAGTGGGGA TGGATCGCCG TCGCGGCCGT CTTCTTCTGG TTTCCGCTCC GGCCCCTGAT GCGGCTCTAC GATCGATTCA TCATCATGGC CTCGCCGCTC TTCTATCCGT TCTCGGAGAA GGAGTGCCAG GTCGAACTGG TCCGGGACCC TGAAGCAGCG GTGCCGCCAG GGAGGCAGCG CGGTTACAAC CTGCGCCTGG TGCGGTATGT GGCGGATTGC CCGCTGTGTG GCGGCAACGT GTCGCTACGG GATGGCGGGC TTGGCCAGTT CAACCGGCTG GTCGGGTGCT GTGACGAGGA GCCTGGGGAG CACGTTTTCA GTTTTGATCG GAAGCTGCGG GCGGGGCACT GGCTCCGAAG TCGTTGGTGA
|
Protein sequence | MSDLSDTPQS EGACPLPKRV EFAQWLLRFV DPDRSSDYRD AFWDLLRALL TQYQGDGKDG IEPSSLSTQD LWRWALDGRG KPFDDLTSEK ARKRVNPHWS ALEQSFAELQ GRFTDAARKA GFTGLLWPTK DKSEGGRSST YRLEWRPFGV STALPTLPAD YEKQPFVLRY REDRSSLRFS LLGRLFLWPL LFRRHGDASM RIDGVRRYVP AVGLGLMILL AGLPVVIVLL QLGKGVLEWG WIAVAAVFFW FPLRPLMRLY DRFIIMASPL FYPFSEKECQ VELVRDPEAA VPPGRQRGYN LRLVRYVADC PLCGGNVSLR DGGLGQFNRL VGCCDEEPGE HVFSFDRKLR AGHWLRSRW
|
| |
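
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Start bp, End bp).
print(gene_length(17927, 19006))   # 1080
print(protein_length(1080))        # 359
# First 10-base block of the gene sequence, used only as a small demo input.
print(gc_percent("ATGAGCGATT"))    # 40.0
```

Passing the full "Gene sequence" value to gc_percent gives the figure to compare with the reported GC content; the spaces between the 10-base blocks are skipped by the filter.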