Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0018 |
Symbol | |
ID | 7085116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 24814 |
End bp | 26631 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697068 |
Product | hypothetical protein |
Protein accession | YP_002353717 |
Protein GI | 237653094 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCAC TCATCGAGCT TTTCCCGCCC GAGCCGACCG AAGGACCGAT CGAGTATCTG CGCCGGCTCT CGGTTCGCAA CGGCTATAGC AATTGGCGCA GTCTCGTCCG CGCCACGGGC GTGAATCCCA CATTGAACGC ACTTTGGAAG AACAAGCCCA CGCTCATCGA CGCGCTCGGC CTCGATCCTG CGTGGCTGGA TGGGGTCATG CCGCCATCGG CCGAAGGCCG AGGCCTGCAC GACCCCTTCT TCCGCCGTAC CGCCACGGAT CCCGTCTGCC CCCAGTGTCT GGCCGAAGGC GATCATCTGC ACCACGCCTG GTCGCACAGC TTGGTGAGCG CGTGCCCCGA CCACGGCACC GCGCTCATCG ACACGTGCCC ATCGTGCGAC CAACCCTTGT CACCGGATCG GTTCGACCTC GCGACCTGCG ACTGTGGGTA CCCCCTGCAC GAAGCAATCT CGCCGTCGGC GAGCCCGTTC GAGCGCTGGG TCAGTGCACG CCTTGCCGGC GATCTGCGGC CCGTTGACGG ACTACCGGAG ATCGGCACCC CGGAGGACTA CAGCGGCCTC GGCAAGCTGC TGTTCTCGCT CGCCATCAGG CTCGACCCAA GCGCGAAGGT TAAGGCCGGC AAAACGTCCC GGCCCCATGA CCTGGCGCAG ACACGAGCGC TCATCGAGTC GATCTGCCCG CTGTTCGAAG CGTGGCCTCA AGGCTTCAAC ACCCATGTCC GCGATCGGCT CGTGGCGGGT AATCAGGCCG TCTTCAGCCC CTCTGGCCGG CTCGGCGCCT GGTACATGAA TCTTCATGTT GCGTGCCGCA AGGCAAAGGC ATTTGCACCC TTGTGGACCG CGTTCTCGGA CGCGATGATC GACCATTTCG ACGGCAATCT GCGCGGTCAG CAAGTGCTGA CCCCGTCGCC GGAGCGGGAG CGCCGCTTCG TCCCTGTTGC CGAGGCGGCC CGGCTGATCG GCGTGAGCGG GCCCAAGCTC GGCGCCGCGC TGAAGGCCGG GATTGTGGCC GGCCACGTCA GCAAACAAGG CACAGCCTAT ACGCTGGCGC TCATGGACCG CGAAGAGGTC GAGCGCATCC GGACCGAACG CGCGCGCTGG ATCACGGCCA ACGACGCGGC CGTTGCTGCC GGCGTACCGC CCTCGGTCAT CGCACACCTT GCCGAGGCCG GCCTGCTCAA GAACGATGAC GAGTGGTCAC AAGACATCCT CAAGAGCGGG CCAATCGAAA AGGCCAGTGT TGAAACACTT CTCGAACACC TGAAATCCCT TGTGGAGCCC CGCGAGGTCG CAGAAACGCT GCGCTTCGGA CAGCTCATCG GACGCCGCAC GACGGACAGC AAAGGCATCC GCAAGCTGTA TCAAGCCATC CATTCGGGTG AAGTCCGTGC GACCGGTTAC ACGCCCGGTT GCGGCCTGGA CGGGCTCACC TTCGCCACCG AGGAAATCCG CCGCTACCTC GGATCCGTCG GGCTGACCGA CGGCATGACG CTGACGCAGC TCGAGAAGGT GACCGGCTGG AAATATGAAG CGCTGAGCCA CTGGGCAGAG AGAGGTCTGC TGAAGACGAT CGACGTGCTG TTGCAGGGGC GCTCGGCCCG CCTCGTCACG AACGCCGCCC TGGCGGATTT TCGTCGCGTG TGGATCCCTG TGTCCGATCT GGCCCAGGCA ATGGGCACGA AGTCGTCTGC GCTGACTGCG AAGATCACAC AGCGCGGCAT CCCGATCCAC GGCCAACTCG CGCTCCCGTC CGGGGCAAAG CGGGGCGGGC TCCTGCAACT CGCGGATCTC GGCGCCCTGA TTGGTTAG
|
Protein sequence | MTALIELFPP EPTEGPIEYL RRLSVRNGYS NWRSLVRATG VNPTLNALWK NKPTLIDALG LDPAWLDGVM PPSAEGRGLH DPFFRRTATD PVCPQCLAEG DHLHHAWSHS LVSACPDHGT ALIDTCPSCD QPLSPDRFDL ATCDCGYPLH EAISPSASPF ERWVSARLAG DLRPVDGLPE IGTPEDYSGL GKLLFSLAIR LDPSAKVKAG KTSRPHDLAQ TRALIESICP LFEAWPQGFN THVRDRLVAG NQAVFSPSGR LGAWYMNLHV ACRKAKAFAP LWTAFSDAMI DHFDGNLRGQ QVLTPSPERE RRFVPVAEAA RLIGVSGPKL GAALKAGIVA GHVSKQGTAY TLALMDREEV ERIRTERARW ITANDAAVAA GVPPSVIAHL AEAGLLKNDD EWSQDILKSG PIEKASVETL LEHLKSLVEP REVAETLRFG QLIGRRTTDS KGIRKLYQAI HSGEVRATGY TPGCGLDGLT FATEEIRRYL GSVGLTDGMT LTQLEKVTGW KYEALSHWAE RGLLKTIDVL LQGRSARLVT NAALADFRRV WIPVSDLAQA MGTKSSALTA KITQRGIPIH GQLALPSGAK RGGLLQLADL GALIG
|
| |