Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3284 |
Symbol | |
ID | 7874182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3599380 |
End bp | 3600807 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700218 |
Product | hypothetical protein |
Protein accession | YP_002890256 |
Protein GI | 237653942 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTGT CGAAGAAGCT GCCTTGTCTG ACCGTTCTTG CCCTTGTCCT GGGGGCCCTC TCCGTGCCGG CGACTGCGGC GGGGCCGCAG CTCGCGCCAA AACCGGTGTC GGTGGAGGTG GAAGAAGGGG AGCCCGCTGC CGAAGAACCC GTGACGGACG CGGACGCGGG TGAGGCCGAC GCCGAACTCG ACGCGCTGCT CGACGTCGGA GCGCCGGCGT CCGGTGCAGA GGGCGCGGAC GGCCTCAAGT GGAACGGTTA CACCGAACTC GGGGCGGCAT ACACATACCG GGATCCGGAG CGCTGGTCGC GCCTGCGCGC GCGTGGGGAA CTCGGTGTCG CCGGGCAGCT GGCGCCGCGG GCGCGATGGA AGCTGAGCGC GCGCGCCGAG GCCGACGGTG CTTTCGACCT CGAGGACGAA CATTATCCGG CCGCCGTGCG GCGCGACCGC CGCACGGATT TCGTGTTGCG CGAGGCTTAT GTCGATCTGG GCCATGGCGA CTGGGAATTC CGTGCCGGGC GGCAGCAGAT CGTATGGGGC GAGATGGTCG GCTTCTTTTT CGCCGACGTC GTGTCCGCAC GCGACATGCG GGATTTCCTG CAGCCGGAAC TCGAGGGCAT GCGGATTGGG CAGTGGGCGC TGCGCGCCGA GCATTTCGGT GCCGAAACGC ACCTAGAGTT CCTGTGGGTG CCGAAGCCCT CGTTCGACGA GATCGGCGAG CCCGGCGACG ATTTCTTCGT CTTCCCGTGG TTGCCTGCAG GTACGGTGCT GGACGAGGAT CGTCCGGGGA AGGATCTGGA TGGCTCGAAC TGGGGTGCGC GCGTCTCGCG ACTCGTCGAT GGGTGGGATC TCAGCGCCTT CTATTACCGC AGCTACGATG TCTCGCCGAC GCTATACGCC CTGAACCAGG GTGTGGCCCG CCTGCGACAC GACCGGATCG GGCAGCTTGG CGGCACTTTC AGCAAGGATC TCGGCAGCTT CGTGCTGAAG GGCGAGGCGG TCCATACGCA CGGTCGCAGC CTGAATACGT TCTCGAGCGG TCCGGGCCTC TCCATCGGGC TGCTGCCGAC CGACATGGTC GACTATGCGC TCGGGGTCGA TGTGCCCGCG GGCGACTGGC GCTTCAACGT GCAGTACTAC GGGCGCTGGC TGGAGGAGCA TGTTCCGGCC TTGATGGCCG ATCGGCACGA GCAGGGCGTC ACCTTGCAGG TCGTTCATGG CGCGGGCACG AACCTGGAGG CGGAAGTCCT CGCGCTGTCC AGCATTAACC GGTCGGATCA CCTGATTCGA CCGAAGCTGA CCTGGAAATT CGCCCCGGCA TGGAGGCTCG TCGGTGGCGT CGACGTCTTC GGCGGGAATG GAAGGGGGTT TTTCAGCCGT TACGATCGGA ACGATCGTGT CTATCTCGAA CTGCGCCACA TGTTCTGA
|
Protein sequence | MPLSKKLPCL TVLALVLGAL SVPATAAGPQ LAPKPVSVEV EEGEPAAEEP VTDADAGEAD AELDALLDVG APASGAEGAD GLKWNGYTEL GAAYTYRDPE RWSRLRARGE LGVAGQLAPR ARWKLSARAE ADGAFDLEDE HYPAAVRRDR RTDFVLREAY VDLGHGDWEF RAGRQQIVWG EMVGFFFADV VSARDMRDFL QPELEGMRIG QWALRAEHFG AETHLEFLWV PKPSFDEIGE PGDDFFVFPW LPAGTVLDED RPGKDLDGSN WGARVSRLVD GWDLSAFYYR SYDVSPTLYA LNQGVARLRH DRIGQLGGTF SKDLGSFVLK GEAVHTHGRS LNTFSSGPGL SIGLLPTDMV DYALGVDVPA GDWRFNVQYY GRWLEEHVPA LMADRHEQGV TLQVVHGAGT NLEAEVLALS SINRSDHLIR PKLTWKFAPA WRLVGGVDVF GGNGRGFFSR YDRNDRVYLE LRHMF
|
| |