Gene Information |
Locus tag | Tmz1t_1621 |
Symbol | |
ID | 7084831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1815565 |
End bp | 1816779 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698641 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002355272 |
Protein GI | 217970038 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
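The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (both assumptions are consistent with the values listed here, but the record itself does not state them). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return length_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(1815565, 1816779)
print(length, protein_length_aa(length))  # prints: 1215 404
```

Both results agree with the "Gene Length" and "Protein Length" fields above.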
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTCA GACTGGCCCT GCGCAACCTG CTGCGGCAGA AGGTCCGCAG CGCGATGACG CTTGCCGCGA TCGTGTTCGG CGTGGCGGGC CTGATCGTGT CGGGCGGATT CATCCAGGAC ATCTTCCTCC AACTCGGCGA GGCGATCATC CATTCGCAAA CCGGCCACGT GCAGGTCTTC CGCCAGGGCT TCATCGAACG CGGCACGCGC CAGCCCGAAC GATTCCTGAT CGAACGCCCG CGCGAAACCG CAACGAGAAT CGAGGCCCTC CCCGAGGTCG ACGAGGTCGC GCTGCGCCTG AGCTTCGCCG GGCTGCTGAA CAACGGCAAG CGCGACCTCG GCATCGTCGG CGAGGGCATC GAGCCCGGCA AGGAGGCACG ACTCGGCAGT TTCCTCCAGA TCCTGTCGGG CCGCGCGCTA CGCGAGGACG ACGTGTTCGC GATGGTCGTC GGCCAGGGCG TCGCCCACTC GCTGAGCCTC GCGGTGGGCG ATCGGGTGAA CCTCGTCCTG AGCACGGCCG AGGGCGCGAT GAACATGCTC GACTTCGAGA TCGTCGGCAT CTTCCAGAGC TTCTCCAAGG AGTTCGACGC GCGCGCGCTC CGCATACCGC TCGGCGCGGC GCAGGAGTTG CTCTTCACCG ACGGCGCCAA TCTGCTGGTC GCGACCCTTC ACCGCACCGA GGACACCGAC CTCGCCCACG CCCGCATCGC GCAGGCGGTC GCCGGCAGCG AGCTCGAGGC CCGGCACTGG CGCATGCTTT CGGATTTCTA CGAAAAGACG CTGGCCTTGT ACGAACGCCA GTTCGGCGTG CTGCAGGCCA TCATCCTGGT GATGGTGCTG CTGTCGGTCG CGAACAGCGT CAATATGACA GCCTTCGAGC GCCTGTCCGA ATTCGGCACC CTGCTGGCGC TCGGCAACCG GAATGCCAGC ATCTTTCGTC TGATCCTGAT CGAGAACATC CTCCTCGGAT TGATTGGCGC CGCCCTCGGC ACGCTGATCG CGCTGGGCAT CGCACTTGCG GTGTCCGCCG TCGGCATCCC GATGCCGCCG CCGCCCAACT CGAACGTGGG CTACACCGCC ATGATCCGGA TCGTACCGGC GACAATCGCC TCCGCGTTCA TGATCGGCCT CCTCGCCACC GCGCTCGCAG CCTTGCTTCC GGCACGGCGG ATCTCCCGCA TTCCGGTCGT CAGCCTGCTC CGTAACGGAA ATTGA
|
Protein sequence | MIVRLALRNL LRQKVRSAMT LAAIVFGVAG LIVSGGFIQD IFLQLGEAII HSQTGHVQVF RQGFIERGTR QPERFLIERP RETATRIEAL PEVDEVALRL SFAGLLNNGK RDLGIVGEGI EPGKEARLGS FLQILSGRAL REDDVFAMVV GQGVAHSLSL AVGDRVNLVL STAEGAMNML DFEIVGIFQS FSKEFDARAL RIPLGAAQEL LFTDGANLLV ATLHRTEDTD LAHARIAQAV AGSELEARHW RMLSDFYEKT LALYERQFGV LQAIILVMVL LSVANSVNMT AFERLSEFGT LLALGNRNAS IFRLILIENI LLGLIGAALG TLIALGIALA VSAVGIPMPP PPNSNVGYTA MIRIVPATIA SAFMIGLLAT ALAALLPARR ISRIPVVSLL RNGN
|
| |
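The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one in the "GC content" field. It is illustrative only: the helper names are hypothetical, and the demonstration uses just the first two blocks of the gene sequence rather than the full 1215 bp, so its result is not expected to match the 66% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the "Gene sequence" field, used here as a small example.
first_blocks = "ATGATCGTCA GACTGGCCCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))  # prints: 20 55.0
```

Passing the complete "Gene sequence" value to `clean_sequence` and then `gc_percent` would give the figure for the whole gene.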