Gene Information |
Locus tag | Tmz1t_0682 |
Symbol | |
ID | 7083911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 764633 |
End bp | 765769 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697708 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002354350 |
Protein GI | 217969116 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
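The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that convention) and that the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(764633, 765769)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values match the Gene Length and Protein Length fields of this record.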
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.58448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACCCT GGATCGAAAG GCAGCGTCAC CTGATTGACT TCACCGCGCA GTCGCTCGCG CGGCGCAAGG GCAAGAGCCT CGGGCTGCTG TTCGTCTATA CGCTGCTCGT GTTCGTGCTC GCCTCGGTGG CGCTGTACAC CCACGCGCTG CGCAACGAGG CCACGCGGGT GCTGGCCGGC GCGCCGGAGA TCGTGCTGCA GCGCCTCATC GCCGGCCGCC ACGACCTGGT GCCGCCGGGC TACATCGAGC GGATCGGCCG CATCCGCGGC GTGCAGAAGA TCGAGGGCCG GCTGTGGGGC TACTACTACG ACAGCGTGCT GAAGGCCAAC TACACCTTCA TGGTTCCGGC CGATCGCGAG ATCGTCCCGG GCGAGATCGT CGTCGGCCCG GCGCTGGAGC GCACGCGCGG GCTGTCGGCG GGCAACGGCA TCTCCTTCCG CGCATATTCC GGCGAGTTGC ACACCTTCAT CGTCGCCGCA GTGCTGGCGC ACGAGTCCGA ACTGGTCAGC GCCGACCTGG TGCTGATGAA CGAGGCCGAC TTCCGCCGCT TCTTCGCCTA TCCGGACGGC CACTACACCG ACATCGCGCT GTGGGTGGCC AACCCGCTCG AGGTGCGCAA CGTCGGTGTC AAGCTGCTCG GCACCCTGCC CGACTCGCGC CCGATCCTGC GCGAGGAGGT GCTGCGCACC TATGCGTCGA TCTTCGACTG GCGCGAGGGC ATGATGCTGG CGCTGCTGTC GGCCGCCATC CTCGCCTTCG GCATCTTCGC CTGGGAGAAG GCTGCCGGCC TGTCGGCGGA AGAAAAACGC GAGATCGGCA TCCTCAAGGC GATCGGCTGG GAGACCGGCG ACGTGATCGC GATGAAGTTC TGGGAAGGCT TCCTGGTCTC GCTGTTCGCC TTCCTGGTCG GCTACGTCGC CGCCTACGTG CATGTGTTCC ACTTCGAGTT CACCCTGTTC GCACCGGTGC TCAAAGGCTG GGCGGTGCTG TACCCGAGCT TCGCGCTCAC GCCGCAGATC GACGGCCTGC AGGTGGCCAC GCTGTTCGTG TTCACCGTGC TGCCCTACAC CGCGGCCACC CTGGTGCCGA TCTGGCGCGC GGCGACCACC GACCCCGACA CCGTGATGCG GAGCTGA
Protein sequence | MKPWIERQRH LIDFTAQSLA RRKGKSLGLL FVYTLLVFVL ASVALYTHAL RNEATRVLAG APEIVLQRLI AGRHDLVPPG YIERIGRIRG VQKIEGRLWG YYYDSVLKAN YTFMVPADRE IVPGEIVVGP ALERTRGLSA GNGISFRAYS GELHTFIVAA VLAHESELVS ADLVLMNEAD FRRFFAYPDG HYTDIALWVA NPLEVRNVGV KLLGTLPDSR PILREEVLRT YASIFDWREG MMLALLSAAI LAFGIFAWEK AAGLSAEEKR EIGILKAIGW ETGDVIAMKF WEGFLVSLFA FLVGYVAAYV HVFHFEFTLF APVLKGWAVL YPSFALTPQI DGLQVATLFV FTVLPYTAAT LVPIWRAATT DPDTVMRS
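The sequences above are printed in space-separated blocks of ten characters, so they need a small cleanup step before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes GC content, and translates a coding sequence with the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names are hypothetical. The gene sequence is listed here as the coding strand, so no reverse complement is applied despite the minus-strand annotation.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon.

    Assumes the sequence starts with ATG; alternative start codons, which
    table 11 reads as methionine at the first position, are not handled.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAAACCCT GGATCGAAAG GCAGCGTCAC"))  # MKPWIERQRH
```

The printed residues agree with the first block of the protein sequence in this record. Passing the full gene sequence to `gc_content` and `translate` would be one way to cross-check the GC content and Protein sequence fields.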