Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1949 |
Symbol | |
ID | 7084417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2192659 |
End bp | 2193630 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698974 |
Product | protein of unknown function DUF534 |
Protein accession | YP_002355596 |
Protein GI | 217970362 |
COG category | [R] General function prediction only |
COG ID | [COG2984] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.325569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTCC CCCTGTCCCG CAAGGTCCTG TTCGGCGCGC TGGCGCTCGC GTTCGCGATG GCGATGCCGG CGTACGCACA GCAGAAGTCG GTCGCCATCA CCGCGATCGT CGAGCATCCC GCGCTCGATT CGGTGCGCGA CGGGGTCAAG GAGGCGCTGG CCGCGGCCGG CTACGAGGAC GGCAAGAACC TGAAGTGGCA GTATCAGAGC GCCCAGGGCA ACACCGGCAC CGCGGCGCAG ATCGCACGCA AGTTCGTCGG CGACAAGGCC GACGTGATCG TCGCGATCGC CACCCCCTCG GCGCAGGCCG TCGCGGCCGC GACCAAGGAC ATCCCGCTGG TGTTTTCGGC GGTGACCGAT CCGGTGGTGG CCAAGCTGGT GCCCTCGATG CAGCCCTCGG GCACCAACGT CACCGGGGTG TCGGATGCCC TCGAGCTCGG CAAGCAGGTC GAGCTGATCA AGCGCGTCGT GCCCGCCGCC AAGCGTGTCG GCATCGTCTA CAACCCGGGC GAGGCCAACT CGGTGGTCGT GGTCGAGCAG CTGCGCGAGC TGCTGCCCAA GCACGGCCTG AGCCTGGTCG AAGCCGCGGC GCCGCGCTCG GTGGACGTCG GCTCGGCGGC GCGCAGCCTG ATCGGCAAGG CCGACGTGTT CTACACCAGC ACCGACAACA ACGTCGTGTC GGCCTACGAG GCCCTGGTCA AGGTGGGCAT GGACGCCAAG ATCCCGCTCG TGGCCGCCGA CACCGACAGC GTGGCGCGCG GTGCGGTCGC GGCCTACGGC ATGGACTACA AGGCGCTCGG CGTGCAGACC GGCGAGATCG TGGTTCGCAT CCTCAAGGGT GAGAAGCCCG GTGCGATCGC CTCCGAGACC AGCAACAAGC TGTCGCTGCA GGTGAACCCG GCTGCGGCGC AGAAGCAGGG CATCACCCTG GCGGAAGACC TCGTCAAGTC GGCTGCCAAG GTCGTCCAGT AA
|
Protein sequence | MSFPLSRKVL FGALALAFAM AMPAYAQQKS VAITAIVEHP ALDSVRDGVK EALAAAGYED GKNLKWQYQS AQGNTGTAAQ IARKFVGDKA DVIVAIATPS AQAVAAATKD IPLVFSAVTD PVVAKLVPSM QPSGTNVTGV SDALELGKQV ELIKRVVPAA KRVGIVYNPG EANSVVVVEQ LRELLPKHGL SLVEAAAPRS VDVGSAARSL IGKADVFYTS TDNNVVSAYE ALVKVGMDAK IPLVAADTDS VARGAVAAYG MDYKALGVQT GEIVVRILKG EKPGAIASET SNKLSLQVNP AAAQKQGITL AEDLVKSAAK VVQ
|
| |