Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1578 |
Symbol | |
ID | 7084782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1755224 |
End bp | 1756345 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698595 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002355232 |
Protein GI | 217969998 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.373133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACC AGGTCGCGGA GAAGGTCGTC CGCCGCGTCA TCCTCGGCTT CCTGCTCGGC GGCCTGCTGT TGCTGAGCTA TGCGGTGCTG CACCTGTTCA TCGTGCCGGT GGCGTGGGCG GTGATCATCG CCTATGCCAC GTGGACGCCT TACCGCCAGC TGCGCTCGCG GTTACCGCGC TATCCCACGA TCAGCGCGCT GCTGATGACG CTTTTGCTGA GCGCCGCCTT CGTGCTGCCC GCGCTGTGGA TCGGCATGCT GCTGCGCACC GAGGTCGGTG TCGCGATCGC CGCGGTGACC GCGCAGATCC GCGAGGGCGC CTTCGTGCTG CCCGAGTTCG TCCGCACGCT GCCGTGGATC GGCGACGACC TGCAGGCGAT GGTGGGCGAG CTCACCCGCG AGCCGGAGGC GCTGCGCGCG CAACTCACCG AGTGGGTGCG CCAGGGCAGC GACCTCGCGC TGACGCTGAT CGGCGACGTC GGGCGCAACG CGGCCAAGCT GGGTTTCGCA CTCATCACCG TGTTCTTCCT CTACCGCGAC GGCGAGCGCG TGCTCGAGCA GGTGGTGGCG GTGCTGCGCC GCTTCCTCGG CGAGCGCCTG GACCCCTATC TCTCCGCGGT GGGCGGGATG ACCAAGGCGG TGGTGTGGGG CCTGATCGCG ACCGCGATCG GCCAGGGTTT CGTCGCCGGG CTGGGCTACT GGTGGGCGGG CGTGCCGGCA CCGGTGCTGA TGGGGGCGAT CACCGCCGTG ATCGCGATGA TCCCCTTCGG CACGCCCTTC GCGTGGGGCT CGATCGGCGC CTGGCTGCTG CTCACCGGCA ACACCGTCGA GGGCATCGGG CTGCTGCTGT GGGGCGCGCT GGTGGTGAGC TGGGTGGATA ACCTGGTGCG TCCGCTGGTG ATCAGCAATG CCACCCGCAT CCCCTTCCTG CTGGTGATGT TCGGCGTGCT CGGCGGGCTG TCCGCGTTCG GGCTGGTCGG CCTGTTCCTC GGGCCGGTGG TGCTGGCAGT GCTGATGGCG GTGTGGGGCG AGTGGCTGGA GGAGTCGGAG TTCGCGCGCC TTGCGGCGAT CGGAGCGCCC GCGGGCGCGA GGGAGGAGGT GGAGCGCGGC GGTCGGTCCT GA
|
Protein sequence | MIDQVAEKVV RRVILGFLLG GLLLLSYAVL HLFIVPVAWA VIIAYATWTP YRQLRSRLPR YPTISALLMT LLLSAAFVLP ALWIGMLLRT EVGVAIAAVT AQIREGAFVL PEFVRTLPWI GDDLQAMVGE LTREPEALRA QLTEWVRQGS DLALTLIGDV GRNAAKLGFA LITVFFLYRD GERVLEQVVA VLRRFLGERL DPYLSAVGGM TKAVVWGLIA TAIGQGFVAG LGYWWAGVPA PVLMGAITAV IAMIPFGTPF AWGSIGAWLL LTGNTVEGIG LLLWGALVVS WVDNLVRPLV ISNATRIPFL LVMFGVLGGL SAFGLVGLFL GPVVLAVLMA VWGEWLEESE FARLAAIGAP AGAREEVERG GRS
|
| |