Gene Information |
Locus tag | Tmz1t_2770 |
Symbol | |
ID | 7873510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2999269 |
End bp | 3000165 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699692 |
Product | UspA domain protein |
Protein accession | YP_002889747 |
Protein GI | 237653433 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0325172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCGA TCCGCACCCT CGTCGCCCCC ACCGACCTCT CGGCGCTCGC CCGTCATGCC GTGGTCCGTG CCTGCCTGCT CGCGGCCGAG CTCGGTGCGC GGGTGTCGCT GCAGCACGTG GTCAATTCCG GCGCGCTCGA CACCTTGCGC CAGCTGCTCG ACACCGACTC CGTCGGGCTG CAGGAGAAGC TGCTCGAGGA GGTGCGCGGC GAGGTCGAAG CGCTCGCCGC CGAGATGCAC AAGCGCCACG GGGTGACGCC CGAGCTGCAC CTGGCGGTGG GTGGCGTGCT CGGCGAGATC GTCGCCCACG CCGAGGCGAT CGACGCCGAC CTGCTGGTGA TGGGGGCGCG CGGCGCAGGC TTCATGCGCG AGCTGCTGAT CGGCTCCACC ACCGAGCGCG TGCTGCGCAA GTCCGTGCGC CCGATGCTGG TGGTCAAGCA GATCGCCCAC GAGCCCTATC GCCGCGTGCT CGTGCCGGTG GACTTCTCGG CACGCGCGCT CGAGGCGCTG GAGTTCGCGC GGCGCGTGGC CCCGCAGGCC GAGTTCGTGT TGCTGCACGC CTTCGAGGTG CCCTTCGAGG GCAAGCTGCG CTATGCCGGC GTGGAGGAGA GCGCGCTGTC CTCGCTGCGC GTCAACGCGC GTCGCGAGGC GGGTGCGCAG ATGAACGAGC TGGTCGCGCG CGCGCGGGTG GACGAGAACC GCGTGCGCCG CATCGTGGTC CACGGCGAGG CCACGACACA GATCCTCGAG CAGGAGCAGG AACAGGATTG CGACCTGATC GTGATCGGCA AGCGCGGCCA CGGCCTGCTC GGCGAAATGC TGCTCGGCAG CGTCACCAAG CACGTTCTCG CGCGTTCGAC GGCCGATGTG CTGGTCTGCG ACCGCAAGCC CGACTGA
|
Protein sequence | MNPIRTLVAP TDLSALARHA VVRACLLAAE LGARVSLQHV VNSGALDTLR QLLDTDSVGL QEKLLEEVRG EVEALAAEMH KRHGVTPELH LAVGGVLGEI VAHAEAIDAD LLVMGARGAG FMRELLIGST TERVLRKSVR PMLVVKQIAH EPYRRVLVPV DFSARALEAL EFARRVAPQA EFVLLHAFEV PFEGKLRYAG VEESALSSLR VNARREAGAQ MNELVARARV DENRVRRIVV HGEATTQILE QEQEQDCDLI VIGKRGHGLL GEMLLGSVTK HVLARSTADV LVCDRKPD
|
| |
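The coordinates, lengths, and sequence fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method. The function names are hypothetical. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence ends in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the record for Tmz1t_2770.
    length = gene_length(2999269, 3000165)
    print(length)                  # 897, matching "Gene Length"
    print(protein_length(length))  # 298, matching "Protein Length"
    # Paste the full "Gene sequence" field here to compare with "GC content".
    print(gc_fraction("ATGAACCCGA TCCGCACCCT"))
```

Under these assumptions the computed gene and protein lengths agree with the 897 bp and 298 aa listed in the record. Passing the full Gene sequence field to `gc_fraction` gives a value to compare against the listed GC content.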