Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2015 |
Symbol | |
ID | 7083771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2277947 |
End bp | 2279053 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699040 |
Product | CBS domain containing membrane protein |
Protein accession | YP_002355661 |
Protein GI | 217970427 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3448] CBS-domain-containing membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00292892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTCG CCCGACTGTT CTCCCTGCCT GCCGCCTTGT CTGCGGGGCG GCGCGAGATG GCACTGGGGA CGCTGGGGGT GGCGGTAGGG TTGGGTTGCA CCGAGTGGAT CGGCTTCCTC GCGCTGGGCG GGCAGGAACC GTGGTTCATC CTCCCGATGG GGGCGTCGGC GGTGCTGCTG TTCTGTGTCC CGGCGAGTCC GCTCGCGCAG CCCTGGCCAA GCCTCGTCGG CAACCTGGTG TCGGCGCTGA TCGGGGTAGG CTGCTATCGC TGGCTGGGCG AGACCGGGAT GGCCGTCGCG CTCGCCGGCT GCCTGGCGGT GGGGGCGATG TTCCTGCTGC GCTGCCTGCA TCCGCCGGGC GGAGCGGTGG CGCTGACGGC GGTGCTGGGT GGTCCGATGG TGCATGAACT CGGCTACGCC TTCGCGCTGA TGCCCGTGCT CGTCAATACG CTGGCGATGC TGGTGGTCGC CTTCCTGTTC AACAACCTCG TGGGACGGCG CTATCCGCAC CTTGCGCCCG CGCGTGCGCA GGCGCACGGC ACGGCCGATC CGCTGCCGAG CAAGCGCGTC GGCTTTCGTG CGGAGGACCT CGACGCGGCG CTGGCCTCTT TCGGCGAGGT GCTCGACGTC GATCGCGACG ACCTCGAGGA GATCATGGTG CGCGCGCAGA TGAACGCGCG CCGGCGCACC TGGGGCGCGC TGCGCTGCGC CGACATCATG TCGCGCGACG TGGTGAGCGT GGGTCCGCAG GCCCCGGTGG GCGAGGCCTG GGCGCTGCTC GCGCACCACC GCATCAAGGC CTTGCCGGTG GTGGAGGAGG GCGGCCGGCT GGTCGGCATC GTATCGGTGC CGGACTTCTT CATCGACCGC CACAACCCCG AGCCGCAGCC GGTGCCGCGC ATGCGCACCG CCCGCGTGGT CGCCGAGATC ATGAGCGGGC GCGTGCACAG CGCCCGCCCC GGGCAATCGC TCGCCGACCT GGTGGGGGCC TTCTCGGACG GCGGCCTGCA TCACCTGCCG GTGGCCGACG AGGACGGCAG GCTGGTCGGC ATGATCACCC AGTCGGACGT GGTGGCGGCG CTCTTCGCCG GCGAGCGCGG GGCTTAG
|
Protein sequence | MSLARLFSLP AALSAGRREM ALGTLGVAVG LGCTEWIGFL ALGGQEPWFI LPMGASAVLL FCVPASPLAQ PWPSLVGNLV SALIGVGCYR WLGETGMAVA LAGCLAVGAM FLLRCLHPPG GAVALTAVLG GPMVHELGYA FALMPVLVNT LAMLVVAFLF NNLVGRRYPH LAPARAQAHG TADPLPSKRV GFRAEDLDAA LASFGEVLDV DRDDLEEIMV RAQMNARRRT WGALRCADIM SRDVVSVGPQ APVGEAWALL AHHRIKALPV VEEGGRLVGI VSVPDFFIDR HNPEPQPVPR MRTARVVAEI MSGRVHSARP GQSLADLVGA FSDGGLHHLP VADEDGRLVG MITQSDVVAA LFAGERGA
|
| |