Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1625 |
Symbol | |
ID | 7084835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1820287 |
End bp | 1821309 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643698645 |
Product | protein of unknown function DUF1555 |
Protein accession | YP_002355276 |
Protein GI | 217970042 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00082408 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCA AGAAAACGAC GCTTGCGACG GCGCTGATCG GTGCGCTCTC CATGGGCTTC GCCGGACAGG CTTCGGCCAG CGTGTATGCC GGTTCGTCGC TGAACCTGAG CAATCTGACG ATCGGTTTCA TCAACGCTTC GACCGGTGTG GTGATCACTA ACCTGAATCC CGGCTACACC TTCAACGTCG AAGATTCCGC AACGCTGAAC GGAAGTACAG TCGCCGACGC CGATGCTTGC AGCAAGCTGC TTCTCAACTG CGGTGTATCG CCGGTTCTGA AGGTGAGTGC GGTAAACGCA CCCGGTAGCA CGGTGATTCG CGCCGACAAT GACTACTCGC TGTTTGGCCA GAACAGCGGC ACCTTCGCCA ACGCGAATGC TGAGATCACG TCCGCGCAAC TCGTGAATGG AACGCCGTCT TCGACGAAAC AGGTCGCCGA GGCCGAGTTG CAGATCTCGG GCCAAGGGCA AGCCTCGACC AACATTCAGT CGAATACTAC TTGGACCTTC TCGTTCTCCA TCGGTGAAAC GGCGAACATG GTGCTGAGCT TCCTGGCCAA TCCGGAGATG AAGGCCGACG TGACGTTGGT TCCGCCCTAC ACGGCGGGTA ATGCGCAAGC GAACATGTCT GCGGACTTCA CGCTTCGCAG GATCAGCCAA ATTTCAGGAG ATGCGGCCAC GCTGGCGCCG TTCGTGAACT GGACTCCGGA CGGTGTGAAC ACGAACGCGG TTTGCGTGAA TGTCGGTTCG TGCGTTGATG TGGACCCTGA AAGCCTGAAC GAGACCATGG GTGTCGGACC GGGCAACTCC ACCGTGACGC ATAGCCTCGG GACCGCCGGC AGCAACTACT CGCTGACCAT CACCGGTCTT ACGCGTGGCA ACTATTCGCT GACCCTCGCC GGCTTGACCA GCGTCAACGT CACGCAGGTT CCGGAACCTG GCACCCTCAT GCTGCTGGGT GGCGCGCTCG CCGCGCTCGG CTTCGGCGGA ACGCGTCGTC GCAGCCAAGC GACGGCGGCC TGA
|
Protein sequence | MNIKKTTLAT ALIGALSMGF AGQASASVYA GSSLNLSNLT IGFINASTGV VITNLNPGYT FNVEDSATLN GSTVADADAC SKLLLNCGVS PVLKVSAVNA PGSTVIRADN DYSLFGQNSG TFANANAEIT SAQLVNGTPS STKQVAEAEL QISGQGQAST NIQSNTTWTF SFSIGETANM VLSFLANPEM KADVTLVPPY TAGNAQANMS ADFTLRRISQ ISGDAATLAP FVNWTPDGVN TNAVCVNVGS CVDVDPESLN ETMGVGPGNS TVTHSLGTAG SNYSLTITGL TRGNYSLTLA GLTSVNVTQV PEPGTLMLLG GALAALGFGG TRRRSQATAA
|
| |