Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2231 |
Symbol | |
ID | 7083663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2514211 |
End bp | 2515395 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643699251 |
Product | CRISPR-associated protein, Cse4 family |
Protein accession | YP_002355867 |
Protein GI | 217970633 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTCC CGCGCTTCAT CCAGATCCAC ACCCTGCACA CCTACCCCGC TGCCCTGCTC AACCGTGACG ATGCCGGACT CGCAAAACGC CTCCCCTACG GCGGCGCGAT CCGAACGCGC ATTTCCTCAC AGTGCCTCAA GCGCCACTGG CGCGTCGCCG ACGACGCGTT TTCGCTGGCG AAACTGGGCG TGCCGATGGC CACGCGCACA CGCTACGTCG CCGAATTGAT TCGCCAACGC CTTATCGAGC AAGGCATCGA CGAAGCGCGC GCCTACGCTA CCGCCGAGGC CCTGCTTGAG GCCCTGTTCG GCGAGAAGGC CGACAAGAAG AAGGAAGGCG TCAAGGCACT TCAAACCGGG CAGGCGGTGC TCTTCGGCAA CGAAGAAATC GCTTACCTTG CGCGCCGCTG CCGAGACATC ACTGGCGACT TTTCCGATCC AGTCGCGCTG AAGGCAGAGG TGGCGAAGTT CCTCAAAGAG GAAAAGAAGA ACATCGAGGC GATGAAGCTC GGCAGCGGCC TCGAATCGGC TCTCTTCGGT CGCATGGTTA CCTCTGACCT GCTTGCCAAC CGCGACGCAT CGGTGTCGGT CGCCCACGCC TTCACGGTGC ATGAAGCGCA GGTCGAAAAC GACTACTTCA CCGTAGTCGA TGATTTTGCC CAGGCGGAAG ATGGTGCGGG CTCGGCCGGC ATCTTCGATA CCGAACTCGC CTCGGGGTTG TACTACGGAT ACGTGGTTAT CGACGTGCCG CAACTCGTTG CAAACCTCGA GGGCATCAAA GTCGAGGATG TCTTCACGAT CGGGGCCGAC AAGCGTGGTT TGGCCGGCAA GGTCGTCCAA CATTTGCTGC ACCTTATCGC CACCGTGAGC CCCGGCGCCA AGCGTGGATC CACTGCACCA TACGACTGGG CAAAGTTCGT CTTGGTCGAG GCCGGTGACT GGCAACCGCG CAGCCTTGCA GCAGCTTTCC ACGATCCAAT ACCGCTCAAG GGCGACTCTT CGATCCGTGG CCGCGCCGCT AGCAAACTGG CCAAAGAGAT CGCGGCCTTC GACGCAGCAT ACGGAATGCC TACGGCGCGC CGGTTCCTGT CGCTGGACGA GTTGGCTGTT CCCGCCGCGG AGCGCGCGAC GCTCTCACAA CTGGGTGAGT GGATCGCACA AACCGTTCGC GACGGCGCGT GCTGA
|
Protein sequence | MSLPRFIQIH TLHTYPAALL NRDDAGLAKR LPYGGAIRTR ISSQCLKRHW RVADDAFSLA KLGVPMATRT RYVAELIRQR LIEQGIDEAR AYATAEALLE ALFGEKADKK KEGVKALQTG QAVLFGNEEI AYLARRCRDI TGDFSDPVAL KAEVAKFLKE EKKNIEAMKL GSGLESALFG RMVTSDLLAN RDASVSVAHA FTVHEAQVEN DYFTVVDDFA QAEDGAGSAG IFDTELASGL YYGYVVIDVP QLVANLEGIK VEDVFTIGAD KRGLAGKVVQ HLLHLIATVS PGAKRGSTAP YDWAKFVLVE AGDWQPRSLA AAFHDPIPLK GDSSIRGRAA SKLAKEIAAF DAAYGMPTAR RFLSLDELAV PAAERATLSQ LGEWIAQTVR DGAC
|
| |