Gene Information |
Locus tag | Tmz1t_2234 |
Symbol | |
ID | 7083666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2516835 |
End bp | 2517728 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643699254 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002355870 |
Protein GI | 217970636 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
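Note on coordinates and lengths: the Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the CDS is taken to include its stop codon. Both readings are assumptions inferred from the numbers, not something the record states. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop_codon else codons


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(2516835, 2517728)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the sketch gives 894 bp and 297 aa, matching the Gene Length and Protein Length fields.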
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAG GCCGCCTTGG GCTTGAGACC GCACGCATCC CCCACGCTGA CCGGCACGGC CTGCTATGGC TCGAGCGCGG AGAGTTGTGT GTGGTCGACG GCTGTCTGCA CTTCATGGCG GGAAAAGACA GCCTCACGCC CTACATCGAT CAGATCCCGC ATCAAGCAGT GTCGATGATC CTGCTGGGAC CAGGCAGCAG CGTCACCCAC GACGCCCTGC GCCTGCTCGC CCGACATGGC ACGCTGATGG CGGCGGTTGG CACGGATGGC GTACGCAGCT ACACCGCGCC AGCCCTGCTG CCCGACCGCT CGGACGTCGC GCGCCGACAA GCGGAGCTGT GGGGCAATCC GCGCCGGCGC ATTGCCGTCG CCCGCCGCAT GTACGCCCTG CGCCTGGGCG AAATATTGCC TCACCGTGAT CTGGACACGC TGCGTGGCAT CGAAGGCTCG CGGGTGAAGA CGCTTTACCG GCTAACCGCC GAGCGCTACC GCATTCCCTG GAACGGTCGC CATTACGACC GCGCTGCGCC CGAAGCGACC GATACACCGA ATCAGGCAAT CAATCATGCG GCCACTGCTG TGCAGGCCGC GGCGGCGATC GCAGTACAGT CGCTGGCTGC AATTCCGCAG CTTGGCTTCA TTCACGAGGA CTCGGGTCAG TCCTTCGTTC TCGACATTGC CGATCTGTTT CGCGATTCGA TCACGTTGCC GATCGCATTT GCCGCGGCAC GCAAGGCACT CGACGGTGCG CCAGACACTA TCGACCGATT GGTGCGACGC GAGGCGGCAG CGGTTTTCAG GAAGCAGTCC GTGATTCCAA CAATGATCGA CAAGATCAAG GCTGTGTTGC GCATGGAGGA AGTAGATGCC GCTGGTGGTG ATAGTGACGC GTGA
|
Protein sequence | MLKGRLGLET ARIPHADRHG LLWLERGELC VVDGCLHFMA GKDSLTPYID QIPHQAVSMI LLGPGSSVTH DALRLLARHG TLMAAVGTDG VRSYTAPALL PDRSDVARRQ AELWGNPRRR IAVARRMYAL RLGEILPHRD LDTLRGIEGS RVKTLYRLTA ERYRIPWNGR HYDRAAPEAT DTPNQAINHA ATAVQAAAAI AVQSLAAIPQ LGFIHEDSGQ SFVLDIADLF RDSITLPIAF AAARKALDGA PDTIDRLVRR EAAAVFRKQS VIPTMIDKIK AVLRMEEVDA AGGDSDA
|
| |
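Note on the sequence layout: the gene and protein sequences above are printed in space-separated groups of ten characters. A minimal sketch of how such a string could be joined back into one sequence and how a GC fraction could be computed from it is shown below. The function names are hypothetical, and the record does not say how its GC content field was derived (for example, how it was rounded), so the result on the full gene sequence may not match that field exactly.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character groups and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two groups of the gene sequence in this record, used as a small example.
example = clean_sequence("ATGCTGAAAG GCCGCCTTGG")
print(len(example), gc_fraction(example))
```

Passing the complete Gene sequence field through `clean_sequence` first and then through `gc_fraction` applies the same calculation to the whole gene.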