Gene Information |
Locus tag | Athe_0122 |
Symbol | |
ID | 7408484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 143801 |
End bp | 145441 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643714530 |
Product | CRISPR-associated CXXC_CXXC protein Cst1 |
Protein accession | YP_002572053 |
Protein GI | 222528171 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 |
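The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 143801
END_BP = 145441
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 145441 - 143801 + 1 = 1641 bp, and 1641 / 3 - 1 = 546 aa.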
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGGAGA GGGTTTATTT AGGTGACTGG GCTTATAATG CAGGGATTAT AGGCTTTATT GAGATTATGC TGGATGGAGA AGATATAGAT TCTCAAAACA TTATAACTAT TGGACTAAAT TACATTGAAT TTGAAAGAGA AAGTTTGCGA GGGTTTTCCG ACAAGTTTTT TAAGAAAGCT TATCAGAGGT ATCCAAGGAC AGATGAGATT ATAAATGAAG GAAAGGATTT ACTGGAACAA CTAAATAACC GAAGTGATAT AGATGAACAG CAGAGAGAGA GAATAAGAAA GTTTAAAGAC AGAGTTAATG GTTTTGCAAA ACTGAGCAGG CTTGCGAAAG AATATGGATG CAGTTTGAAT AAAAAGTTTA ATAAAAATGA AGCAGTAGAT TTTGTAAACA CGATAATAAA AATTTTGGAA GATAGAAAAC AGGAGTTTAT GGAAAACGAT GTAAAGGTTT ATTTGAACAG TGTTAGCTCA GTATATGGTG AAGCAAGCTT TCTAAATAGG CAAATTACAG AAAACCTAAA AGAAAAGTTT TACAACGACT TTGAAAAACC TATAATAGAG AAGGCCAATG AAGAGGACAA GAAGTATCCC TGTATATTTT GCGGTGAGAG AAAAGCTAAA AAAGGTGCAA TGTTTAACAC AGGGATTGTG AATTTTCTTG GAGCAAACAA AGATAACAAG AATTTCTTCT GGAACTTTAA GCCTCAGCTG CCTATATGCG AAATCTGTGA GCTTATGTAT TTCTGTATTT TTGCGGCCCT GACTGAATTT AGAGTTGGAC AAACCAAAAG GTTTTACTTT GTAGATAAAA GTACTTCTGT TTTGGAGCTA TATCAAGCAA ATAAATTGTT TATGGAAATA ATGTCAAAAG AAGAAAATTT ACTCAAAGAC AAAGGAATTT TGAACTTCAT AAATGATTAC TTATTGTTAA AACTGAGGGA AGAAAGCAAG TTTGCCCTAA CTAACGTTCT TTTTGTTGAA ATAGATCTCT CTTCAGTTGC TCCAAAGGTT TATGGCTTCA ATATATCCAA GCAAAAGGCT GAATTTGTAA CCTCCAATTA TGAATTTTTT GAAAATGTTG TGGGAACCAA AATAACTGTG AAAGACAATA CCTTGTATCC TTTCCACGAG CTTTTGCAGA GGTTTTTAAA TAATACGCTA AGCTTTCAGT TTGTATCATT TTTAGAAAGT CAGTTTATAA GCTCTAAAAA AGTGAATTCA AAAATTAAAA CAAACCTTTC ACCCTATAGG CTTCAAATGT TTAACATTAT CACATATAAA TTTCTAAAAA GCATAAAAAG AGGTGAAATG TTGATGGATG AAAAAAGTTT GTGGAGGATG TACTTTTTTG GACAGGAGTT AAAGAAAACA TTCTTAAAAT CAGGTGCAGA AAACAAGATA ACGAGCCTTG CATATCGTTT GATTTCAGCA CTTAGAATTG GTGACATAAA CACCTTTATG AATTTAGTTA TTAGAACTTA TATGAACTAT AACATGGAAG TACCGGCTTT ATTTGTTTCG TGTATAAATG ATAAAGACAA TTTTTGCGCA TTAGGATATA GCTTTGTAAA CGGGCTTTTG GGAAGTGAAA GAGATGAAAG ATTAGAAAAT GAGGAGGATG AGGAGAAATG A
Protein sequence | MKERVYLGDW AYNAGIIGFI EIMLDGEDID SQNIITIGLN YIEFERESLR GFSDKFFKKA YQRYPRTDEI INEGKDLLEQ LNNRSDIDEQ QRERIRKFKD RVNGFAKLSR LAKEYGCSLN KKFNKNEAVD FVNTIIKILE DRKQEFMEND VKVYLNSVSS VYGEASFLNR QITENLKEKF YNDFEKPIIE KANEEDKKYP CIFCGERKAK KGAMFNTGIV NFLGANKDNK NFFWNFKPQL PICEICELMY FCIFAALTEF RVGQTKRFYF VDKSTSVLEL YQANKLFMEI MSKEENLLKD KGILNFINDY LLLKLREESK FALTNVLFVE IDLSSVAPKV YGFNISKQKA EFVTSNYEFF ENVVGTKITV KDNTLYPFHE LLQRFLNNTL SFQFVSFLES QFISSKKVNS KIKTNLSPYR LQMFNIITYK FLKSIKRGEM LMDEKSLWRM YFFGQELKKT FLKSGAENKI TSLAYRLISA LRIGDINTFM NLVIRTYMNY NMEVPALFVS CINDKDNFCA LGYSFVNGLL GSERDERLEN EEDEEK
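The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and how a GC percentage like the one reported in the Gene Information fields might be computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding convention the database uses for its GC value is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used only as a short example.
    example = "ATGAAGGAGA GGGTTTATTT"
    seq = clean_sequence(example)
    print(len(seq), gc_percent(seq))
```

Passing the full gene sequence through `clean_sequence` should also give a string whose length can be compared with the Gene Length field.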