Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1141 |
Symbol | |
ID | 5103489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1077940 |
End bp | 1079418 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507033 |
Product | CRISPR-associated helicase Cas3 family protein protein |
Protein accession | YP_001191226 |
Protein GI | 146303910 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.607397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCT TAGTTGATTA TTACGGAGAG GCTTGTAAGC TACAAGGGTT TGAGCCTAGG AAGGGGATAG AGGAGACCCT TTCCAAGATA GAGGAGGGGA AAGCAGTCAT CCTAACTGCC CCAACGGGTT ACGGCAAGAC CTCACTTACG TATGCCTTGG GACTGGCCAG TCTCAGGGGG AACGGGCACT TTGATAGGGT AATACACGTC CTGCCCCTTA GAAGTATTGT CCAGGACCTT ACCTCAAAGC TCGAGAACTT CATGAGGCTA GCGGGTTACT CCAACACAGT TGTTGGGGCC TACGACATGG ACTTCCACGA CACTCCTTAC TTCCTGAGAA AGGTGAACGT TACGACGCTT GACTCATTCG TATTGAACAT GTTCAAACTA CCTGTGTCCG AGATTACTAG GGGTATGAAG GGGATGGGGA CTCACTACGA GGTTCCGAGG GGGGCGATCT ACTCCTCAGT GGTTGTGTTT GACGAATTCC ATCTTTTCTC CGATGACGGA GGAAAGGACA AAAGCCTGAC CTCAGTGATA GCGTCACTTA GGGGACTTGG GGCGATGCAG GTCCCCTTCG TAATCATGAC GGCAACCCTT CCAGGTTCCC TTAGGGACTT GATCAAGGAG GAGCTTGAGG ACGTCGTTGA GGTCGTGGAG GTCAAGGACA ATTTCAAGAT AGAGAGGGAT GTTAGCGTTG ACTTCGTGGA TGAGCTGGAC TTCAATAAGC TTGATAGAAG GACTCTCGTG GTCATGAACA CCAGGAAGGG GGCCATCACC GCGTACCAAG AGGCCAAGAA GGCCGGACTG TCCCCCGTTC TGATTCACTC GAAGTTCAGC GCCATGGATA GGAGAAGGAA AGTTGACGAG ATCAAGAACG CTAAACTGGT CATCTCAACG CAGGTGATAG AGGCCGGAAT TGACGTTTCG TTTGACGTCC TCTACACCGA GGCTGCCCCA CTCCCCAACT TGGTCCAGAG GGCTGGCAGA GTTGCGAGGT ATGGCGGACA GGGAGAGGTT CACATTCTTC CCTTCAGCGG TCACGTCTAC GATCGGAACG ATGTTGAGAC GAGCCTTGAA ATTGTGAGAA GGGAGGGCAA ACTTGATTCG TCACTTATGT CGAGTTTTAA CACCAGTTAC ATCCTGAACT CCGATCTTCT GTTCTCGTTA AATATCTTGG ATGAGGGGCC CTTCTTCTCG TCGGAGGCAA CTGCGAAACT CCTTAAGAAG GAATGCTCGA TCACAAGGGA GACATCCCTT ATCATGGGCT TTCCCCAAGG ATGTAGATCC TCAGCCTGCG GGATCCCGCT AACTGAGGAT GAGGCTAAGG ACTTGTTGGA GAGAGGGGCC AAGCCACTTC GCGATGGAGA ACTAGTTGAC TGGAAACCTG GGAACCTTTG CCTCTCAATA GATTTCATGC TGAAGGGAAT TGACGGAATC TCTGTGGACT ACAATCAGGA GGTTGGGGCG ATACTATGA
|
Protein sequence | MSSLVDYYGE ACKLQGFEPR KGIEETLSKI EEGKAVILTA PTGYGKTSLT YALGLASLRG NGHFDRVIHV LPLRSIVQDL TSKLENFMRL AGYSNTVVGA YDMDFHDTPY FLRKVNVTTL DSFVLNMFKL PVSEITRGMK GMGTHYEVPR GAIYSSVVVF DEFHLFSDDG GKDKSLTSVI ASLRGLGAMQ VPFVIMTATL PGSLRDLIKE ELEDVVEVVE VKDNFKIERD VSVDFVDELD FNKLDRRTLV VMNTRKGAIT AYQEAKKAGL SPVLIHSKFS AMDRRRKVDE IKNAKLVIST QVIEAGIDVS FDVLYTEAAP LPNLVQRAGR VARYGGQGEV HILPFSGHVY DRNDVETSLE IVRREGKLDS SLMSSFNTSY ILNSDLLFSL NILDEGPFFS SEATAKLLKK ECSITRETSL IMGFPQGCRS SACGIPLTED EAKDLLERGA KPLRDGELVD WKPGNLCLSI DFMLKGIDGI SVDYNQEVGA IL
|
| |