Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1602 |
Symbol | |
ID | 7272144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1649480 |
End bp | 1650391 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643570215 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002466637 |
Protein GI | 219852205 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.19087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.843574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCCC CTCTCAAACC CATTACCATC AAAGAACGAA TGTCGATGCT CTTCCTGGAG CGAGGAGAAC TGGATGTTCT CGATGGTGCA TTTGTCCTGG TCGATAAAAA CGGTGTCCGC ACCCAGATCC CGATCGGTTC AGTTGCATGC CTGATGCTCG AACCTGGGAC CAGGGTCTCC CATGCTGCCG TGGTGCTGGC AGCCAGGGTC GGATGCCTGC TGATCTGGGT TGGTGAGGCC GGGGTCAGGC TCTATGCGGC AGGACAGCCC GGCGGTGCCA GGTCTGACCG GTTGTTGTAC CAAGCAAAAC TCGCACTCGA CCCGGAGGCA CGCCTCAAAG TTGTACAGAA GATGTATGAG ATCCGGTTTC AGGGCCCTGT CCCCTCCCAT TACAGTATTG AACAACTCCG GGGTATGGAG GGGGCACGGG TCAGGGAGAT GTACGTGCAG ATCGGAAATC AGAACGGGAT TACCTGGAAA GGGCGGGGAT ATGATCACAC CGAATGGGAG AGCGGTGATC TCTCGAACCG GTGCCTGAGT TCTGCCACCG CATGTCTATA TGGGATCTGC GAGGCGGCGA TCCTGGCTGC GGGTTACGCG CCGGCTATTG GGTTTATTCA CTCTGGAAAG CCGCAGTCGT TTGTATACGA TATCGCCGAT CTTTTCAAAT TTGAGACTGT GGTCCCGGTA GCGTTTAGGG TTGCAGCGAA GAACCCAACT GATCCTGAAC GTATGGTCAG GCTCGGGTGC AGGGATATGT TCCGGGAGAC ACGAATCTTG AGGAAGATCA TCCCTGCCAT CGAGACGGTG CTGGCTGCCG GCGGGCTCCC GGTTCCACAG GCCCCTGCCG ATACATTGAA ACCTGCGATC CCAGTGGAGA AGGGGTTGGG CGATGCTGGT CATCGTGGTT GA
|
Protein sequence | MLPPLKPITI KERMSMLFLE RGELDVLDGA FVLVDKNGVR TQIPIGSVAC LMLEPGTRVS HAAVVLAARV GCLLIWVGEA GVRLYAAGQP GGARSDRLLY QAKLALDPEA RLKVVQKMYE IRFQGPVPSH YSIEQLRGME GARVREMYVQ IGNQNGITWK GRGYDHTEWE SGDLSNRCLS SATACLYGIC EAAILAAGYA PAIGFIHSGK PQSFVYDIAD LFKFETVVPV AFRVAAKNPT DPERMVRLGC RDMFRETRIL RKIIPAIETV LAAGGLPVPQ APADTLKPAI PVEKGLGDAG HRG
|
| |