Gene Information |
Locus tag | GSU1392 |
Symbol | |
ID | 2685854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1520569 |
End bp | 1521489 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637126067 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | NP_952445 |
Protein GI | 39996494 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
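
The start and end coordinates, gene length, and protein length in this record are consistent with one another. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1520569, 1521489)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Both results match the Gene Length (921 bp) and Protein Length (306 aa) rows.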
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCAC AGCTTCCTCC CCTTAAACCC ATCCCGATAA AAGATCGCAT CTCGGTTCTC TATGTGGAAA GGGGGAACCT CGATGTGCTT GACGGCGCCT TCGTGGTGGT GGACAAGACC GGCGTCCGCA CCCATCTCCC CGTTGGCGGG GTGGCGTGTC TCATGCTGGA GCCGGGCACA CGGGTATCCC ATGCAGCGGT GACGCTCGCC TCCCGGATCG GCTGCCTCCT CGTCTGGATC GGCGAGGCCG GGGTCAGGCT CTACGCCTCG GGCCAGCCGG GCGGGGCGCG GGCCGACCGG CTCCTCTATC AGGCGAAACT GGCCCTGGAC GATTCGGCTC GGTTGAAAGT TGTGCGCAAG ATGTACGCCC TCCGCTTCAG GGAAGAACCT CCCGAGCGGC GGAGCGTGGA ACAACTGCGC GGCATTGAGG GGGTGAGGGT TCGTAAGATG TATGAGCTTC TCGCCCGCCA GCACGGTGTT GCGTGGAAGG CCCGCAACTA CGACCACACT CAGTGGGAAA GCGGCGATGT GCCGAACCGC TGTCTGTCAT CGGCCACCGC CTGTCTCTAC GGTATCTGCG AGGCGGCGAT CCTGGCGGCG GGCTATGCGC CGGCGGTCGG TTTCATCCAT ACCGGCAAGC CCCAATCGTT CGTCTACGAC ATCGCCGACA TCTTCAAGTT CGAAACCGTG GTGCCGGTGG CCTTCCGGAT CGCCGCCAAA AAGCCCCGCG ACCCGGAGCG GGAAGTGCGG CTCGCCTGTC GCGATGCCTT CCGTCAGTCA AAGATTCTGC ATCGCATCAT ACCTACCATT GAGCAAGTGC TAGCAGCCGG TGGCATGGAT GTGCCCACGC CGCCGCCCGA GTCGGTCGAA GCCGTAATTC CGAACAAGGA GGGGATCGGA GATGCTGGTC ATCGTGGTTG A
|
Protein sequence | MQPQLPPLKP IPIKDRISVL YVERGNLDVL DGAFVVVDKT GVRTHLPVGG VACLMLEPGT RVSHAAVTLA SRIGCLLVWI GEAGVRLYAS GQPGGARADR LLYQAKLALD DSARLKVVRK MYALRFREEP PERRSVEQLR GIEGVRVRKM YELLARQHGV AWKARNYDHT QWESGDVPNR CLSSATACLY GICEAAILAA GYAPAVGFIH TGKPQSFVYD IADIFKFETV VPVAFRIAAK KPRDPEREVR LACRDAFRQS KILHRIIPTI EQVLAAGGMD VPTPPPESVE AVIPNKEGIG DAGHRG
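
The sequences above are displayed in blocks of 10 characters separated by spaces, so the spacing has to be removed before any analysis. A minimal sketch, assuming plain A/C/G/T input; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, copied from this record.
excerpt = clean_sequence("ATGCAGCCAC AGCTTCCTCC")
print(len(excerpt))          # 20
print(gc_fraction(excerpt))  # 0.6
```

Applying the same two functions to the full gene sequence would allow a comparison with the GC content row of this record.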