Gene Information |
Locus tag | GSU0057 |
Symbol | |
ID | 2686100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 72630 |
End bp | 74309 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637124722 |
Product | CRISPR-associated Cas1/Cas4 family protein |
Protein accession | NP_951119 |
Protein GI | 39995168 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease; [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR00372] CRISPR-associated protein Cas4 |
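The Start bp, End bp, Gene Length and Protein Length fields above are tied together by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated explicitly in the record). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(72630, 74309)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 1680 bp and 559 aa, which agrees with the Gene Length and Protein Length fields.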
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAGA CAGACGGGAG TATTCCTCTC ATCCCGGTTC GCATGCTCAA CGAACACGTC TACTGCCCGC GACTGGCCTA TCTCATGTGG GTGCAGGGTG AGTTCTCCCA CAATGAGTTC ACGGTTGATG GCGTTATCCG CCACCGCAGG GTCGATGCTG GCGGCGGAGT GCTGCCCTCC GAGACCCAGG AGGATTCCAG GATACATGCC CGCTCGGTGA GTCTCAGCTC GGAACGGCTG GGAATTACCG CCAAGATCGA TCTGGTGGAA GGGGAGGGAG CATACGTTTC TCCTGTCGAT TACAAGCGGG GCAAACGCCC CCATGTGGCC GGCGGAGCAT ACGAGCCGGA GCGGGTTCAG CTCTGTGCCC AGGGGCTTCT TCTGCGGGAG CACGGATTTG CCAGCGATGG CGGCGCTCTC TACTTCGTTG CCTCCCGCGA ACGGGTGCCG GTTGCATTTG ATGATGAACT GATCGGAAGA ACCCTGGCCG CCATTGATGA GATGGGACGC ACGGCGTTGT CGGGCACGAT GCCCCCGCCG CTGGAGGACA GTCCCAAATG CCCTCGCTGC TCGCTGGTGG GGATCTGTCT GCCGGATGAA GTGCGCTTTC TCTCCCATTT GTCGGTGGAG CCCCGCCCGA TCATCCCGGC CGACGGGCGG GGGCTTCCTC TTTATGTCCA GTCGCCTAAG GCCTATGTGC GCAAGGACGG CGATTGCCTG GTCATCGAAG AGGAGCGGGT ACGGGTGGCC GAGGCCCGGT TGGGGGAGAC GTCGCAGGTG GCGCTCTTCG GCAACGCGAC CCTCACGACG GCGGCCCTCC ACGAATGTCT GCGCCGGGAG ATTCCCGTCA CTTGGCTCTC CTACGGGGGC TGGTTCATGG GGCATACCGT CAGCACGGGG CACCGCAATG TGGAAACCCG CACCTACCAG TACCAGCGGA GCTTTGATCC GGAGACCTGC CTGAACCTCG CCCGGCGCTG GATCGTAGCC AAGATCGCCA ACTGTCGGAC GCTGCTGCGG CGCAACTGGC GGGGGGAAGG TGACGAAGCA AAGGCGCCCC CCGGTCTGCT CATGTCGCTG CAGGATGACA TGCGCCACGC AATGCGAGCC CCTTCGCTGG AGGTGCTGCT CGGCATCGAG GGGGCTTCCG CCGGCCGCTA CTTTCAGCAT TTCAGCCGGA TGCTCCGCGG TGGTGATGGC GAAGGGATGG GTTTTGACTT CACCACCCGC AACCGCCGTC CGCCCAAGGA TCCGGTCAAT GCCCTGCTCT CCTTCGCCTA TGCCATGCTC ACCCGGGAGT GGACCGTGGC GCTCGCCGCC GTGGGACTCG ATCCCTACCG GGGCTTCTAC CATCAGCCCC GCTTCGGCCG TCCGGCCCTG GCTCTTGACA TGATGGAGCC GTTTCGGCCG CTGATCGCGG ATTCAACGGT GCTTATGGCA ATCAATAACG GCGAGATCCG CACCGGCGAC TTCGTCCGTT CCGCCGGCGG CTGCAACCTG ACCGACAGCG CACGCAAGCG TTTCATCGCT GGGTTCGAGC GCCGTATGGA GCAGGAGGTG ACACACCCCA TCTTCAAGTA CACAATCAGT TACCGGCGGC TGCTGGAGGT GCAGGCGCGG CTTCTGACCC GTTACCTTTC GGGGGAGATC CCCGCCTATC CGAACTTTGT CACGAGGTGA
|
Protein sequence | MAETDGSIPL IPVRMLNEHV YCPRLAYLMW VQGEFSHNEF TVDGVIRHRR VDAGGGVLPS ETQEDSRIHA RSVSLSSERL GITAKIDLVE GEGAYVSPVD YKRGKRPHVA GGAYEPERVQ LCAQGLLLRE HGFASDGGAL YFVASRERVP VAFDDELIGR TLAAIDEMGR TALSGTMPPP LEDSPKCPRC SLVGICLPDE VRFLSHLSVE PRPIIPADGR GLPLYVQSPK AYVRKDGDCL VIEEERVRVA EARLGETSQV ALFGNATLTT AALHECLRRE IPVTWLSYGG WFMGHTVSTG HRNVETRTYQ YQRSFDPETC LNLARRWIVA KIANCRTLLR RNWRGEGDEA KAPPGLLMSL QDDMRHAMRA PSLEVLLGIE GASAGRYFQH FSRMLRGGDG EGMGFDFTTR NRRPPKDPVN ALLSFAYAML TREWTVALAA VGLDPYRGFY HQPRFGRPAL ALDMMEPFRP LIADSTVLMA INNGEIRTGD FVRSAGGCNL TDSARKRFIA GFERRMEQEV THPIFKYTIS YRRLLEVQAR LLTRYLSGEI PAYPNFVTR
|
| |
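The sequence fields can be cross-checked against the GC content and Translation table entries in the Gene Information section. The sketch below is one possible way to do that in plain Python, not the method the database itself uses. It assumes the sequence is pasted exactly as shown (upper-case, with whitespace between 10-base blocks) and that it is already given on the coding strand. The codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares; table 11 differs from the standard code in its permitted start codons, which this sketch does not handle (the gene here starts with ATG). The names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGGCTGAGA CAGACGGGAG TATTCCTCTC"
print(translate(prefix))
print(round(gc_fraction(prefix), 3))
```

Applied to the first 30 bases, the translation is MAETDGSIPL, the first block of the protein sequence above. Passing the full Gene sequence value to `gc_fraction` and `translate` allows the same comparison against the GC content and Protein sequence fields.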