Gene GSU0057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0057 
Symbol 
ID2686100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp72630 
End bp74309 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content64% 
IMG OID637124722 
ProductCRISPR-associated Cas1/Cas4 family protein 
Protein accessionNP_951119 
Protein GI39995168 
COG category[L] Replication, recombination and repair 
COG ID[COG1468] RecB family exonuclease
[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGA CAGACGGGAG TATTCCTCTC ATCCCGGTTC GCATGCTCAA CGAACACGTC 
TACTGCCCGC GACTGGCCTA TCTCATGTGG GTGCAGGGTG AGTTCTCCCA CAATGAGTTC
ACGGTTGATG GCGTTATCCG CCACCGCAGG GTCGATGCTG GCGGCGGAGT GCTGCCCTCC
GAGACCCAGG AGGATTCCAG GATACATGCC CGCTCGGTGA GTCTCAGCTC GGAACGGCTG
GGAATTACCG CCAAGATCGA TCTGGTGGAA GGGGAGGGAG CATACGTTTC TCCTGTCGAT
TACAAGCGGG GCAAACGCCC CCATGTGGCC GGCGGAGCAT ACGAGCCGGA GCGGGTTCAG
CTCTGTGCCC AGGGGCTTCT TCTGCGGGAG CACGGATTTG CCAGCGATGG CGGCGCTCTC
TACTTCGTTG CCTCCCGCGA ACGGGTGCCG GTTGCATTTG ATGATGAACT GATCGGAAGA
ACCCTGGCCG CCATTGATGA GATGGGACGC ACGGCGTTGT CGGGCACGAT GCCCCCGCCG
CTGGAGGACA GTCCCAAATG CCCTCGCTGC TCGCTGGTGG GGATCTGTCT GCCGGATGAA
GTGCGCTTTC TCTCCCATTT GTCGGTGGAG CCCCGCCCGA TCATCCCGGC CGACGGGCGG
GGGCTTCCTC TTTATGTCCA GTCGCCTAAG GCCTATGTGC GCAAGGACGG CGATTGCCTG
GTCATCGAAG AGGAGCGGGT ACGGGTGGCC GAGGCCCGGT TGGGGGAGAC GTCGCAGGTG
GCGCTCTTCG GCAACGCGAC CCTCACGACG GCGGCCCTCC ACGAATGTCT GCGCCGGGAG
ATTCCCGTCA CTTGGCTCTC CTACGGGGGC TGGTTCATGG GGCATACCGT CAGCACGGGG
CACCGCAATG TGGAAACCCG CACCTACCAG TACCAGCGGA GCTTTGATCC GGAGACCTGC
CTGAACCTCG CCCGGCGCTG GATCGTAGCC AAGATCGCCA ACTGTCGGAC GCTGCTGCGG
CGCAACTGGC GGGGGGAAGG TGACGAAGCA AAGGCGCCCC CCGGTCTGCT CATGTCGCTG
CAGGATGACA TGCGCCACGC AATGCGAGCC CCTTCGCTGG AGGTGCTGCT CGGCATCGAG
GGGGCTTCCG CCGGCCGCTA CTTTCAGCAT TTCAGCCGGA TGCTCCGCGG TGGTGATGGC
GAAGGGATGG GTTTTGACTT CACCACCCGC AACCGCCGTC CGCCCAAGGA TCCGGTCAAT
GCCCTGCTCT CCTTCGCCTA TGCCATGCTC ACCCGGGAGT GGACCGTGGC GCTCGCCGCC
GTGGGACTCG ATCCCTACCG GGGCTTCTAC CATCAGCCCC GCTTCGGCCG TCCGGCCCTG
GCTCTTGACA TGATGGAGCC GTTTCGGCCG CTGATCGCGG ATTCAACGGT GCTTATGGCA
ATCAATAACG GCGAGATCCG CACCGGCGAC TTCGTCCGTT CCGCCGGCGG CTGCAACCTG
ACCGACAGCG CACGCAAGCG TTTCATCGCT GGGTTCGAGC GCCGTATGGA GCAGGAGGTG
ACACACCCCA TCTTCAAGTA CACAATCAGT TACCGGCGGC TGCTGGAGGT GCAGGCGCGG
CTTCTGACCC GTTACCTTTC GGGGGAGATC CCCGCCTATC CGAACTTTGT CACGAGGTGA
 
Protein sequence
MAETDGSIPL IPVRMLNEHV YCPRLAYLMW VQGEFSHNEF TVDGVIRHRR VDAGGGVLPS 
ETQEDSRIHA RSVSLSSERL GITAKIDLVE GEGAYVSPVD YKRGKRPHVA GGAYEPERVQ
LCAQGLLLRE HGFASDGGAL YFVASRERVP VAFDDELIGR TLAAIDEMGR TALSGTMPPP
LEDSPKCPRC SLVGICLPDE VRFLSHLSVE PRPIIPADGR GLPLYVQSPK AYVRKDGDCL
VIEEERVRVA EARLGETSQV ALFGNATLTT AALHECLRRE IPVTWLSYGG WFMGHTVSTG
HRNVETRTYQ YQRSFDPETC LNLARRWIVA KIANCRTLLR RNWRGEGDEA KAPPGLLMSL
QDDMRHAMRA PSLEVLLGIE GASAGRYFQH FSRMLRGGDG EGMGFDFTTR NRRPPKDPVN
ALLSFAYAML TREWTVALAA VGLDPYRGFY HQPRFGRPAL ALDMMEPFRP LIADSTVLMA
INNGEIRTGD FVRSAGGCNL TDSARKRFIA GFERRMEQEV THPIFKYTIS YRRLLEVQAR
LLTRYLSGEI PAYPNFVTR