Gene GSU1392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1392 
Symbol 
ID2685854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1520569 
End bp1521489 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content63% 
IMG OID637126067 
ProductCRISPR-associated Cas1 family protein 
Protein accessionNP_952445 
Protein GI39996494 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAC AGCTTCCTCC CCTTAAACCC ATCCCGATAA AAGATCGCAT CTCGGTTCTC 
TATGTGGAAA GGGGGAACCT CGATGTGCTT GACGGCGCCT TCGTGGTGGT GGACAAGACC
GGCGTCCGCA CCCATCTCCC CGTTGGCGGG GTGGCGTGTC TCATGCTGGA GCCGGGCACA
CGGGTATCCC ATGCAGCGGT GACGCTCGCC TCCCGGATCG GCTGCCTCCT CGTCTGGATC
GGCGAGGCCG GGGTCAGGCT CTACGCCTCG GGCCAGCCGG GCGGGGCGCG GGCCGACCGG
CTCCTCTATC AGGCGAAACT GGCCCTGGAC GATTCGGCTC GGTTGAAAGT TGTGCGCAAG
ATGTACGCCC TCCGCTTCAG GGAAGAACCT CCCGAGCGGC GGAGCGTGGA ACAACTGCGC
GGCATTGAGG GGGTGAGGGT TCGTAAGATG TATGAGCTTC TCGCCCGCCA GCACGGTGTT
GCGTGGAAGG CCCGCAACTA CGACCACACT CAGTGGGAAA GCGGCGATGT GCCGAACCGC
TGTCTGTCAT CGGCCACCGC CTGTCTCTAC GGTATCTGCG AGGCGGCGAT CCTGGCGGCG
GGCTATGCGC CGGCGGTCGG TTTCATCCAT ACCGGCAAGC CCCAATCGTT CGTCTACGAC
ATCGCCGACA TCTTCAAGTT CGAAACCGTG GTGCCGGTGG CCTTCCGGAT CGCCGCCAAA
AAGCCCCGCG ACCCGGAGCG GGAAGTGCGG CTCGCCTGTC GCGATGCCTT CCGTCAGTCA
AAGATTCTGC ATCGCATCAT ACCTACCATT GAGCAAGTGC TAGCAGCCGG TGGCATGGAT
GTGCCCACGC CGCCGCCCGA GTCGGTCGAA GCCGTAATTC CGAACAAGGA GGGGATCGGA
GATGCTGGTC ATCGTGGTTG A
 
Protein sequence
MQPQLPPLKP IPIKDRISVL YVERGNLDVL DGAFVVVDKT GVRTHLPVGG VACLMLEPGT 
RVSHAAVTLA SRIGCLLVWI GEAGVRLYAS GQPGGARADR LLYQAKLALD DSARLKVVRK
MYALRFREEP PERRSVEQLR GIEGVRVRKM YELLARQHGV AWKARNYDHT QWESGDVPNR
CLSSATACLY GICEAAILAA GYAPAVGFIH TGKPQSFVYD IADIFKFETV VPVAFRIAAK
KPRDPEREVR LACRDAFRQS KILHRIIPTI EQVLAAGGMD VPTPPPESVE AVIPNKEGIG
DAGHRG