Gene Paes_1413 details


Gene Information

Locus tag: Paes_1413
Symbol:
ID: 6458958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1538648
End bp: 1539523
Gene Length: 876 bp
Protein Length: 291 aa
Translation table: 11
GC content: 44%
IMG OID: 642725398
Product: CRISPR-associated protein Cas1
Protein accession: YP_002016081
Protein GI: 194334221
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
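
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1538648, 1539523)
print(length)                     # 876
print(protein_length_aa(length))  # 291
```

Both values agree with the record: 876 bp and 291 aa.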


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0233391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0713718
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACTG AAAAGATAGT ACCCGAGAGC AAGCCATCTC GGTTGCTGAT AAAAATCACC 
CGAGACACCT TACCGCAGGT GAAGGATAAA TACCCGTTTC TCTATCTTGA GCGAGGGCGT
CTGGAAATAG ACGATAGCAG TATAAAATGG ATAGATTGCG ACTGTAACGT TGTCCGGTTA
CCTGTGGCGC AGCTCAATTG CTTGTTGCTT GGACCGGGGA CTGCTGTTAC ACATGAAGCT
GTAAAAGTTA TGGCAGCAGC AAATTGTGGT ATATGCTGGG TCGGGGAAGA TAGTCTGATT
TTTTATGCCG CAGGACAGAC GCCTACAAGT GATTCTCGGA ACTTTCGACG GCAAATGGTG
TTGTCTGCCG ATCCTGAAAA ATCTCTTAAA GTTGCTCAGC GCATGTTTGC CCGCAGATTT
CCTGATGCGA AACTTGAGAC TAAGAGCCTT CAGCAAATGA TGGGAATGGA AGGGTTGCGT
GTTCGTCAAC TTTATGTACA AAAAGCTCAA GAATACAAGG TGGGCTGGAA GGGACGACAA
TTTACTCCTG GAAAGTTTGA AATAGGGGAT ATAACTAATA GAATTCTGAC ATCATCAAAT
GCAGCTCTAT ATGGTATAAT TTGTTCTGCT GTTCACAGTA TGGGTTATTC TCCACACATG
GGTTTTATAC ATACAGGTAG TCCGCTGCCA TTCGTTTATG ATTTGGCAGA TTTATACAAA
GAGAATCTCT CGATTGATCT CGCCTTTCGA TTGACGGCGT TGATGGCCGG AACTTATGAT
AGGCACAAAA TTGCTACTGA ATTTCGCAGG AGAGTTATTG AGATGGATCT TCTTGCTCGT
ATTGGGCCTG ATATTGAAGA AATGCTTGGG AGGTAA
 
Protein sequence
MKTEKIVPES KPSRLLIKIT RDTLPQVKDK YPFLYLERGR LEIDDSSIKW IDCDCNVVRL 
PVAQLNCLLL GPGTAVTHEA VKVMAAANCG ICWVGEDSLI FYAAGQTPTS DSRNFRRQMV
LSADPEKSLK VAQRMFARRF PDAKLETKSL QQMMGMEGLR VRQLYVQKAQ EYKVGWKGRQ
FTPGKFEIGD ITNRILTSSN AALYGIICSA VHSMGYSPHM GFIHTGSPLP FVYDLADLYK
ENLSIDLAFR LTALMAGTYD RHKIATEFRR RVIEMDLLAR IGPDIEEMLG R
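
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it strips the 10-character block spacing, computes the GC fraction, and translates the DNA with the amino acid assignments of NCBI translation table 11 (which are the same as in the standard code; only the set of permitted start codons differs). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Alternative start codons are not special-cased here; this gene
    starts with ATG, so that does not matter for this record.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAACTG AAAAGATAGT ACCCGAGAGC AAGCCATCTC GGTTGCTGAT AAAAATCACC"
print(translate(first_line))  # MKTEKIVPESKPSRLLIKIT
print(f"GC fraction of the first line: {gc_fraction(first_line):.2f}")
```

The translated prefix matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.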