Gene Ppro_2337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpro_2337 
Symbol 
ID4574497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter propionicus DSM 2379 
KingdomBacteria 
Replicon accessionNC_008609 
Strand
Start bp2523418 
End bp2524341 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content63% 
IMG OID639756387 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_902002 
Protein GI118580752 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00747033 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAC CAATTCTTCC TCCCCTGAAA CCGCTCCCCA TCAAGGACCG CATCTCGGTG 
ATCTATGTGG AGCGTGGCAA CCTGGATATC CTTGACGGGG CGTTCGTGGT GGTGGATGCC
ACCGGCGTGC GTACCCACAT ACCCATCGCT ACGGTGGCCT GCCTGATGCT GGAACCGGGG
GCGCGGGTGT CACATGCGGC CGTTGTGCTG GCGGCGCGGG TAGGGTGCCT GCTGGTCTGG
ATCGGAGAGG CCGGGGTGCG GTTGTACGCG GCGGGACAAC CGGGAGGGGC TCGTTCCGAT
CGGTTACTGT ACCAGGCCAA GCTGGCTCTG GACGATACGG CCCGGCTCAA GGTGGTGCGC
AAGATGTACG CCCTGCGCTT CAAGGAGGAA CCGCCTGAAC GGCGCAGCGT GGAACAGTTG
CGCGGAATCG AAGGGGTGCG GGTGCGCAAG ATGTACGAAC TGCTGGCCCG CCAGTACGGT
GTGGAGTGGA AAAACCGTAA TTATGATCAT AGCGAATGGG GGAGCGGCGA CCTGCCCAAC
CGCTGTCTCT CGTCGGCCAC CGCCTGTATT TACGGTATCT GTGAGGCCGC CATTCTGGCG
GCGGGGTATG CGCCTGCCGT GGGGTTCATT CACACCGGCA AGCCGCAGTC TTTTGTCTAC
GATATCGCCG ATATCTTCAA GTTCGAGACG GTGGTGCCGG TGGCTTTCCG TGTTGCCGCC
AAAAAGCCGC GCAACCCGGA GCGGGAGGTG CGGCTGGCTT GCCGGGATTC GTTCCGCCAG
ACCAAACTTT TGCAGCGGAT TATCCCCACC ATTGAGCAGG TGCTGGCCGC TGGTGAAATG
GAGCTGCCCA AGGCGCACGA GGAGGCGGTT GCGCCGGCCA TACCCAACAA GGAGGGGATC
GGGGATGCTG GTCATCGTGG TTGA
 
Protein sequence
MAEPILPPLK PLPIKDRISV IYVERGNLDI LDGAFVVVDA TGVRTHIPIA TVACLMLEPG 
ARVSHAAVVL AARVGCLLVW IGEAGVRLYA AGQPGGARSD RLLYQAKLAL DDTARLKVVR
KMYALRFKEE PPERRSVEQL RGIEGVRVRK MYELLARQYG VEWKNRNYDH SEWGSGDLPN
RCLSSATACI YGICEAAILA AGYAPAVGFI HTGKPQSFVY DIADIFKFET VVPVAFRVAA
KKPRNPEREV RLACRDSFRQ TKLLQRIIPT IEQVLAAGEM ELPKAHEEAV APAIPNKEGI
GDAGHRG