Gene Cpha266_2061 details

Gene Information

Locus tag: Cpha266_2061
Symbol:
ID: 4569494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2392804
End bp: 2393847
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 55%
IMG OID: 639766642
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_912497
Protein GI: 119357853
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
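
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the assumption reproduces the listed gene length), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2392804
END_BP = 2393847
GENE_LENGTH_BP = 1044
PROTEIN_LENGTH_AA = 347


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1044
print(expected_protein_length(GENE_LENGTH_BP))  # 347
```

Both results agree with the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.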


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGAGG CTTCTCTTTT TCTTCGCACG CTCTATATCC AGGAGCAGGG TTCGATGCTC 
AGGGTTGAGA ACGGGTGTTT TCGCGTGACG TGCGGGCACG ATGACGATGT TGCCGAACTG
CTTGAGGTGC AGTCAATCAA GGTTGGGCAG ATTGTGCTGT TCGGCGCCTG TATGATTACT
CCGGCTGCCA TCCGGCATTG CCTCATGAAC CGCATTCCCG TTGTGCTGCT TTCGCAGCAT
GGCGAGTATT TCACCCGTCT TGAATCGACC GATGATGTCA ATATCGATCT TGAGCGGTTA
CAGTTTCAGC GATCTGCTGA AGAGTCGTTT CCGCTGGAGT GTTCGCGAAC TATCGTGCGG
GCGAAGCTGC ATAATTCCGG GGTTTTGCTC AGGCGTCATG CTGAGTCATC GGGTTCAGAG
GCGCTCCGGC ACGCTGCTAC GCAACTGCGG CAACTGGAGG AGCATGTTGA TCGTGCCGAT
TCGATCGATG CGGTGCGGGG CTACGAAGGG AGCGGTGCGG CGACATATTT CGGCGTGTTC
GAGGATTTTT TTGATACCGG GGGGTTCATC TTCAGAGAGC GGGTCAAGCG TCCGCCGACC
GATCCGGTCA ATGCGATGCT GAGTTTCGGG TACAGTCTGC TTTTCAACAA CATTTTTTCG
ATGGCAAGAT TGCATCGGCT GCACCCTTAC GTCGGGTTTC TGCATGCCGA CAAGCCCGCT
CATCCTGCAC TTGTGAGCGA TCTGATCGAG GAGTTCCGCA CGCTTGTTGA CGGTCTCGTG
ATCGCGCTTA TCAACAAGCG GCTCATCAGC CCGGAGGAGT TTACCGTTGC GCGGCATGAT
GACGGAAAAC CCAAAGGGTG CTACCTCTCG GATGGAGCGC GCAAAACTTT TCTTCGCGAG
TTCGAAAACC TCATGCACCG GACAACGACC CACCCGGCAA CGGGCTATGA GGTAACCTGC
AGGAGGTGTC TTGACTTGCA GGTGGGGGAG TTTGCCCTTT ATCTCAAAGG GGAGAAACCG
TATACGCCAT ATCTGAGGAG GTGA
 
Protein sequence
MSEASLFLRT LYIQEQGSML RVENGCFRVT CGHDDDVAEL LEVQSIKVGQ IVLFGACMIT 
PAAIRHCLMN RIPVVLLSQH GEYFTRLEST DDVNIDLERL QFQRSAEESF PLECSRTIVR
AKLHNSGVLL RRHAESSGSE ALRHAATQLR QLEEHVDRAD SIDAVRGYEG SGAATYFGVF
EDFFDTGGFI FRERVKRPPT DPVNAMLSFG YSLLFNNIFS MARLHRLHPY VGFLHADKPA
HPALVSDLIE EFRTLVDGLV IALINKRLIS PEEFTVARHD DGKPKGCYLS DGARKTFLRE
FENLMHRTTT HPATGYEVTC RRCLDLQVGE FALYLKGEKP YTPYLRR
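
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: it builds the codon table shared by the standard code and translation table 11, and treats a leading ATG, GTG, or TTG as an initiator read as M. That set of three is an assumption covering the common bacterial start codons; table 11 lists further, rarer initiators. The helper name `translate` and its `initiator` parameter are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are the same in
# the standard code and in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the common alternative initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # An initiator codon is read as methionine regardless of its usual meaning.
    if initiator and protein and seq[:3] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("GTGAGTGAGG CTTCTCTTTT TCTTCGCACG"))            # MSEASLFLRT
print(translate("TATACGCCAT ATCTGAGGAG GTGA", initiator=False))  # YTPYLRR
```

The two excerpts reproduce the first ten and last seven residues of the protein sequence, including the GTG start codon being read as M.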