Gene Cpha266_1431 details


Gene Information

Locus tag: Cpha266_1431
Symbol:
ID: 4568992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1633521
End bp: 1634552
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 50%
IMG OID: 639766017
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_911883
Protein GI: 119357239
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
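
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1633521, 1634552)
print(length)                  # 1032
print(protein_length(length))  # 343
```

Both values agree with the Gene Length and Protein Length fields of this record.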


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0863156
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAAGC TGCTCAACAC GCTGTTCGTT ACCACGCAGG GTGCATATCT GTCGAAAGAA 
GGTGAGTGCG CGGTCATAAA AATCGATAAA GTTGAAAAAG TACGATTGCC TCTTCATATG
CTTGACGGTA TTATCTGTTT CGGACAGATC ACCTGCAGCC CATTTCTCAT GGGTCACTGT
GCAGAAAAGG GTGTGACGGT GACGTTCCTT ACCGAATACG GCAAATTCCT CTGTCAGGTG
CAGGGGCCAA CGAAAGGCAA CATTCTGCTC AGGCGCGCAC AATACCGGCA AGCAGACAAC
TATCAGCAGT CGGCCATGCT TGCCCGGGCG TTCGTCATAG GAAAAATCGG CAACAGCAGG
GTTACCCTTG CAAGGGCCAT GCGGGACCAC CCCGAAAAGG TCGATGGCGA AAAAATGCAT
TATGCACAGC AACTGCTTGC CGGTTGCATT AAAAAGCTTG GCGATGAAAC CGATCAAGAG
CGAATCAGGG GGATCGAGGG TGAAGCAGGA AGAATTTATT TCGAGGTATT CGACCAGTGC
ATCACAACTT CCGACCCGTT GTTCCGGTTT AATGGCCGAA ACCGTCGACC GCCGGTTGAC
CGGGTAAATT GTCTGCTTTC GTTTCTCTAT ACCCTTGTGA CGCATGATAT CCGCTCCGCA
CTTGAGTCAT GCGGGCTCGA TCCGGCAGCG GGTTTTCTGC ACAAGGATCG CCCGGGTCGT
CCGAGCCTCG CTCTCGATAT GCTCGAAGAG TTTCGTTCCT ATATCGCCGA CAGAATGGCA
TTGTCGTTAA TCAATCGGGG TCAGATTCAG GCAAATGATT TCACGGTATC CGATACTGGC
GCTGTGCTGA TGAAAGACGA TGCAAGAAAA ACCTTGCTAA CGGCTTACCA GAAAAGAAAA
CAGGAAGAAA TAGAACATCC GTATGTCAGG GAAAAAATGG CTGTGGGTCT CATCTGGCAT
ATGCAGGCTA TGTTGCTGGC TCGGTATATC CGGGGGGATA TCGATATGTA CCCTCCTTTT
GTCTGGAGGT AA
 
Protein sequence
MKKLLNTLFV TTQGAYLSKE GECAVIKIDK VEKVRLPLHM LDGIICFGQI TCSPFLMGHC 
AEKGVTVTFL TEYGKFLCQV QGPTKGNILL RRAQYRQADN YQQSAMLARA FVIGKIGNSR
VTLARAMRDH PEKVDGEKMH YAQQLLAGCI KKLGDETDQE RIRGIEGEAG RIYFEVFDQC
ITTSDPLFRF NGRNRRPPVD RVNCLLSFLY TLVTHDIRSA LESCGLDPAA GFLHKDRPGR
PSLALDMLEE FRSYIADRMA LSLINRGQIQ ANDFTVSDTG AVLMKDDARK TLLTAYQKRK
QEEIEHPYVR EKMAVGLIWH MQAMLLARYI RGDIDMYPPF VWR