Gene Cpha266_1433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1433 
Symbol 
ID4568994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1635196 
End bp1636296 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID639766019 
ProductCRISPR-associated Csd2 family protein 
Protein accessionYP_911885 
Protein GI119357241 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02589] CRISPR-associated protein, Csd2 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.562068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACTG TAGATAAACG CTATGATTTT GTTCTACTTT TCGATGTTCA GGACGGAAAC 
CCGAATGGCG ATCCCGATGC TGGAAATTTT CCACGAATCG ATGCTGAAAC CGGGATAGGG
CTTGTGACTG ATGTTTGCTT GAAGCGCAAG GTTAGAAATT TTGTGCAGAT AAATGATGTT
CAACAGCCGG GTTATGATAT TTATGTCAAG GAGAAAGCTG TGCTTGGTTT GGCTCATTTC
AAGGCATTTA AAGAGTTAGG AATCAGTACT GGCGAAATGT CAAAAAAAGT CATTAACGAT
CAAGAGATGA TTGAAATATT TTCTGATCTG ACTTTACCAG AAGGGGTATC TTTTGCCGAG
AGTGATGAAC ATGGAGTATT ATCAATTGCT GCTGATGCGG ATAAAAACGA GATCAAAGAT
TGGATGAAAG CTGAAAAAGA GAGCCTTTCT AAAAATGTTA TAAAAGTTAT TAGTGAGGCT
CTGAAAGAAG CAAAGCCACG GAAGCCAACT GCTGAAGAAA CAAGTAGAGG CAAGGAAAAA
ATGTGTCAGG ATTACTATGA TATTCGAACT TTTGGTGCGG TAATGTCTTT GAAGTCAGCT
CCCAATTGTG GGCAGGTGCG CGGGCCGATT CAGATGACGT TTGCCCGCTC AGTCGAACCG
ATCGTGGCGT TGGAGCACAG CATTACACGA ATGGCTGTAG CAACGGAAGC TGAAGCGGAA
AAGCAGAGCG GCGACAACCG AACGATGGGC CGCAAATACA CCGTACCATA TGGCTTGTAT
CGCGCTCATG GATTCGTGTC AGCAAACCTC GCCCATCAGA CAGGTTTTTC TGAAAACGAT
CTCGATCTGT TCTGGAACGC TCTTTTGAAT ATGTTCGATC ATGACCGTTC GGCAGCTCGC
GGGCTGATGT CCACGCGCGG CCTGTATGTT TTCGAGCACA GCTCAGTTTT GGGTAATGCT
CCGGCAAGCC AGCTTTTCGA GCGAATCACG GTCAAGCGCA AAGAGGATTC CGAAGGTCCT
GCTCGTTCGT TCAAAGAATA CGATGTGCTG ATAGATGAAT CCAGTCTTGG TGAGGTGAAG
CTTCTCCGAA AACTTGGATA A
 
Protein sequence
MTTVDKRYDF VLLFDVQDGN PNGDPDAGNF PRIDAETGIG LVTDVCLKRK VRNFVQINDV 
QQPGYDIYVK EKAVLGLAHF KAFKELGIST GEMSKKVIND QEMIEIFSDL TLPEGVSFAE
SDEHGVLSIA ADADKNEIKD WMKAEKESLS KNVIKVISEA LKEAKPRKPT AEETSRGKEK
MCQDYYDIRT FGAVMSLKSA PNCGQVRGPI QMTFARSVEP IVALEHSITR MAVATEAEAE
KQSGDNRTMG RKYTVPYGLY RAHGFVSANL AHQTGFSEND LDLFWNALLN MFDHDRSAAR
GLMSTRGLYV FEHSSVLGNA PASQLFERIT VKRKEDSEGP ARSFKEYDVL IDESSLGEVK
LLRKLG