Gene Cpha266_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2054 
Symbol 
ID4568738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2379403 
End bp2381598 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content53% 
IMG OID639766635 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_912490 
Protein GI119357846 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair
[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATGGC TCTACAATCA GATGGCAATG CCCGAAACCA TTTTTCAGGC CTGGTACAAA 
GTGGCATCGA ACGATGGGCG CCCGGGATGG GACAATACAT CCATTCAGGA CTACTCCCTT
CGGCTTGAAG AAAACCTGAA ATCCCTTTCG CACGCCCTGC TGACAGGAAC CTACAGGCAG
AGCCCGCTGC TGAAGCTCGT TATGCTGAAG CCTGACGGAA AAGAGCGGGT GCTTCTGATT
CCCGGTGTGA TTGACAGAGT TGCCCAGACA GCGGCATCAA TCGTGCTCTC ACCCATTATC
GAAGCGGAGC TTGGCAACTG CACCTTTGCC TACCGTCCGG GCATATCGCG TGAAGGAGCA
GCACGGGAGA TCGACCGGCT GCATCGCGAA GGGTACCAGT GGGTGCTCGA CGCCGACATC
CGCAACTTTT TCGACAATGT TCGCCACGAC CTGCTTTTTC AACGGCTTGT CGAACTTGTT
GACGACAAGG AGATGATCTC CCTGTTGCAC CGCTGGCTTA CCGCCGAAAT TGTTGACGGG
CTGAACCCCC GTACCCGCAA CACGATGGGT CTCCCTCAGG GATGCCCGAT ATCACCGGCA
CTGGCGAACC TCTATCTTGA CCGGTTTGAT GAAACAATGG AACAGCAGGG ATTCAAACTG
GTTCGTTTTG CCGACGACTA TCTTGTACTC TGTAAAACCC GTCCCAAAGC CGAAGCTGCC
CTCAAGCTCT CTGAAAGCGC GCTTGCTGAG CTGAAGCTTG AACTGCACAG CGATAAAACC
CGCATTACCA CCTTTGCCGA AGGATTCAAA TATCTCGGCT ACCTCTTTAT CCGATCGCTG
GTGATTCCCA CCAAAATGCA CCCTGAAGAG TGGTATGACA AGCTCGGCAA ATTCAGACTC
CGCAAAAAGA GCGTGCACGC CCTTCCCGCT GACCCCGAAA CCATGACCGG CGAAACATCG
CAGTTTGAAC TCGAAACCGA TCAGGGCGAA AAAATCGAAC TCTCCAAGGA GGAGCTTCTG
CAGACAGAGT TCGGCAAAAA ACTGCTCGAA AGCCTCGATA AAAAGCAGTT GAATGTTGAC
GAATTTCTTG AAAAGATCTC CAGGCAGGAC GAAGAGCGAC AGAAAGAGAA ACGGGAGGCA
CTGAAAAAAC TCTACTCCCC CTTTCTCAAT ACGCTCTATC TGCAGGAGCA GGGGAGCCTT
ATGCGCAAGG ATGGGGAGCG GTTCAGTATT GAAAAGGATG GGTCGGTCAT TAACGAAGTG
ATTGTTCGCC GCATCGAACA GGTTGTGATT TTCGGCAACG TCGCCCTCAC CACACCGGTC
ATGCAATACT GCCTGCAGAA CGAAATACCG GTCACCTTTC TTTCGCAGCA TGGCAAATAC
TTCGGCAGGC TTGAAGCAAC CACTGCCGAC AATGCTGAAA TGCAGCGCTT TCACTTTCTG
CGTTCCATAG ACGAACCCTT TGCGCTTGAA ACCGCCCGTT CCATCGTAGC GGCAAAAATC
AGCAACAGCA AAACCATGAT TCGCCGCCGA AAAACCGTGG TGCAGGATCG CGACAGCACT
CTGCAAAATA AAATGGCATA CAATCTCGAC ATCATGGCCG ATCTTGCCCT GAAAGCGGAA
GCATCCACTG ATATCGATGC GCTACGGGGG ATCGAAGGCA AGGCATCGGC ACTCTACTTC
GAATGTTACG GCATGCTTTT CAGCAAAAAC CTGCCCTTTC ACACCCGGTC GTTTCTGCGG
GTACGACGAC CGCCTACCGA TCCGGTCAAC AGTCTGCTCA GTTTCGGTTA CACCATGCTG
CACACCAACA TATTCTCGAT GGTGCAGGCA AGCGGCCTGA ACCCCTATAT CGGTTTTCTT
CACGCCGAAC GAAAAGGCAA TCCCGCTCTG GTCAACGATC TGGTCGAAGA GTTCCGCACG
ATAGTCGATT CACTCGTGCT CTACACCCTC AACCGGGGTC TTTTGCAGGA AAAAGACTTC
TACTACCGCA AAGATGAGCC CGGTTGCTTT CTGTCGAACG ACGCCCGCAA ACGGTTTTTA
AACATATTCG AAACAAGGAT GTGGCAGGAA TCCCGTGACG GCTGCACCGG CAAAACGCTC
AATTTCAGGC GGCATATCGA AAAACAGGTG AGGATCATGA GAGAGGTTAT AGCCGGAACC
CGAACGCAGT ACGACCCGTA CAAGCTACCG GTATAA
 
Protein sequence
MGWLYNQMAM PETIFQAWYK VASNDGRPGW DNTSIQDYSL RLEENLKSLS HALLTGTYRQ 
SPLLKLVMLK PDGKERVLLI PGVIDRVAQT AASIVLSPII EAELGNCTFA YRPGISREGA
AREIDRLHRE GYQWVLDADI RNFFDNVRHD LLFQRLVELV DDKEMISLLH RWLTAEIVDG
LNPRTRNTMG LPQGCPISPA LANLYLDRFD ETMEQQGFKL VRFADDYLVL CKTRPKAEAA
LKLSESALAE LKLELHSDKT RITTFAEGFK YLGYLFIRSL VIPTKMHPEE WYDKLGKFRL
RKKSVHALPA DPETMTGETS QFELETDQGE KIELSKEELL QTEFGKKLLE SLDKKQLNVD
EFLEKISRQD EERQKEKREA LKKLYSPFLN TLYLQEQGSL MRKDGERFSI EKDGSVINEV
IVRRIEQVVI FGNVALTTPV MQYCLQNEIP VTFLSQHGKY FGRLEATTAD NAEMQRFHFL
RSIDEPFALE TARSIVAAKI SNSKTMIRRR KTVVQDRDST LQNKMAYNLD IMADLALKAE
ASTDIDALRG IEGKASALYF ECYGMLFSKN LPFHTRSFLR VRRPPTDPVN SLLSFGYTML
HTNIFSMVQA SGLNPYIGFL HAERKGNPAL VNDLVEEFRT IVDSLVLYTL NRGLLQEKDF
YYRKDEPGCF LSNDARKRFL NIFETRMWQE SRDGCTGKTL NFRRHIEKQV RIMREVIAGT
RTQYDPYKLP V