Gene Cpha266_1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1961 
Symbol 
ID4570136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2274228 
End bp2275634 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content53% 
IMG OID639766542 
Productdeoxyribodipyrimidine photo-lyase type II 
Protein accessionYP_912398 
Protein GI119357754 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.351083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAC ATCGGCTGAC ACAAAAACAG AACGCCCTTA TGATTGATCC CCGCCGAACA 
AGAGTGTTGA ACTCCTGCAG TGACAAACCG GGAGCGGTTA TTTACTGGAT GTCGCGCGAT
CAGCGCCTGA ACCATAATTG GGCGCTGCTC TTTGCAAGAG AGAAAGCCGC TCGGAAAGGC
CAGCCTCTTG TTGTTGTCTT CGCTCTGGCC CCATCATTCC TCGACGCTCC GTTCAGACAT
TACGACTTCA TGCTTAAAGG TCTTGAAGAA ACCTCCAAAG CTCTCGAACG GATCAATATC
CCCTTTATGC TGCTCGAAGG AGAGCCGGAT ACAGAGATCT CACGATATGC CTGCCAGTCC
GAAGCAGGAG CTGTCGTTAC GGATTTTTCT CCCCTGAACA TTTCCCGAAA CTGGAAAAAA
AAGGCAGCCG ACATCCTCGA CATTCCTCTC TATGAGGTCG ATGCCCATAA CATTGTCCCC
TGCTGGTATG CATCCGACAA ACAGGAGTAT GCGGCCAGAA CCCTGCGCCC GAAACTGCAG
GCCCGCCTTG ATGAGTTTCT TGTTCCGTTT CCAACGATTC TGCCGCTTCC GGCACCTCAC
GTTCACCACC GCTCTCCCGA CTGGAAACAG GTCCGGGAAC GGCTCCAAAA AGATCGCTCC
GTACCGCCGG TGAACCGGAT CGCTCCTGGA GAAACGGCCG CAGCAGAATC GCTTGAAAAC
TTCATCAAGA GCAGGCTTTC GGGATATGCC ACGGCTAGAA ACGACCCGAA CAGCAATGCC
CTGTCACAAC TCTCTCCCTA CCTTCATTTC GGTCAGATCA GTGCCCAGCA TGTTGCGTTG
CGGGTTGCCG AAAGCCGTGC GCCACAGAAA GACAAGACGG CCTTTCTCGA GGAGCTGATT
ATCCGCAGGG AGCTTTCGGA TAATTTCTGC AACTACAACC CGAGCTATGA CCGGTTTGAA
GGGATCCCTG CATGGGCGAA GCAAACGCTG CTTCTTCATG GGCAGGACAA ACGGGAGTAC
CTGTACACCA TCGACGTTTT CGAAAAAGCT GCAACGCACG ACAAGCTCTG GAACGCTGCC
CAATCAGAGC TGGTTCAAAG CGGAAAAATC CACGGTTATA TGCGGATGTA CTGGGCGAAA
AAAATTCTCG AATGGAGTTC GTCTCCTCCC GAGGCATTTG AGATGGCGAT CTATCTCAAC
GACCGATATG CGCTTGATGG AAGGGATCCT AACGGTTATG CTGGGGTGGC ATGGTCGATT
GGAGGCTTGC ATGACCGCCC ATGGTTCGAA CGTCCGGTCT ATGGCAACAT CAGATACATG
AACGCCAGCG GGTGCAGAAG AAAGTTCGAC GTTGAGCGCT ACATAAGCCG GTTTCGGGAA
CCGGCGACAC TGTTCCCGAA TGCGTAA
 
Protein sequence
MSEHRLTQKQ NALMIDPRRT RVLNSCSDKP GAVIYWMSRD QRLNHNWALL FAREKAARKG 
QPLVVVFALA PSFLDAPFRH YDFMLKGLEE TSKALERINI PFMLLEGEPD TEISRYACQS
EAGAVVTDFS PLNISRNWKK KAADILDIPL YEVDAHNIVP CWYASDKQEY AARTLRPKLQ
ARLDEFLVPF PTILPLPAPH VHHRSPDWKQ VRERLQKDRS VPPVNRIAPG ETAAAESLEN
FIKSRLSGYA TARNDPNSNA LSQLSPYLHF GQISAQHVAL RVAESRAPQK DKTAFLEELI
IRRELSDNFC NYNPSYDRFE GIPAWAKQTL LLHGQDKREY LYTIDVFEKA ATHDKLWNAA
QSELVQSGKI HGYMRMYWAK KILEWSSSPP EAFEMAIYLN DRYALDGRDP NGYAGVAWSI
GGLHDRPWFE RPVYGNIRYM NASGCRRKFD VERYISRFRE PATLFPNA