Gene Cwoe_4658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4658 
Symbol 
ID8735124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4961066 
End bp4962496 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content77% 
IMG OID646505287 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_003396446 
Protein GI284046106 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR02765] cryptochrome, DASH family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0553081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.710404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT CGACCGCGAT CGTCTGGTTC CGCCGCGACC TGCGCCTGCA CGACCATCCG 
CCGCTCGTGC GCGCGCTCGC CGCCCACGCG CGCGTCGTGC CCGTCTTCGT GCTCGACCCG
GCGATCGTGC GCGGGCGCTT CGCGTCGGGC GCGCGGACGG CGTTCATGCT CGACTGCCTG
CGGGAGCTGG ACGCGGACCT GCGCGAGCGT GGCAGCGGGC TGGTCGTGCG CGAGGGCAGG
CCGGAGCGCG AGCTGCCGGC GCTGGCGCGC GAGATCGGGG CTGCCGCCGT CCACTGGGCG
AGCGACGCGA CGCCGTACGC GATCGCCCGC GACCGCCGTG TGCGCTCGGC GCTGGCGGCG
GCGCAGCCCG CGGTCGCGGC GGTCCCCGGG CCGGGCAACT TCGTCGCCGA CGTCGGCCGT
CCGCGCACGA GAGCCGGCGG GCCGTACACG GTCTTCACGC CGTTCCACCG CGCCTGGCAA
CAGCTGGAGC GCCGCACCGT CCACCGCACG CCGGCCGTGC TGCCGCCGCT GCCGGCGGGG
CTGCGCAGGG GCGCACTCCC GTCGCTCGCT GCGCTCGGCC TGACCGACGA GCTGAGCCCC
ACTGCGCGCG CGGTCGAACC CGGCGAGCGG GCGGCTCGCC GCGCCGCCGA GCGCTGGCTC
GACGGCCACC TCGGCGACTA CGCGCGCGAC CACGACCGGC TCGCCGGCGG GACCTCGGCG
CTGTCGCCGT ACCTCCACCA CGGCTGCCTC TCGGCGCGTG AGTGCGAGCA GCGCGCCGTG
CGCCGCGGCG GCGAAGGGGC GGAGGCGTTC GTCCGCCAGC TCGCCTGGCG TGACTTCTAC
GCCCACGTGC TGCTGCACCA CCCAGAGGAC GTGCGGCGCG AGCACCAGGA GCGGATGCGC
GCGCTCAGGT GGGAGCGGGA CGACGAGCTG CTGGCCGCGT GGCAGGACGG CCGCACCGGC
TTCCCGCTCG TCGACGCCGG CATGCGTCAG CTGCGCGCGA GCGGCTGGAT GCACAACCGC
GCGCGGCTCG TCACCGGCTC GTTCCTGACG AAGGACCTGC AGCTCGACTG GCGCGCGGGG
GAGGCGTGGT TCATGCGCTG GCTGCTCGAC GGCGACGTCG CCTCCAACAA CGGCAACTGG
CAGTGGATCG CGTCGGTCGG CGTCGACCCG GCTCCGGCGT TCCGGCGGAT CCTCAACCCG
GCGCTGCAGC AGCGCCGCCA CGACCCCGAC GGCGCCTACG TGCGCCGCTG GGTGCCCGAG
CTGGCGCGCG TGCCCGACGC GCTGCTGACT GAGCCGTGGC TGATGAGCGA GCAGCAGCAA
CGCGCCGCCG GCTGCCGCAT CGGCGCCGAC TACCCGGCGC CGATCGTCGA CCACGCGCAC
GAGCGCCGGC GCGCGCTGGA GCGCTACCGG GCGGCGGGCA GCGAGTCGTA G
 
Protein sequence
MTDSTAIVWF RRDLRLHDHP PLVRALAAHA RVVPVFVLDP AIVRGRFASG ARTAFMLDCL 
RELDADLRER GSGLVVREGR PERELPALAR EIGAAAVHWA SDATPYAIAR DRRVRSALAA
AQPAVAAVPG PGNFVADVGR PRTRAGGPYT VFTPFHRAWQ QLERRTVHRT PAVLPPLPAG
LRRGALPSLA ALGLTDELSP TARAVEPGER AARRAAERWL DGHLGDYARD HDRLAGGTSA
LSPYLHHGCL SARECEQRAV RRGGEGAEAF VRQLAWRDFY AHVLLHHPED VRREHQERMR
ALRWERDDEL LAAWQDGRTG FPLVDAGMRQ LRASGWMHNR ARLVTGSFLT KDLQLDWRAG
EAWFMRWLLD GDVASNNGNW QWIASVGVDP APAFRRILNP ALQQRRHDPD GAYVRRWVPE
LARVPDALLT EPWLMSEQQQ RAAGCRIGAD YPAPIVDHAH ERRRALERYR AAGSES