Gene Cagg_3318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3318 
Symbol 
ID7267794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4021266 
End bp4022759 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content58% 
IMG OID643568131 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_002464602 
Protein GI219850169 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.420687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000494763 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACGC TTTCTGAATC GGTCTGGATC CTCGGCGACC AACTCAGCCC GACCCATCCC 
GGTTTGGCCC CCAACCGCCG CGTAGTGCTG ATCGAATCGC TGGCTCGCCT TAATCAACGC
CCGTACCACC GGCACAAGCT GGTGCTGATT ATCAGCGCCA TGCGCCATTA CGCTGCTGAA
CTACGTGCAG CCGGCTACGA CGTCGATCTC CGTGTAGCAC CCGATTTTCT GAGCGGCTTA
CGTGATCATG TCAATACGTT TGGGGTACAG CACTTGTACT GCCTCGCCGC CGCCGAATAT
GCCACCCGCC AATTCCAACA CCGTCTCACA GCCGAACTCG GTATTCCGGT GACGATCCTG
CCTAATACTC TCTTCCTCGT CGAACGCTTC CCTCCGCCCC GCTCACCGGC GTTGATGGAA
CCGTTTTACC GCATGATGCG CCGGCAGACC GGCCTCTTGA TTGAGCCAGA CGGTCAACCT
ACCGGTGGTG TGTGGAATTT TGACCGCGAG AATCGTCGTC GGTACGATGG AAGACCGGTA
CCGCCGCCGT TGCGCTTCCC GCCCGACACA ATTACCCGGC AGACGATTGC CGACCTCGCC
GTTGCCTGTC CTCACGCCAT CGGGAGTGTC GAGGAGTTCG ACTTGCCGGT TACGCGAGCC
CAAGCCCTCA CCGCACTCGA TGATTTTATC ACGCACCGTT TACCTGACTT CGGCCCGTTT
GAAGACGCGA TGAGCGCCGA GCACGAGCTG CTCTTCCATT CCCGCCTCTC ACCGCTGCTC
AACATCGGCT TGCTCGATCC GCTGGAAACG GCGCAAGCTG CGGTGAACGC CTACGAGCGC
GGCCACGCAC CGTTGTCCTC GGTCGAAGGC TTTGTACGCC AGATTATCGG CTGGCGCGAA
TATATCTATT ACCGCTACTG GGAACTAATG CCCGACCTCT TGCAGGCCAA TGCGTGGCAG
GCCGAACGAC CGTTACCGGC ATGGTACTGG ACGGGTCAGA CTCGTATGCG CTGTTTGCAC
TGCGTGATTC AACGAGTGTT GCAGAATGGT TATTGCCACC ATATCGAGCG CCTGATGGTG
CTGTGCAATT TTGCTATGCT GGCCGGGGTA CAGCCTCAAG CGGTGAACGA CTGGTTTCTC
GAATGTTATG TCGATGCGTA TGAGTGGGTA GTCACGCCGA ATGTGATCGG GATGGGGTTA
AACGCCGACG GGGGACGCAC GGCGACCAAG CCATACATTG CCAGCGCCGC TTACATCGAT
AAGATGAGCG ACTACTGCAA AGGTTGTTTC TACGATCGAA AGGCTCGGGT CGGGCCACGT
GCTTGCCCAT TTAACACCCT CTACTGGAAC TTTCTTATCA ACCACGAAGA GCGTCTGCGC
GCTAACCCTC GTCTTGGCCC GGCGGTGTTG GGGCTAAGCC GGCTTAGCGC CGCTGAGCGG
GAAGCAATTG CGGCGCAGGC AGAAGAGGTG TTAGAACGAA TAGAGCAGTT GTAG
 
Protein sequence
MTTLSESVWI LGDQLSPTHP GLAPNRRVVL IESLARLNQR PYHRHKLVLI ISAMRHYAAE 
LRAAGYDVDL RVAPDFLSGL RDHVNTFGVQ HLYCLAAAEY ATRQFQHRLT AELGIPVTIL
PNTLFLVERF PPPRSPALME PFYRMMRRQT GLLIEPDGQP TGGVWNFDRE NRRRYDGRPV
PPPLRFPPDT ITRQTIADLA VACPHAIGSV EEFDLPVTRA QALTALDDFI THRLPDFGPF
EDAMSAEHEL LFHSRLSPLL NIGLLDPLET AQAAVNAYER GHAPLSSVEG FVRQIIGWRE
YIYYRYWELM PDLLQANAWQ AERPLPAWYW TGQTRMRCLH CVIQRVLQNG YCHHIERLMV
LCNFAMLAGV QPQAVNDWFL ECYVDAYEWV VTPNVIGMGL NADGGRTATK PYIASAAYID
KMSDYCKGCF YDRKARVGPR ACPFNTLYWN FLINHEERLR ANPRLGPAVL GLSRLSAAER
EAIAAQAEEV LERIEQL