Gene Cagg_0108 details


Gene Information

Locus tag: Cagg_0108
Symbol:
ID: 7266846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 152482
End bp: 153744
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 54%
IMG OID: 643564980
Product: DNA/RNA non-specific endonuclease
Protein accession: YP_002461496
Protein GI: 219847063
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1864] DNA/RNA endonuclease G, NUC1
TIGRFAM ID:
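
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both are assumptions, not statements made by the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 152482
end_bp = 153744

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed gene length (1263 bp) and protein length (420 aa).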


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.94824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.647846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGAC GCTATCTGTG GCTTTTACTT CTCCTCTGGG TTGCCTGCTT CAGCGCCTAT 
TCCGTACCGA CTGCGGTCAA CACATCAACG AGTCGACATC TCGCTCTTGG CAATCCGAGC
AATGCTGTAG CCGATCCCGC CCAGCCCAAC AACTACTTAA TCGAACGCGA GGCATATGCA
CTGGCGTATC AGCGCGACAG TGGAATTGCA CGCTGGGTAA GCTGGCATCT GACCTTGACC
GATTTTAGTC CGGCACAAAC CGATCGGTAT AGTGGCAATT TTATCGTCGA TCCTACCATC
AGTGCGCTTG GTTGGCCATA CGCGACGCAC AGTGATTACA CCAACACCGG CTACGACCGT
GGTCATCTCA CCCCTTCCGG CGATCGTCTC TCGAGCGATC TGGTACAGCG TGAGACCTTC
TACCTCGCCA ATATCGTACC GCAAGCTCCG GATAATAACC AAGGGCCATG GCGATTGCTC
GAAGAACATA CGCGCAACCG GGTGCGTGCC GGTAACGAAG CCTATGTGAT CGGGGGAACA
ATTGGGAGTA ATGGTACGAT TGGGAACGGG AAGATTGTTG TTCCTGCCGA GTTATGGAAA
GTCGTGGTCG TCTTACCGGA GGGCGATAAT GATCTCGCTC GCATCACTGC CGAGACTGAA
GTCGTCGCGG TGATTATGCC GAACATCAAT GGTCTCAGCA ACGATTATAC TGAGTACCGC
TCTTCAATCG CCTGTATCGA ACAACGGACC GGACTTGATC TGCTCAGCAA CGTAGCACCG
GAAATCCAAG CGGCATTGGC CGGACCACAC TGCTCAACCA ATCCCCAGAA AATCTTTCTT
CCGCTCGTGA TCGGTCAGGC CGAAGGCACG AATGAATCGA CGCCGACACC GATTCCACCA
ACACCGACGC CGACGCCGAT TCCACCGGCG GTACGCATCA CCTACATCGA ATACAATCCA
CCGGGTGATG ATGTAGTGGG TGAATACGTC CGTATCGTGA ACGAGGGAAC GACACCGGTT
GATCTAACCG ATTGGATATT ACGCGATGAG GCCGGAGCCA CGTTTGTCTT CCCATTTTCT
GTCGTACCCG CCGGTGGCCG CGTGCAGGTA TGGACGGGTA GTGGTATCAA TACCAATGAA
CATCTGTACT GGGGACGCAA TCAAGCCGTT TGGAATAATA CCGATCAAGA CGGCGATGGT
GATCGCGATA CTGCTTTTCT CATCGACAGG AACGGCAATG TGATCTCAAC ATACCGGTAT
TGA
 
Protein sequence
MSRRYLWLLL LLWVACFSAY SVPTAVNTST SRHLALGNPS NAVADPAQPN NYLIEREAYA 
LAYQRDSGIA RWVSWHLTLT DFSPAQTDRY SGNFIVDPTI SALGWPYATH SDYTNTGYDR
GHLTPSGDRL SSDLVQRETF YLANIVPQAP DNNQGPWRLL EEHTRNRVRA GNEAYVIGGT
IGSNGTIGNG KIVVPAELWK VVVVLPEGDN DLARITAETE VVAVIMPNIN GLSNDYTEYR
SSIACIEQRT GLDLLSNVAP EIQAALAGPH CSTNPQKIFL PLVIGQAEGT NESTPTPIPP
TPTPTPIPPA VRITYIEYNP PGDDVVGEYV RIVNEGTTPV DLTDWILRDE AGATFVFPFS
VVPAGGRVQV WTGSGINTNE HLYWGRNQAV WNNTDQDGDG DRDTAFLIDR NGNVISTYRY
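
As a cross-check between the two sequences above, the following is a minimal Python sketch that strips the 10-character block spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11 (the table named in the record). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of the source database. The example uses only the first line of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence, copied from the record above.
first_line = "ATGAGTAGAC GCTATCTGTG GCTTTTACTT CTCCTCTGGG TTGCCTGCTT CAGCGCCTAT"
print(translate(first_line))  # MSRRYLWLLLLLWVACFSAY
```

The translated fragment matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would allow comparison with the listed GC content.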