Gene Cagg_2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2286 
Symbol 
ID7266699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2789643 
End bp2790722 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content56% 
IMG OID643567116 
Productrestriction endonuclease 
Protein accessionYP_002463601 
Protein GI219849168 
COG category[V] Defense mechanisms 
COG ID[COG1787] Predicted endonuclease distantly related to archaeal Holliday junction resolvase and Mrr-like restriction enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.227945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCAA GCTCACGTTC CAGTACAGAA ACTAATCTTC TCGCTTCTTT GCTTAGTATC 
CTCATCGTAT TCAGTCTATT CAGCGGGTTG TCGCTCTGGT GGATAATCAG CCTGATAGTT
GGTAGTCTCA CAATTGTTTT CTTTATTGCC CGCTCGCTGC ACCATGCACG TGTGCAACGG
TTGTATCGTC AGCAATTGTT AGCATTATCA CCGAGCGAAT TCGAGCAGCG CATCGCCCTG
TTGCTTGAAG ATCTAGGGTG GCAGAACGTT GTGGTGCGCG GTGGCAGTGG TGACCGCGGT
GTTGATATTA CCGCCCAACG AGACGGTTTG CGCTACATTA TCCAATGCAA ACGGTACACC
AAACCGGTCG GACCCAACTA CGTGCGCGAT CTCGTTGGCG CGCTCCAGAT TCAGCAAGCT
GACCGAGCCA TTTTGGTGAC GACCAGCACC TTCACCGATC AATCGCGTCT CGAAGCCCGC
GGGCAAGCTC TCGAATTGTG GGATCATCGA ATACTGTGGC AACGGATCGA AGAGGCCGAA
CAGCGACGAT TGACCAACCA GCAGCGCCGG AAACGGTCGG TGGCCCTCCC GGTTGCTTTC
GCGCTTGGTC TCAACCTCGT GGTTGCCGGC ATCGCCTTCA GCATCAGCGG ACCGCCGGTC
ATCAGCATAG ACCGCATTGG GCAACTGGTG CCGGTTGGAG AAACGAACGG TGAACGTGCG
ACGTCCCGGT CGTTGGGCAC CAATAGCCCT ACACTACAAT CAACGGTACC TAACCGTCCT
TCGGTCACAC CACGTCCCAC TCGTACCCCA CAACCAACGG CGACGCCACA ACCGACCTCC
ACACCGGTAC GACCGACCGC ATCGGTTTTT AATGGTGGGA ACGTGCGCGC TGCACCTAAC
CTCCAAGGCA CCGTCCTTGA TCAAATTCAC GCCTATGAAA CGGTCATCCT GCTTGGCCGC
AGTGCCGATG GGGTATGGAT ACGGATTATC AACCCGCGCG GCCAAGAGGG TTGGGTCCAC
CGCAGCCTAT TGACCCTTGA TCCGGCAATC GCCGAGACAC TGCCGGTGAT CACACCGTAG
 
Protein sequence
MPSSSRSSTE TNLLASLLSI LIVFSLFSGL SLWWIISLIV GSLTIVFFIA RSLHHARVQR 
LYRQQLLALS PSEFEQRIAL LLEDLGWQNV VVRGGSGDRG VDITAQRDGL RYIIQCKRYT
KPVGPNYVRD LVGALQIQQA DRAILVTTST FTDQSRLEAR GQALELWDHR ILWQRIEEAE
QRRLTNQQRR KRSVALPVAF ALGLNLVVAG IAFSISGPPV ISIDRIGQLV PVGETNGERA
TSRSLGTNSP TLQSTVPNRP SVTPRPTRTP QPTATPQPTS TPVRPTASVF NGGNVRAAPN
LQGTVLDQIH AYETVILLGR SADGVWIRII NPRGQEGWVH RSLLTLDPAI AETLPVITP