Gene Cagg_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3297 
Symbol 
ID7267771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3995534 
End bp3996826 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content57% 
IMG OID643568109 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_002464582 
Protein GI219850149 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000489204 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGGTG AGTGGCGAAA GGCACGGTTG GGAGAAGTGG TACGTATCAA TCCCGATGCC 
CTTGGGAGCG ACTGGCCATT TTCTTACATT AAGTACGTTG ATATTTCGAG TGTTGGCGAA
GGTTCAATCG TAGAACCCCC GCGGATCCTA CGTTTGGATG AAGCGCCAAG CCGAGCAAAG
AGACTTGTCC GAGAGGGGGA CACGGTTCTT TCAACGGTTC GGCCTGGTAG ACGATCAATG
TTTTTCGTAA AAGAGCCAGA GCCTGAATGG GTGGTTTCCA CCGGCTTTGC TGTTTTGCGA
CCTTGTAGGG AATACATAGA ACCTCGCTAC CTCTACGCTT GTGTCTTTGA CCGAGGTCTC
ACTGAATTTC TCATCAAAAG GGAGAAGGGC GCTGCTTATC CGGCGGTCCT GCCAGAAGAC
ATAGCGGATG CGATTATCAA ACTCCCTCCC CTCCCCGAAC AACGCGCCAT CGCCCACATC
CTCGGCACGC TGGACGACAA GATCGAGCTG AACCGGCGGA TGAGCGAGAC GCTGGAGCAG
ATGGCGCAGG CGCTGTTCAA GGCGTGGTTC GTTGATTTCG ATCCCGTGCG CGCCAAATGT
AGGGGCGGGT TTGAAACCCG CCCCTACACC GACCTATTCC CCGACCGGCT GATGGACTCT
GAACTGGGGA AGATTCCGGA GGGGTGGGAT GTAGTCACGC TGCCTAAGCT GGTCGAAATC
AACCCGGGCC GTCCGCTACG CAAGGGCGAG ATCGCACCCT ACTTGGACAT GGCGAACATG
CCAACACGGG GCCACGCCCC GGACCAAGTG GCCCACCGCC CATTCACCTC GGGGACGCGA
TTTATCAATG GGGATACCCT GGTCGCTCGG ATCACCCCAT GTCTTGAAAA CGGCAAGACG
GCATTCGTGG ATTTTCTGGA GGAAGGGCAA GTCGGTTGGG GCTCTACCGA GTACATCGTG
CTTCATCCGA AACCGCCTCT CCCTGAAGAG TTCGGTTACT GCCTAGCAAG AAGCGATGCT
TTCCGCGAGT TCGCTATTCA AAGCATGACG GGAACAAGCG GTCGGCAGCG CGTACAGGCA
GACTCAATAG GCCATTTCAA GTTGCCACGT CCGCCCGATT CGGTCGCAGT AGCGTTTGGG
AGACTAGTTA AGCCGCTGTT TGCCCGATCA TCGGACGCCG TCCGTGAATC CCGCACCCTC
GCCGCCCTGC GCGACGCGCT GCTGACCAAG CTCATCTCCG GCGAGCTGCG GGTGAAGGAC
GCGGAGAAGT TTCTACGAGA GTGTGGATTA TGA
 
Protein sequence
MAGEWRKARL GEVVRINPDA LGSDWPFSYI KYVDISSVGE GSIVEPPRIL RLDEAPSRAK 
RLVREGDTVL STVRPGRRSM FFVKEPEPEW VVSTGFAVLR PCREYIEPRY LYACVFDRGL
TEFLIKREKG AAYPAVLPED IADAIIKLPP LPEQRAIAHI LGTLDDKIEL NRRMSETLEQ
MAQALFKAWF VDFDPVRAKC RGGFETRPYT DLFPDRLMDS ELGKIPEGWD VVTLPKLVEI
NPGRPLRKGE IAPYLDMANM PTRGHAPDQV AHRPFTSGTR FINGDTLVAR ITPCLENGKT
AFVDFLEEGQ VGWGSTEYIV LHPKPPLPEE FGYCLARSDA FREFAIQSMT GTSGRQRVQA
DSIGHFKLPR PPDSVAVAFG RLVKPLFARS SDAVRESRTL AALRDALLTK LISGELRVKD
AEKFLRECGL