Gene Cagg_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3337 
Symbol 
ID7267077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4049601 
End bp4050638 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content55% 
IMG OID643568147 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002464618 
Protein GI219850185 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.149909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000199825 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACGCGAA TCAACACCAC CTCCCCTTTT CGAGCACTAC CAGCGCCGGT CTATACCACC 
CGGTATGGGG CGTGTTACCT CGGCGATTCG CTCGACCTTC TCAAGCGCCT CCCAGACGAC
TGCGTTGACC TTGTGGTCAC ATCTCCTCCA TTCGCACTCC TGCGGCAGAA GGCATACGGC
AATACGGATC AATCCGAATA CGTCGAGTGG CTCTGTAGGT TCGGCGCGGA AGTGCGTCGT
GTGCTTCGCG AAACTGGCAG CTTTGTTCTC GACCTAGGTG GTGCATATCA ACGTGGCGTT
CCGGTACGGT CGCTGTATCA ATATCGCGTG CTCCTGAAGA TGTGCGATGA GGTCGGCTTT
TACCTTGCGG AAGAGTTCTT TTGGTACAAC CCCGCAAAGC TGCCTTCCCC CATTGAATGG
GTTAACAAGC GAAAAATCCG GGTGAAGGAT TCCGTAAACA CTGTATGGTG GTTCTCAAAA
TCGGAGTGGC CGAAAGCGGA CGTGCGACAG GTTCTTGCCC CTTACTCCGA GCGTATGAAG
ACGCTCCTGA AAAACCCGGA TGCGTTCTAC AAGCCAAAGA ACCGTCCATC AGGCCACGAC
ATCAGCAAGG GTTTTGGTTC TGACAACGGT GGTGCGATCC CTTCAAACCT GCTGCAGATT
CCCAATACCG AAAGCAACTC TTCCTATCTG CGGCTCTCCA AGCTCGTAGG AATCGAAGCA
CATCCTGCGC GCTTTCCTGC TGCACTCCCG GAGTTTTTCA TCAAGTTGCT CACGACTGAG
GGTGATCTGG TTCTGGACAT CTTCGCAGGG TCAAACACGA CGGGGAAAGT GGCAGAGCAA
TTTGGACGAA GGTGGATTGC GATGGAGATA GACGCGGGTT ACGTGGCCGG CTCGGCGCTC
CGGTTCATGG AGCACCTTCC GCAAAGTGAA GTGGCGGCAT GCTTCGCCCG ACTGTCCGTG
AGTTCAAGGA GCGCCCCTGT TGACCTTTCC GGACCGGCCA CGCTGTTCGA TGAAGCAAAG
GTGGAACGTG GCACCTAA
 
Protein sequence
MTRINTTSPF RALPAPVYTT RYGACYLGDS LDLLKRLPDD CVDLVVTSPP FALLRQKAYG 
NTDQSEYVEW LCRFGAEVRR VLRETGSFVL DLGGAYQRGV PVRSLYQYRV LLKMCDEVGF
YLAEEFFWYN PAKLPSPIEW VNKRKIRVKD SVNTVWWFSK SEWPKADVRQ VLAPYSERMK
TLLKNPDAFY KPKNRPSGHD ISKGFGSDNG GAIPSNLLQI PNTESNSSYL RLSKLVGIEA
HPARFPAALP EFFIKLLTTE GDLVLDIFAG SNTTGKVAEQ FGRRWIAMEI DAGYVAGSAL
RFMEHLPQSE VAACFARLSV SSRSAPVDLS GPATLFDEAK VERGT