Gene Cagg_3340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3340 
Symbol 
ID7267080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4052982 
End bp4054025 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content49% 
IMG OID643568150 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002464621 
Protein GI219850188 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000107199 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000355353 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGGTG TGATTGAGGA AATGGACTTC AAGATAGACC GATTGCCCAA GGGTTTGCAA 
GATACGTTCC GCGAGCTGTA TGTGAACGGG AACGGGAATG GGACTCTGCA TGACGTGCGT
CCGAACGAAA TCTATGTGGG AGATGCTCGA GCGCTCTTAC CAAACATAGA GCCTAATAGT
ATTGCGTTGA GTGTTTGGTC ACCACCCTAT TTTGTTGGTA AAGAATATGA GGCGCACTTG
TCATTTGAAG ATTGGCAGGA TCTGTTACGA ACGGTCATCC ATCTTCATTT CCCGATCATC
AAACCTGGAG GGTTTCTGGT GATCAACATC GCTGACATTC TGGTGTTCAA AGATCCTTCG
ATGCCTCGTA TTCAAGCCGA AGCGGTGACC AGAAAGCGTT GTCCCGTGAC AAAAGCGGAT
GTATTGAAAG CGATGGCCGA ACATCCAGAC TATAACCGTT ATCAGCTTGC GAAGCTGCTT
GGATGCAGCG AACAAACGAT CGACCGTCGG CTGCACGGCA ACAACATCCG AGGTGGAAAG
TATGAATCAC AAACTCGCGT CAAGATTGTT GGCGGTCTTG TGGAAGAGTG GGCGTTAAGT
GCCGGGTTGT ATCCGTATGA CCGTCGCATT TGGGTGAAAG ATGCTGCTTG GGAAAACTCG
CGGTGGGCGA GTCTCTCCTA CCGATCGGTC GATGAGTTTG AGTACCTGTA TTTCTTCTGG
AAACCAGGAA TTACCAAATT TGATAGAAAA AGGCTTTCCG CCGACGAATG GAAGAATTGG
GGTTCCAGGG GAGTGTGGTA TGTTCCCTCG GTGAGAGCGA ATGACGATCA TGAGGCCAAA
TTTCCCATAG AGTTACCCAC CAGGGTCATT CGATTGCTGA CCGATCCTGG TGATATTGTG
CTTGATTGTT TCATGGGAAG CGGGACAACA GCAGTAGCAG CCATACGCGA GAATCGTCAG
TATATCGGGA TAGAGATTCT GGAAAAATAT GTAAACTTGG CACGCCAACG AATTGCAGCG
GAACATTTCA GTACGGGGAA ATAA
 
Protein sequence
MSGVIEEMDF KIDRLPKGLQ DTFRELYVNG NGNGTLHDVR PNEIYVGDAR ALLPNIEPNS 
IALSVWSPPY FVGKEYEAHL SFEDWQDLLR TVIHLHFPII KPGGFLVINI ADILVFKDPS
MPRIQAEAVT RKRCPVTKAD VLKAMAEHPD YNRYQLAKLL GCSEQTIDRR LHGNNIRGGK
YESQTRVKIV GGLVEEWALS AGLYPYDRRI WVKDAAWENS RWASLSYRSV DEFEYLYFFW
KPGITKFDRK RLSADEWKNW GSRGVWYVPS VRANDDHEAK FPIELPTRVI RLLTDPGDIV
LDCFMGSGTT AVAAIRENRQ YIGIEILEKY VNLARQRIAA EHFSTGK