Gene Cagg_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3343 
Symbol 
ID7267083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4055528 
End bp4056637 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content46% 
IMG OID643568152 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002464623 
Protein GI219850190 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00148167 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000416126 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACTAC CTTTAAATCA GGTTATTGAA GGCGATTGCG TGGAAATACT GAATACGTTA 
CCAGAAACAT CCATTGACCT TATTTTTGCC GATCCCCCCT ATCATTTACA ATTACAGAAC
GAACTGTATC GACCAAATAT GACGAAAGTG GACGCTGTCG ATGACGACTG GGACAAGTTC
GAGTCGATGC AAGCGTATGA TGAATTTACT CGAACGTGGT TAACGGCATG TAAGCGGGTC
TTGAAACCAA CCGGCACCAT CTGGGTTATC GGAACGTACC ATAATATCTT TCGTGTTGGG
GCCATAATGC AGGATTTAGG GTTCTGGATC CTCAATGATG TTATCTGGAT AAAACTCAAT
CCGATGCCTA ACTTTCGTGG TGTCCGGTTT ACCAATGCCC ATGAAACCCT CATTTGGGCA
AGTACCGGCA AAGATGCAAC ATATACGTTC AACTATTACG CGATGAAAGG GTTGAACGAT
GAAAAGCAAA TGCGTTCTGA TTGGTGGCTT TTACCGTTAG CGACGGGATC GGAACGGGTA
AAAAATGAAA ATGGCGATAA AGCCCATTCC ACGCAGAAGC CGGAGGCGTT ACTCTATCGG
GTGATTCTAT CCTCCAGCAA TCCCGGTGAT GTTGTGCTTG ACCCATTTTT TGGAAGCGGA
ACAACAGGTG TTGTCGCCAA ACGCTTGCAT AGAAATTGGA TTGGGATTGA AAAAGAGAAA
AAATATATCC AGATTGCGCA AAAGCGCATT GACGCAGTGC AACCAGAAAT GTTTGACGCT
GCGACGTTTG ACGTAAAGAG CAAAGCCAAA TCTGCTCCTA AAGTGGAGTT TTCGGTTCTG
GTCGAACATG GGTATGTACA GCCTGGGCAA CGATTGTTTT TTGGAAAAGA CAAAACGAAA
GTGGCCACAA TCAAGCCTGA TTCTCGGCTC CGTACTGCGG ACGGTTTCGA GGGCAGCATC
CATCAGGCCG GTAGCCATTA CATGAACAAT GCGCCCTGTA ATGGATGGGA GCATTGGTTT
ATCGAAGTTG ATGGTCAAAT GATCGGTCTT GGTGAAGTGA GAGAAAAGTT TCGGGTAGAC
AAGGGGCTTT ACAATGAGCG ATCAGGTTAA
 
Protein sequence
MPLPLNQVIE GDCVEILNTL PETSIDLIFA DPPYHLQLQN ELYRPNMTKV DAVDDDWDKF 
ESMQAYDEFT RTWLTACKRV LKPTGTIWVI GTYHNIFRVG AIMQDLGFWI LNDVIWIKLN
PMPNFRGVRF TNAHETLIWA STGKDATYTF NYYAMKGLND EKQMRSDWWL LPLATGSERV
KNENGDKAHS TQKPEALLYR VILSSSNPGD VVLDPFFGSG TTGVVAKRLH RNWIGIEKEK
KYIQIAQKRI DAVQPEMFDA ATFDVKSKAK SAPKVEFSVL VEHGYVQPGQ RLFFGKDKTK
VATIKPDSRL RTADGFEGSI HQAGSHYMNN APCNGWEHWF IEVDGQMIGL GEVREKFRVD
KGLYNERSG