Gene Cagg_3197 details


Gene Information

Locus tag: Cagg_3197
Symbol:
ID: 7267344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3880696
End bp: 3882324
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 58%
IMG OID: 643568018
Product: ATPase associated with various cellular activities AAA_5
Protein accession: YP_002464491
Protein GI: 219850058
COG category: [V] Defense mechanisms
COG ID: [COG1401] GTPase subunit of restriction endonuclease
TIGRFAM ID:
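
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: total codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3880696, 3882324)
print(length)                  # 1629
print(protein_length(length))  # 542
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.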


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.046131
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.40537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTCT CGTTACCCGC CCTGCAAGCC TTGCTCGAAC CCGGCGTTGG ACCACGCTGG 
CAAGCGGTGC AACGCCTACT CCATCCAGAA ATGTTGGCGC TCGCCAATCG AGTTGCCGAG
CGCGCAGCAA AATTGTTGGC TCGCCAATGG CCACTGTACG AGCTGAGTTT TAAGACACGT
CGCGCAATTG ACCGTGGTCG CGGTCGGCGT GACCCGATTG AAGATTACTG GTTTGCATTC
GACCGGCCAC CGCGCGGCGC CGGGGTTATG CTGACGGTAA GTGGGGCAGA ACGCACGGTT
GCAGCCGGCC TGCAACTCTG GGGCATCCGC CGGCCACAAT TAGCCCAACT CTGGCGCGAT
GCCCGTCCTT TGTGGGAGCC GCTCATTGAC CGGATCGGCA CCGAGGGACA GGCCCGCTTT
ACCAGTCGCC GGCCGCTACT GCCGGGGATG CGCTGGATCG ATCATTATTT GGCACAACGC
CAGGCACAGT ATCTGTGGGC CGGATTTGTT TATCCGTGGG AGAATTTGCC ATCGGCTGAA
CGGTTGATTG ATGATCTTTG TGCGCTCTTG CCGCTCAATG AAGCGTTGAT GGAACTAGCA
GAGCTTGATC TGGCAGCAAC CAATACGCTC TGCGAAACAT CGGCAGTGTA CGTAGCGGGA
ACACCGCTGT TCGAGCAGAT CGCAGCTACT ATTCGGCGAC GCGGGATGGT GATCGACGAT
CAAACGCTAC ACGCTTTTCA CCTCGCCGTA CAAGCCCGGC CGTTGGTCAT TCTGGCCGGG
CCAAGCGGTA GTGGTAAGAC GTGGTTGACC CGCCTCTACG CCGACGCATT GATCGGGGTT
GGAGAGGGTC AAGCGAATCC GATCTATCTG CCGGTAGCCG TCCAACCTGA CTGGCATAGC
GCCCGTGATC TGCTAGGGTA TTACAATTCG TTGACCGGGT TATACCAACC AACACCATTT
TTGCGTCACC TGCTTGCAGC CGCGGGCGAT CCCGGTCAAA CGTACATTGT CTGCCTCGAT
GAGATGAATC TGGCCCGACC TGAATATTAC CTTGCCCCAA TCTTATCGGC AATGGAAACG
TCCGACGGAC TGATCGATCT GGGTACTCCG CTCGCCGAAA CACCACTCGC GGGCGGTGGC
ACCGTGCGCA ATCCATTCCG TTTACCGATC AATGTACGGC TGATCGGCAC GGTTAATGTT
GATGAAAGCA CCTTTGCGCT GAGCGACAAA GTGCTCGACA GGGCCAATCT GATCGAGTTA
AAACAGGTCG ATCTGCCGGC GTTACGCACG ATGTATGGGC AGCGTATTGA CGAACAGGTA
TGGCACCTGC TCACCGAGTT CCAAACGCAT CTGAGTGCAG CCGGCAAGCC AGCGGGATAC
CGAGCGCTCG GCGAGACGTT GCGCTTCATC GAGCAAGTAA GTGGAATGAG TGCATTGGAA
GCGCTCGATA TGCAGCTTAT GCAACGATTA CTGCCTCGTG TCCGTGGTGA TGACACACCA
CGGTTCCGCC AAGCGCTGCG CAGCTTACTT ACCCTCTGTA GTGGCCGCTT ACCGCGCAGC
GCCGAGCGGC TCGAACAGAT GTTAGCACGC CTTGACCGCG AAGGATACAC TGACTTTTAC
GGCTATTAA
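
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation such as GC content. The following is a minimal sketch of that step; `normalize_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed above (six blocks of ten bases).
first_line = "ATGCCGTTCT CGTTACCCGC CCTGCAAGCC TTGCTCGAAC CCGGCGTTGG ACCACGCTGG"
seq = normalize_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions would give a value that can be compared against the GC content field of the record.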
 
Protein sequence
MPFSLPALQA LLEPGVGPRW QAVQRLLHPE MLALANRVAE RAAKLLARQW PLYELSFKTR 
RAIDRGRGRR DPIEDYWFAF DRPPRGAGVM LTVSGAERTV AAGLQLWGIR RPQLAQLWRD
ARPLWEPLID RIGTEGQARF TSRRPLLPGM RWIDHYLAQR QAQYLWAGFV YPWENLPSAE
RLIDDLCALL PLNEALMELA ELDLAATNTL CETSAVYVAG TPLFEQIAAT IRRRGMVIDD
QTLHAFHLAV QARPLVILAG PSGSGKTWLT RLYADALIGV GEGQANPIYL PVAVQPDWHS
ARDLLGYYNS LTGLYQPTPF LRHLLAAAGD PGQTYIVCLD EMNLARPEYY LAPILSAMET
SDGLIDLGTP LAETPLAGGG TVRNPFRLPI NVRLIGTVNV DESTFALSDK VLDRANLIEL
KQVDLPALRT MYGQRIDEQV WHLLTEFQTH LSAAGKPAGY RALGETLRFI EQVSGMSALE
ALDMQLMQRL LPRVRGDDTP RFRQALRSLL TLCSGRLPRS AERLEQMLAR LDREGYTDFY
GY