Gene Cagg_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1070 
Symbol 
ID7268522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1320412 
End bp1322205 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content58% 
IMG OID643565915 
Productprotein of unknown function DUF324 
Protein accessionYP_002462420 
Protein GI219847987 
COG category[L] Replication, recombination and repair 
COG ID[COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.478197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTC GTGCCATCAC TGCCAATCTA ACGCTGCGTA CAGCGCTTCA CGTGGGCACG 
GGCAGCGATA CGGAAACGAC CGACGATCTC TTGCGCCGTG ACGTGCGGGG GCGGCTGCTC
ATACCGGGTA CCGCTATCGC CGGTGTGCTG CGCAGCATTG CCACGCGGCT CGCGCCTCGC
TTTGGGGAAA GTCCGTGTCA AGTGATTGAC AACCAATCGT CAAACGACGC CTGTCAATGC
CTCGTCTGTC AACTCTTCGG CGATGTGAAT CCAAGGGAAG ATAGTGATAC CGCCACTGCT
ACCCGGGTGC TGGTCTACGA TGCCGTGCTC GATACGATCC CATCGCTGAC CATCCGCGAT
GGCGTTGGGA TTGATCGGGT GACCGGTGCG GCTGCGCGCC GCGAACGGAT TAAGTTCGAT
TACGAAGTGT TGCCGGCAGG GACGGTATTT ACTCTGAGGC TTGAGATCGA TCCCAAACTG
CCTGATGTGC CGAAACTGCT GCCCTTATTG GCGGCGACGT TGGCCGAATG GGAACAGGGA
CGAGGCGCAA TCGGTGGTCG GGTCAATCGG GGGTTGGGTG CGTTTACCCT CAACGAGGTG
CAGTGGATCG AGCGCGATCT GAATCAGGCA GGGGTGCTGA TCGAGTTTTT ACACCGCGGG
CCGCCTTGGC ACACTACTGA CGGCGACCGC ACATGGCTGA CCAATCAGGT CCAGCAGGTG
CGGGCGTGGG TAGAACCGTA CCAAGGTAAT GATCCGGTTG CCCGTTCATG GGTTTTGGCC
GAATTTACAC TGGCCGCAAC CGGTCCCTTT CTGACCAACG ATGTGGCTCA AGCCGGTCGG
AGCGGGTTTG ATCATGCGCC GGTGTTCGCG GCTTATGAGC AAGGGGCTAA GCCGGTGTTG
CCCGGTTCGA GCCTGCGCGG CGCCTTGCGG AGTCAGGCCG AGCGGATCGC CCGTACTCTG
GCAACCTTCA GCGCGTGGGA TAACGGTAAG GATATGAAGA GTCGCAAAAC GTCATTCTTA
GCGACATGCC CCGCCTGTAA TCCGCTGACG ACCAAGACCG ATGACCCGGT TGCAAGTTGT
AACAGCTTTA TCAAGGCTCG ACCAAAGGTT GAACGAGATA CGTTGGAGCA AAAAGGGGCA
GAGGAGAAAC TCTGTCTCGC CTGCCGGCTC TTCGGCAGCC CGTGGAACGG CAGTCGGTTG
CGCGTCGAAG ATGCACCATT TGTCGGCGAC AAGGTCACTC TCAAGGTGCT CGATTTTCTG
GCTATCGACC GCTTTACCGG CGGCGGACGC GATACGGCCA AGTTCGACGC GGTGGTGCTG
TGGCAGCCGA AGTTTCGGGT GCGGCTCTTC CTTGAGAATC CAGAGCCGTG GGAATTGGGT
TGGCTGGCGC TGGTGTTGCG CGATCTGCAC GATGGGTTGA TCACTATTGG CTTCGGCGCT
GCCAAAGGTT TTGGTCAGTG CAGGATCGAA GACGGTGCTG TGACTCTCGG TATTATTCAC
GAAAGCGATT TTCCGCTGTC TGAGCCGAAT AACCAGGAGC CGGCCGCGCA GCAGGCGATT
ACCGCTAAAC AGCAACTTTT GCAATCGAAA GGCGGGATGA GCGGCGTCTA TCGCACGTTG
ATGCTTGATC CGGCAACAGA GAACGATTGG AGGACACTGG TAGAGAGCTG GATCAGAGCG
TTCAATGTGA CGGTCAAGGA ATATAAACAT GCCTTCGGTT TGAAGAAAGA CAGCTACTTC
GATAAAATCA ACGGCACATG GTTGCCTGAT CTCTATCCGG CGAGGGTGTC ATGA
 
Protein sequence
MKFRAITANL TLRTALHVGT GSDTETTDDL LRRDVRGRLL IPGTAIAGVL RSIATRLAPR 
FGESPCQVID NQSSNDACQC LVCQLFGDVN PREDSDTATA TRVLVYDAVL DTIPSLTIRD
GVGIDRVTGA AARRERIKFD YEVLPAGTVF TLRLEIDPKL PDVPKLLPLL AATLAEWEQG
RGAIGGRVNR GLGAFTLNEV QWIERDLNQA GVLIEFLHRG PPWHTTDGDR TWLTNQVQQV
RAWVEPYQGN DPVARSWVLA EFTLAATGPF LTNDVAQAGR SGFDHAPVFA AYEQGAKPVL
PGSSLRGALR SQAERIARTL ATFSAWDNGK DMKSRKTSFL ATCPACNPLT TKTDDPVASC
NSFIKARPKV ERDTLEQKGA EEKLCLACRL FGSPWNGSRL RVEDAPFVGD KVTLKVLDFL
AIDRFTGGGR DTAKFDAVVL WQPKFRVRLF LENPEPWELG WLALVLRDLH DGLITIGFGA
AKGFGQCRIE DGAVTLGIIH ESDFPLSEPN NQEPAAQQAI TAKQQLLQSK GGMSGVYRTL
MLDPATENDW RTLVESWIRA FNVTVKEYKH AFGLKKDSYF DKINGTWLPD LYPARVS