Gene Cagg_3660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3660 
Symbol 
ID7268195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4450066 
End bp4452111 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content56% 
IMG OID643568466 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_002464932 
Protein GI219850499 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCACA TCGACGTGAC CCAACGCATC AATGAGTTGC GGACGCTTAT CCGTCGCTAT 
GATTACCACT ACTACGTCCT CGACGATCCC ATTGTGAGCG ACGCCGAATA CGATGCCTTA
ATGAACGAAC TACGCGCACT AGAAGCGGCG CATCCAGAAT TGATCACCCC CGACTCACCA
ACCCAGCGGG TCAGCGGCAC ACCGGCCAGT CAGTTTGCGA AAGTGCAACA TCCTCAGCCG
ATGCTCTCGC TTGGCAATGC TTTCACCACG ACCGATCTGC TTGCCTGGCG TGATCGAGTG
CTGCGCTTAC TCGGTCAGGA TACAGCGGTA GCCTATGTGG TCGAACCCAA GATCGACGGG
CTGGCAGTAG CATTGACATA CCATGACGGC CAATTCGTGC AAGGTGCAAC GCGCGGTGAT
GGTGAGGTAG GTGAAGATGT AACCGCGAAT CTGCGCACGA TTAGTAGCAT TCCGCTGCTG
TTGCACCCAC CAGATGGCGG CCACGACCGA GATGTACCCA CCGCGTTACC CTCACTGATC
GAGGTGCGCG GTGAAGTCTA TATGCGTACC GCTGACTTCG AGGCACTGAA TGATCGTCTG
GCCGCTGCCG GTGAGAAGAT CTTCGCCAAC CCACGCAATG CCGCTGCCGG TTCACTTCGC
CAAAAGGATC CGGCTATTAC TGCTGCTCGA CCGCTGCGCT TTTTTGCCTA CGGTGTCGGT
CCGGTCGAAG GAGTCGAGCT GACCGGTCAA TGGCAGACGT TACGGTATCT ACGGATGCTA
GGCTTTCCGG TCAATCAAGA TGTCCGTCGT TTTACCGATT TCGATGAGGT ACTGGCCTAC
TGTGAACAAT GGATGGCCCG TCGCGATGAA TTAACCTACG AAGCTGATGG GATGGTCATC
AAGATTGACG ATTTTGCCCA GCAGCGTGAG CTTGGCGTGG TTGGGCGCGA TCCACGCTGG
GCTATTGCCT TTAAATTTCC ACCGCGCGAG GCTATCACTC GTTTACTCAA TATTACGGTG
AATGTTGGAC GGACCGGCGT CGTTACGCCA AATGCCGAGT TAGAGCCGGT ACAGATCGGT
GGAGTCATTG TGCGTAATGC GAGCCTGCAT AACGCCGACT ATATCGCTCA GCGTGATATT
CGGATCGGCG ATTACGTCAT TGTGAAACGG GCCGGTGACG TGATTCCGTA TGTCGTTGGG
CCGGTCGTGG CCAGGCGTGA TGGCAGCGAG CGACCGTGGC AGTTTCCGAC TCATTGTCCG
GCATGCGGTT CACCACTTGA ACGCGAACCG GGTGAGGCGG CCTGGCGTTG TAACAATTTT
GGGATTTGTC CGGCGCAGTT GGTTCGTCGG CTAGAACACT TTGCCAGCCG TGCTGCACTC
GATATTGTTG GCCTGGGTGA ACGACAGGCC GAGTTGTTTG TGCAACGTGG TTTGGTGCGT
GATGTAGCCG ATTTGTTCTA CCTGAAAGCA GAGGATTTTG CCGGATTAGA AGGGTTTGGC
CCGAAGCGGA TCGCGAACTT GCTGAACGCC ATTGACGCTG CGCGCCAACG ACCACTCGAC
CGCTTGATCG TTGGTTTGGG GATTCGCTAT GTCGGCTCCG TGGCAGCCCA GGCATTGGTC
AACTCCTTAG GATCGCTCGA CGCGATTATG AATGCTCGCC AAGAAGAACT CGAGCAGATT
CCCGGAATTG GGCCGGTGGT TGCGGCAAGC ATTGTTGATT TCTTTGCCCA CCCCGAAAAT
CGGCAGTTGA TCGAAAAGTT GCGCGCGGCA GGAGTTCAGA TGAACGCCGG CCCCCAGCGT
GAACGGAAGA GTGATGTGCT CGCCGGACAG ACGTTTGTCC TCACCGGAAC GTTACCTTCA
CTGACCCGCG AGCAGGCCAG CGCTCTGATC ATCGCACACG GCGGTAAAGT CACCGATAGC
GTCAGTAAGA AGACGAATTA TGTCGTAGCG GGGGTGAATG CCGGTAGCAA ATTAGCGAAG
GCCCAGCAAC TTGGTATTCC GGTGCTTGAT GAAGCCGCGC TGTTGGCGCT GATCGGTGAG
CGGTGA
 
Protein sequence
MNHIDVTQRI NELRTLIRRY DYHYYVLDDP IVSDAEYDAL MNELRALEAA HPELITPDSP 
TQRVSGTPAS QFAKVQHPQP MLSLGNAFTT TDLLAWRDRV LRLLGQDTAV AYVVEPKIDG
LAVALTYHDG QFVQGATRGD GEVGEDVTAN LRTISSIPLL LHPPDGGHDR DVPTALPSLI
EVRGEVYMRT ADFEALNDRL AAAGEKIFAN PRNAAAGSLR QKDPAITAAR PLRFFAYGVG
PVEGVELTGQ WQTLRYLRML GFPVNQDVRR FTDFDEVLAY CEQWMARRDE LTYEADGMVI
KIDDFAQQRE LGVVGRDPRW AIAFKFPPRE AITRLLNITV NVGRTGVVTP NAELEPVQIG
GVIVRNASLH NADYIAQRDI RIGDYVIVKR AGDVIPYVVG PVVARRDGSE RPWQFPTHCP
ACGSPLEREP GEAAWRCNNF GICPAQLVRR LEHFASRAAL DIVGLGERQA ELFVQRGLVR
DVADLFYLKA EDFAGLEGFG PKRIANLLNA IDAARQRPLD RLIVGLGIRY VGSVAAQALV
NSLGSLDAIM NARQEELEQI PGIGPVVAAS IVDFFAHPEN RQLIEKLRAA GVQMNAGPQR
ERKSDVLAGQ TFVLTGTLPS LTREQASALI IAHGGKVTDS VSKKTNYVVA GVNAGSKLAK
AQQLGIPVLD EAALLALIGE R