Gene Cagg_1123 details

Gene Information

Locus tag: Cagg_1123
Symbol:
ID: 7268577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1386076
End bp: 1388976
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 57%
IMG OID: 643565966
Product: helicase domain protein
Protein accession: YP_002462469
Protein GI: 219848036
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
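
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1386076
end_bp = 1388976
gene_length_bp = 2901
protein_length_aa = 966


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 2901 bp corresponds to 967 codons: 966 coding codons plus one stop codon.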


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0214668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0288722
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGCCG AAGGCGACTG GGTGCTCGTT CACCCGCAAA ACGCCGTCGG TAAAATCATT 
GAACAACGTT GCTTATGGGG CAATACCTTC TTCCGCGTCT GGCTTCCATC AACCGACTCG
ATCATCCGCG TGAACGCGGC CAACTTGTTA CCGGTTCAGT CCATAGTCCT CAGCCCTCAT
CATCTTATCT ACCTCACCGC CGCGACACGG ATCGCCGATG CGCTGAGCCA AGATGTCTTG
CTTGCACCCG TTGAATCGTC TGTCATCCCG TTGCCGCACC AACTGCGCGC ACTTTCGCGG
GCCATTTCCA CCGACCGCGT GCGCTATCTG CTGGCCGATG AGGTGGGTCT CGGCAAGACT
ATCGAGGCCG GTCTGATTAT GAAAGAACTC AAGCTCCGCG GCTTGGTGAG GCGAACGCTA
ATTATCGCCC CGAAGGGTCT CGTGACCCAA TGGGTGGCCG AGATGGCGAT GCACTTCAAT
GAACAGTTCC ATGCGATCCT GTCTGAGGAT TACAAGAGCT TAAAACGAAT CGCGGCAATC
GCCAAGACAG AGGCTCGCGG AATGCGGACA ACATCCTTGG AACCATCTTC CCTCATCTCG
CCTCCCTCGT CCATCTTCAC CGCCAATGCA AACCCGTTTA CCGCATTTGA TCAGGTCGTC
GTTTCCATGG ACTCCGTCAA GCCGCCATCA AATGCCGACA GCGCGGCCTG GGAACGTTTC
GAGGACCTGA TCTCCGCCGG CTGGGACTTG GTGATCGTAG ACGAGGCCCA TCGCTTAGGA
GGAAGCACCG ATCAGGTGGC CCGCTACAGG CTTGGTCAAG GATTGGCCGA GGCTGCACCC
TACCTCCTTC TGCTCTCTGC CACGCCTCAT CAAGGCAAGA CCGAGGCCTT CTACCGACTG
ATCGCCTTGC TCGATAGTCA AGCCTTCCCC GATGTGAATA GCGTGACGAA GGAGCGGGTA
CAGCCCTACG TCATCCGCAC CGAGAAGCGC CGTGCCATTG ATGCAGAGGG CAAGCCGCTG
TTTAAGCCTC GGCGCACAGA ATTAGCGCCG GTCTCTTGGG AGGAACGTCA TCGGGATCAA
CGGCTGCTCT ACGAGGCAGT CACCGAATAT GTACGCGAAG GCTATAACCA GGCCATCCGC
GAGAAACGCA ACTACATCGG TTTTCTGATG ATCCTGATGC AGCGCCTGGT GGTCTCTAGC
ACACGCGCGA TTAAGACAAC GCTCGAGCGC CGTCTTGAAG TGCTCAGTGC CGAGAACTCA
AGGTTGAGTC AGCGGGAGTT GTTCGATGCA GCGCTTAACG CTGACGACTT TTACGATCTA
GACGGTGAAG AACAGCTCCA ACTGCTGTTG AGGACACGTA TTCAGGGGAT ACGTGACGAA
CAGACCGAGG TAACGCTGCT GTTAGAATTG GCAAGACGAT GTGAAGCCCA AGGCCCAGAT
GCCAAAGCCG AAGCACTCCT AGACTGGATT TACCGTCTCC AAGCCGAGGA AGGCGACCCA
AACCTTAAGG TGCTGGTGTT CACCGAGTTC GTGCCTACCC AAGAGATGCT GTACGAATTT
CTCACCCAAC GCGGCTTCAC GGTGGTATGC CTCAACGGCG CTATGGACAT GGCAGCGCGG
AAGCAAGTTC AAGACGCCTT TGCCAACAAT GCGCGCATCC TCATTTCGAC AGACGCCGGC
GGTGAGGGGC TTAACCTCCA GTTCTGCCAC GTGGTGATCA ACTACGACAT CCCTTGGAAC
CCAATGCGCC TCGAACAACG GATCGGCCGC GTTGACCGCA TCGGTCAGAC CCATACGGTC
CGCGCCGTCA ATTTTGTGTT CCAGGACTCG GTCGAACACC GTATCCGCGA GGTCCTTGAG
CAAAAGCTGG CAGTCATCTT GGAAGAGTTC GGTATCGACA AGACCGGTGA CGTGCTCGAC
TCCGCCCAAG CGGGTCAAAT CTTCGATGAT TTGTATGTGG AGGCCATCCT CAATCCTGCC
GAGGTGGAGC AGAAAGTTGA CGAAGTCCTG AACAGTATCC GCGCCCAAGC ACAGGAGTTC
CGGCAGAGTA CCGCCTTGCT GGGCGCGACC GAAGACTTAG ACCCGCACGA GGCCGAACGA
CTGCTCGCCC ACCCGCTCCA GCATTGGGTC GAAGCGATGA CTACCGCATA CCTACGAAGT
GTAAGGGACG AGGGGCGGAG TGCGAACGGC AAAAAACCGG GCGCGGAACC GATTGGCAAA
GAGGGGTCTC ATACGATCTG GCATCTCATT TGGCCGGATG GCCACGAAGA GACTGTTTGC
TTTACCCATA CCACCCTCCG CACGCCACAT TTGGGGCTTG AAGACCCCCG CGTGCTCGGT
CTGGCGATGC GTCTCCCTTC CTTTGTGCCG GGACAACCGA TTCCGGTCAT CACGCTTCCC
AACCTTGCCG CAGAGATCGT AGGGTTCTGG TCACTCTGGC GGATCGCAAT CTCGACCACC
GATTGGAACC GTCGCCGGAT CATGCCCTTG TTTCTCACCG ACAACGGTCG GGTATTCGCC
CCGACTGCCC GACGAATCTG GGATCACCTG GTGGTGGCGA ACCCTACCAT TCTGCGGCAT
CTCGACAGCG AGACCTCGTA TCACATCTTC GAACGTCTCT GGGAGGCGGC GGAACAACAG
GGGAACGCGA TCTACCATGA ACTGGTGCAG GCTCAGCGGG AGCATCTCGC GCGCGAGCGT
GAGAAGGGCG AGTACGCTTT TGCTGCACGA CGCCGAGTTA TCGAACGGCT CGGACTGCCC
GAAGTTCGTA GCTATCGTCT GCGGCGCTTA CAACAGGAAG AAGAACAATT CAGAGAACAA
CTCGAACGTA GCTCGCAGAT TTTGCCGGAA ATGGTGCCTG TGCTGATCGT CCGGGTGGAA
CCAGACCAGG ATGGGGGGTG A
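
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence in this layout. The sketch below is illustrative only: `gc_content` is a hypothetical helper, and the record does not say how its own percentage was calculated or rounded.

```python
def gc_content(dna):
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small example: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

To apply it to this gene, pass the full Gene sequence block as a single string.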
 
Protein sequence
MFAEGDWVLV HPQNAVGKII EQRCLWGNTF FRVWLPSTDS IIRVNAANLL PVQSIVLSPH 
HLIYLTAATR IADALSQDVL LAPVESSVIP LPHQLRALSR AISTDRVRYL LADEVGLGKT
IEAGLIMKEL KLRGLVRRTL IIAPKGLVTQ WVAEMAMHFN EQFHAILSED YKSLKRIAAI
AKTEARGMRT TSLEPSSLIS PPSSIFTANA NPFTAFDQVV VSMDSVKPPS NADSAAWERF
EDLISAGWDL VIVDEAHRLG GSTDQVARYR LGQGLAEAAP YLLLLSATPH QGKTEAFYRL
IALLDSQAFP DVNSVTKERV QPYVIRTEKR RAIDAEGKPL FKPRRTELAP VSWEERHRDQ
RLLYEAVTEY VREGYNQAIR EKRNYIGFLM ILMQRLVVSS TRAIKTTLER RLEVLSAENS
RLSQRELFDA ALNADDFYDL DGEEQLQLLL RTRIQGIRDE QTEVTLLLEL ARRCEAQGPD
AKAEALLDWI YRLQAEEGDP NLKVLVFTEF VPTQEMLYEF LTQRGFTVVC LNGAMDMAAR
KQVQDAFANN ARILISTDAG GEGLNLQFCH VVINYDIPWN PMRLEQRIGR VDRIGQTHTV
RAVNFVFQDS VEHRIREVLE QKLAVILEEF GIDKTGDVLD SAQAGQIFDD LYVEAILNPA
EVEQKVDEVL NSIRAQAQEF RQSTALLGAT EDLDPHEAER LLAHPLQHWV EAMTTAYLRS
VRDEGRSANG KKPGAEPIGK EGSHTIWHLI WPDGHEETVC FTHTTLRTPH LGLEDPRVLG
LAMRLPSFVP GQPIPVITLP NLAAEIVGFW SLWRIAISTT DWNRRRIMPL FLTDNGRVFA
PTARRIWDHL VVANPTILRH LDSETSYHIF ERLWEAAEQQ GNAIYHELVQ AQREHLARER
EKGEYAFAAR RRVIERLGLP EVRSYRLRRL QQEEEQFREQ LERSSQILPE MVPVLIVRVE
PDQDGG
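
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard genetic code, which assigns the same amino acids as translation table 11. It does not handle the alternative start codons that table 11 permits, which is harmless here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Spaces and line breaks are ignored, so sequence blocks can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the in-frame last 21 bases of the gene sequence above.
print(translate("ATGTTTGCCG AAGGCGACTG GGTGCTCGTT"))  # MFAEGDWVLV
print(translate("CCAGACCAGG ATGGGGGGTG A"))           # PDQDGG
```

The two outputs match the first ten and last six residues of the protein sequence listed above.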