Gene Cagg_2591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2591 
Symbol 
ID7267180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3161961 
End bp3164834 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content61% 
IMG OID643567415 
Producthelicase domain protein 
Protein accessionYP_002463896 
Protein GI219849463 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000604323 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAACCC CCGGTTCGAT TGTGCGCTGT CGCGAACGCG AGTGGGTGCT CTTGCCCGGC 
GCCGAAGATG ACCGCTTCTG GTTGCGACCG TTGATCGGAC GCAGCGATGA TGTGATTGCC
ATCTCTCGCA CGCTGAGTGA GATTGCCGGG TATACATTAC CGGAAGAGCG GGTGTGTGAC
GCCCATTTTC CGTTACCCAC ACCGAGTAAC ATTCAAGATG CGACGGCGGC AATGTTATTC
TGGCGGGCCG CTCGCATGGC GTTGCGTGAC GGAGCCACGC CATTGCGGTC GCTGGGGCGA
ATTTCCATCC GTCCGCGGTT GTACCAGTTT GTGCCCTTAT TGATGGCCCT CCGCCTTCAG
CCGGTTCGTT TGCTCATCGC CGATGATGTG GGGGTGGGCA AGACGATTGA GGCGTTGCTG
ATAGCCCGCG AATTGTGGGA TCGCGGCGAG ATCCGCAGTC TGGCCGTGCT TTGCCCACCG
TATCTCTGCG ACCAGTGGCA GCAAGAGCTT CGCCAGAAGT TTCATCTCGA TGCAGTCGTG
GTTCGGCCGG GTACCATCAG CGACCTCGAT CGGCAAGCAC CGCAGGGGGT AGATTTCTAT
CACCATTTTC CGGTCCAAGT GATCAGTATC GATTGGGTGA AGACCAGTCG GCATCGCGAC
CGTTTTCTCG TCCATTGCCC CGATTTCGTG ATTGTCGATG AAGCGCACGG TGTGGCGCCG
GCTACCGACA CTGCTCAGCA GTTGCGCCAT GAGTTGGTTC GTGAGCTTGC CGTCAAGGCC
GAGCGCCATC TCGTCTTGCT TACCGCGACG CCGCATAGCG GGAATCCCGA TACCTTCCGT
GCGTTGATCG GGTTGCTCGA TCCAGAGTTT GCAACGTGGG ATGTGAGTAA TCTGAGCGAC
GGCCAACGTG CTCGCCTTGC TCGTCATTTT GTCCAGCGCA CGCGCACCGA TATTGAACAG
AGTTGGCCGG ACGGGGTGCG CTGTTTTCCG CAGCGCGACC TGCGTGATGC GCATTACTAT
CTCACCCCCG CCTACGAGAG CCTCTTTCGC GATGTCTATA CCTTCTGCGC CGAGCTGGTC
AAACGGGGCG ACGGGCTGCG TGAGCCGCAG CGTCGGGTTC GCTATTGGGC CGCTCTCTCG
ATCTTGCGCT GTGTGATGTC GAGTCCGGCG GCGGCACGGG TTGCCTTACG CAACCGCGCG
GCGCGTCTCA CCATGGACGA CGATGCGCTC GCCGATGATA CGATCTGGCA GGGCGCGGTG
TTTGAGTCGG GTGAAACCGA GACCGATGAT GAGACGCCGA CGACCGTTAT CGAACAGGCA
GAGCTGTCGT TCAACGAGAG CGAACAGCGT CGGCTCGATG TGTTCATCCG ACGTGCCGAG
CAGATTGCCC AAAGCAATGA AGATGCGAAA CTGACCGGTG CCATCGACCT TGTCCGTCAG
TTGATCGACG AGGGCTACGC TCCTATCGTC TGGTGTCGGT ATGTGGCTAC CGCCGACTAT
GTCGCCCGTG CCTTTCGCCA CCATCTCTCC GGCGTGCATG TCACCTGCGT TACCGGACGG
ATGGGCGAGG AAGAACGGCG TGCCGTGATT GCTGCTGCGC CGGTCGATCA ACCGCGGGTG
CTGGTTGCCA CCGATTGCAT CTCCGAGGGG ATCAACCTGC ACGAACGCTA CAATGCTGCC
ATTCACTACG ATCTGCCATG GAACCCGAAC CGGCTTGAAC AACGTGAGGG TCGGGTCGAC
CGCTACGGGC AGACGGCGCC GACCGTGGTG ACGGTACGCT ACTATGGTCT CAACAATGAA
GTTGATACGG TGGTAATCGA TGTCTTACTG CGGAAAGCCC GCGAGATTCG CCGCGCGCTC
GGTGCGCACG TGCCGGTACC TGCCGAGAGT GTCATGGATG CGCTCACCAA GACTCTGTTT
TTACGGCGTG AGCAACCGGC CAACCAATTA CGCCTCGATC TCGTCGCCTC CGAAACGGTC
GCCTTTCATC AGCGCTGGGA CGAGGCAGTA GCCCGCGAAA AAAAGACGCG CACCCGCTTT
GCCCAGCACG CCCTCAAACT CGATGACGTG CGGCCGGTCA TTGAGGCCAC CGACCGCGTA
TTAGGTGATC CCGACGAAGT ACGGGACTTT GTCTTAGCGG CAGCGGGCCG GGTGGGTTTG
GTTATTCAAC CGCAACGCGG TCACCCTGAT GTGTTTCACG TGACCACCCT CCCGGAAACG
CCGCCTCCTA TTGCCGACGC CGTCCCAGAG GGCAACGGTG CGTGGGCGAT CACCTTCTCA
TCACCGGCGC CGGGTGGCGT GGAATATATC GGACGCAACC ACCGGCTCGT CAGCCGTCTC
GCCGGCTACC TGTTTTCGAT GGCACTTGCC CGCAGTCATC CAGACCACAA CGAGGTACCC
GTCGCCCGCA TCGGAGTCAT CCGCACCGAC AGCGTCAACC GGCTCACCGT CATCTGGTTA
ACGCGCGCAC GGTACCTGCT CCAATTCCCG GGAAGCCGCC ACCCCCTCCT CGCCGAAGAG
GCACTGGTGT CGGGGTATGT TGACGAAGGC GGTACCCACC GCTGGCTCGA CGAAGCGACG
GTCACACGCC TGCTCCGCGA GGCTCGATCG GCCGGCAACA TCTCGCCCGC CGAGAAGCGG
GAACTGGCCG AGATAGTGCT CGCCGAACTC GGTGAGGCCA ACCGTGACCA TCCGATCTGG
CAGGCACTGG CGACGCAAAC CGAACAACGG GCCGCCGAAT TAAAAGAGAT GTATCGTCGC
GTCCGTCAGG CCCTGCACTA TCATGTGCGT GGCATCGCCG TCGAACCGGT ATTCCCCCCC
GACCTGTTGG GGATGATCGT ATTGCAACCG ATTCCCCGGC GTGGGACATC GTAG
 
Protein sequence
MPTPGSIVRC REREWVLLPG AEDDRFWLRP LIGRSDDVIA ISRTLSEIAG YTLPEERVCD 
AHFPLPTPSN IQDATAAMLF WRAARMALRD GATPLRSLGR ISIRPRLYQF VPLLMALRLQ
PVRLLIADDV GVGKTIEALL IARELWDRGE IRSLAVLCPP YLCDQWQQEL RQKFHLDAVV
VRPGTISDLD RQAPQGVDFY HHFPVQVISI DWVKTSRHRD RFLVHCPDFV IVDEAHGVAP
ATDTAQQLRH ELVRELAVKA ERHLVLLTAT PHSGNPDTFR ALIGLLDPEF ATWDVSNLSD
GQRARLARHF VQRTRTDIEQ SWPDGVRCFP QRDLRDAHYY LTPAYESLFR DVYTFCAELV
KRGDGLREPQ RRVRYWAALS ILRCVMSSPA AARVALRNRA ARLTMDDDAL ADDTIWQGAV
FESGETETDD ETPTTVIEQA ELSFNESEQR RLDVFIRRAE QIAQSNEDAK LTGAIDLVRQ
LIDEGYAPIV WCRYVATADY VARAFRHHLS GVHVTCVTGR MGEEERRAVI AAAPVDQPRV
LVATDCISEG INLHERYNAA IHYDLPWNPN RLEQREGRVD RYGQTAPTVV TVRYYGLNNE
VDTVVIDVLL RKAREIRRAL GAHVPVPAES VMDALTKTLF LRREQPANQL RLDLVASETV
AFHQRWDEAV AREKKTRTRF AQHALKLDDV RPVIEATDRV LGDPDEVRDF VLAAAGRVGL
VIQPQRGHPD VFHVTTLPET PPPIADAVPE GNGAWAITFS SPAPGGVEYI GRNHRLVSRL
AGYLFSMALA RSHPDHNEVP VARIGVIRTD SVNRLTVIWL TRARYLLQFP GSRHPLLAEE
ALVSGYVDEG GTHRWLDEAT VTRLLREARS AGNISPAEKR ELAEIVLAEL GEANRDHPIW
QALATQTEQR AAELKEMYRR VRQALHYHVR GIAVEPVFPP DLLGMIVLQP IPRRGTS