Gene Information |
Locus tag | Cagg_2591 |
Symbol | |
ID | 7267180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3161961 |
End bp | 3164834 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643567415 |
Product | helicase domain protein |
Protein accession | YP_002463896 |
Protein GI | 219849463 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
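The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAG); neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 3161961, 3164834

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2874, matches "Gene Length"
print(protein_length_aa(length))  # 957, matches "Protein Length"
```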
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000604323 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAACCC CCGGTTCGAT TGTGCGCTGT CGCGAACGCG AGTGGGTGCT CTTGCCCGGC GCCGAAGATG ACCGCTTCTG GTTGCGACCG TTGATCGGAC GCAGCGATGA TGTGATTGCC ATCTCTCGCA CGCTGAGTGA GATTGCCGGG TATACATTAC CGGAAGAGCG GGTGTGTGAC GCCCATTTTC CGTTACCCAC ACCGAGTAAC ATTCAAGATG CGACGGCGGC AATGTTATTC TGGCGGGCCG CTCGCATGGC GTTGCGTGAC GGAGCCACGC CATTGCGGTC GCTGGGGCGA ATTTCCATCC GTCCGCGGTT GTACCAGTTT GTGCCCTTAT TGATGGCCCT CCGCCTTCAG CCGGTTCGTT TGCTCATCGC CGATGATGTG GGGGTGGGCA AGACGATTGA GGCGTTGCTG ATAGCCCGCG AATTGTGGGA TCGCGGCGAG ATCCGCAGTC TGGCCGTGCT TTGCCCACCG TATCTCTGCG ACCAGTGGCA GCAAGAGCTT CGCCAGAAGT TTCATCTCGA TGCAGTCGTG GTTCGGCCGG GTACCATCAG CGACCTCGAT CGGCAAGCAC CGCAGGGGGT AGATTTCTAT CACCATTTTC CGGTCCAAGT GATCAGTATC GATTGGGTGA AGACCAGTCG GCATCGCGAC CGTTTTCTCG TCCATTGCCC CGATTTCGTG ATTGTCGATG AAGCGCACGG TGTGGCGCCG GCTACCGACA CTGCTCAGCA GTTGCGCCAT GAGTTGGTTC GTGAGCTTGC CGTCAAGGCC GAGCGCCATC TCGTCTTGCT TACCGCGACG CCGCATAGCG GGAATCCCGA TACCTTCCGT GCGTTGATCG GGTTGCTCGA TCCAGAGTTT GCAACGTGGG ATGTGAGTAA TCTGAGCGAC GGCCAACGTG CTCGCCTTGC TCGTCATTTT GTCCAGCGCA CGCGCACCGA TATTGAACAG AGTTGGCCGG ACGGGGTGCG CTGTTTTCCG CAGCGCGACC TGCGTGATGC GCATTACTAT CTCACCCCCG CCTACGAGAG CCTCTTTCGC GATGTCTATA CCTTCTGCGC CGAGCTGGTC AAACGGGGCG ACGGGCTGCG TGAGCCGCAG CGTCGGGTTC GCTATTGGGC CGCTCTCTCG ATCTTGCGCT GTGTGATGTC GAGTCCGGCG GCGGCACGGG TTGCCTTACG CAACCGCGCG GCGCGTCTCA CCATGGACGA CGATGCGCTC GCCGATGATA CGATCTGGCA GGGCGCGGTG TTTGAGTCGG GTGAAACCGA GACCGATGAT GAGACGCCGA CGACCGTTAT CGAACAGGCA GAGCTGTCGT TCAACGAGAG CGAACAGCGT CGGCTCGATG TGTTCATCCG ACGTGCCGAG CAGATTGCCC AAAGCAATGA AGATGCGAAA CTGACCGGTG CCATCGACCT TGTCCGTCAG TTGATCGACG AGGGCTACGC TCCTATCGTC TGGTGTCGGT ATGTGGCTAC CGCCGACTAT GTCGCCCGTG CCTTTCGCCA CCATCTCTCC GGCGTGCATG TCACCTGCGT TACCGGACGG ATGGGCGAGG AAGAACGGCG TGCCGTGATT GCTGCTGCGC CGGTCGATCA ACCGCGGGTG CTGGTTGCCA CCGATTGCAT CTCCGAGGGG ATCAACCTGC ACGAACGCTA CAATGCTGCC ATTCACTACG ATCTGCCATG GAACCCGAAC CGGCTTGAAC AACGTGAGGG TCGGGTCGAC CGCTACGGGC AGACGGCGCC GACCGTGGTG ACGGTACGCT ACTATGGTCT CAACAATGAA 
GTTGATACGG TGGTAATCGA TGTCTTACTG CGGAAAGCCC GCGAGATTCG CCGCGCGCTC GGTGCGCACG TGCCGGTACC TGCCGAGAGT GTCATGGATG CGCTCACCAA GACTCTGTTT TTACGGCGTG AGCAACCGGC CAACCAATTA CGCCTCGATC TCGTCGCCTC CGAAACGGTC GCCTTTCATC AGCGCTGGGA CGAGGCAGTA GCCCGCGAAA AAAAGACGCG CACCCGCTTT GCCCAGCACG CCCTCAAACT CGATGACGTG CGGCCGGTCA TTGAGGCCAC CGACCGCGTA TTAGGTGATC CCGACGAAGT ACGGGACTTT GTCTTAGCGG CAGCGGGCCG GGTGGGTTTG GTTATTCAAC CGCAACGCGG TCACCCTGAT GTGTTTCACG TGACCACCCT CCCGGAAACG CCGCCTCCTA TTGCCGACGC CGTCCCAGAG GGCAACGGTG CGTGGGCGAT CACCTTCTCA TCACCGGCGC CGGGTGGCGT GGAATATATC GGACGCAACC ACCGGCTCGT CAGCCGTCTC GCCGGCTACC TGTTTTCGAT GGCACTTGCC CGCAGTCATC CAGACCACAA CGAGGTACCC GTCGCCCGCA TCGGAGTCAT CCGCACCGAC AGCGTCAACC GGCTCACCGT CATCTGGTTA ACGCGCGCAC GGTACCTGCT CCAATTCCCG GGAAGCCGCC ACCCCCTCCT CGCCGAAGAG GCACTGGTGT CGGGGTATGT TGACGAAGGC GGTACCCACC GCTGGCTCGA CGAAGCGACG GTCACACGCC TGCTCCGCGA GGCTCGATCG GCCGGCAACA TCTCGCCCGC CGAGAAGCGG GAACTGGCCG AGATAGTGCT CGCCGAACTC GGTGAGGCCA ACCGTGACCA TCCGATCTGG CAGGCACTGG CGACGCAAAC CGAACAACGG GCCGCCGAAT TAAAAGAGAT GTATCGTCGC GTCCGTCAGG CCCTGCACTA TCATGTGCGT GGCATCGCCG TCGAACCGGT ATTCCCCCCC GACCTGTTGG GGATGATCGT ATTGCAACCG ATTCCCCGGC GTGGGACATC GTAG
|
Protein sequence | MPTPGSIVRC REREWVLLPG AEDDRFWLRP LIGRSDDVIA ISRTLSEIAG YTLPEERVCD AHFPLPTPSN IQDATAAMLF WRAARMALRD GATPLRSLGR ISIRPRLYQF VPLLMALRLQ PVRLLIADDV GVGKTIEALL IARELWDRGE IRSLAVLCPP YLCDQWQQEL RQKFHLDAVV VRPGTISDLD RQAPQGVDFY HHFPVQVISI DWVKTSRHRD RFLVHCPDFV IVDEAHGVAP ATDTAQQLRH ELVRELAVKA ERHLVLLTAT PHSGNPDTFR ALIGLLDPEF ATWDVSNLSD GQRARLARHF VQRTRTDIEQ SWPDGVRCFP QRDLRDAHYY LTPAYESLFR DVYTFCAELV KRGDGLREPQ RRVRYWAALS ILRCVMSSPA AARVALRNRA ARLTMDDDAL ADDTIWQGAV FESGETETDD ETPTTVIEQA ELSFNESEQR RLDVFIRRAE QIAQSNEDAK LTGAIDLVRQ LIDEGYAPIV WCRYVATADY VARAFRHHLS GVHVTCVTGR MGEEERRAVI AAAPVDQPRV LVATDCISEG INLHERYNAA IHYDLPWNPN RLEQREGRVD RYGQTAPTVV TVRYYGLNNE VDTVVIDVLL RKAREIRRAL GAHVPVPAES VMDALTKTLF LRREQPANQL RLDLVASETV AFHQRWDEAV AREKKTRTRF AQHALKLDDV RPVIEATDRV LGDPDEVRDF VLAAAGRVGL VIQPQRGHPD VFHVTTLPET PPPIADAVPE GNGAWAITFS SPAPGGVEYI GRNHRLVSRL AGYLFSMALA RSHPDHNEVP VARIGVIRTD SVNRLTVIWL TRARYLLQFP GSRHPLLAEE ALVSGYVDEG GTHRWLDEAT VTRLLREARS AGNISPAEKR ELAEIVLAEL GEANRDHPIW QALATQTEQR AAELKEMYRR VRQALHYHVR GIAVEPVFPP DLLGMIVLQP IPRRGTS
|
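The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation comparable to the GC content field. The helper names are hypothetical, and the record does not say how its GC content value was computed or rounded, so the result may differ slightly from the listed figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, copied from this record.
blocked = "ATGCCAACCC CCGGTTCGAT"
seq = clean_sequence(blocked)
print(len(seq))  # 20
print(f"{gc_fraction(seq):.0%}")
```

Applying `clean_sequence` to the full Gene sequence field should give a string whose length equals the Gene Length field if the record was copied without loss.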