Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_0041 |
Symbol | |
ID | 6355564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 46674 |
End bp | 48521 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642667666 |
Product | ATP-dependent DNA helicase RecQ |
Protein accession | YP_001942128 |
Protein GI | 189345599 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0514] Superfamily II DNA helicase |
TIGRFAM ID | [TIGR00614] ATP-dependent DNA helicase, RecQ family [TIGR01389] ATP-dependent DNA helicase RecQ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00446699 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCGA CCGGATCGGA TTCAGCCTTG TTCGATACCC TTCGAAAGGT TTTCGGATTC CGTGAGTTTC GCCCGAATCA GGAACGCATT GTACGGGCAA TTCTGAATAA GCGGGATGTG TTTGCCGTTA TGCCCACCGG GGGCGGCAAA TCTCTCTGCT ACCAGCTGCC CGCAGTGCTG CTTCCGGGAA CCTGCATGGT GATCAGCCCG CTTATCGCGC TCATGAAAGA TCAGGTTGAC GGAGCGAGGG CCAATGGCAT CCGCGCTGCG CATCTCAACA GCTCCCTCTG CCCGGAAGAA CGAACTGCCG TTATGCACGA CCTGCTCTCG AATTCGCTCG ACCTGCTCTA TGTGGCCCCC GAACGCTTTA CGCTCGAGCA GTTCCGGGAG ATGCTCGGAC GGGTGAACAT CAGCATGGCC GTAATCGACG AAGCTCACTG TATTTCGGAG TGGGGACACG ATTTCCGGCC GGACTATCTC TCTCTTTCCG CTCTGGTCAC CCTCTTTCCC GATCTGCCGG TTACCGCCTT TACGGCTACG GCCACGCACC TGGTGCAGCA GGATATTCTC GATAAACTCT CACTGCGCGA TCCGCTTGTC GTCAGGGCCT CCTTTGACCG CGGCAATCTT TTCTACGACA TCCGTTTCAA GGAAAACAGC GGGGAACAGA TTGCGGCGAT TGTGAGAAGC AATCAGGGAA AAGCCGGAAT CATCTACCGC ACCAGCCGTA AAAACGTCAA CGACACGACG GCCATGCTTA AAGCTAAAGG GTTCAGGGCA CTCCCCTACC ACGCCGGGTT GGGAGACGAG GAGCGCAAGC GCAATCAGGA TGCCTTCATC CGCGATGAAG CGGACGTTAT TGTCGCCACG GTCGCATTCG GTATGGGGAT AGATAAATCG AACATCCGCT TCGTCATCCA TGCCGACCTG CCGAAAAGCA TTGAAAATTA CTATCAGGAA ACCGGCAGGG CCGGACGCGA CGGCGAGGCT GCGCAATGCA CGTTGCTGTT CTCGCAGGGA GACATCCCCA AAGTGCGCTT CTTCATCGAC ACGATAACCG ATGAAGCCGA GAGGGCCCGG GTTCTCGCCG CATTTTCAAA AGTTATCGCA TTCGCATCCA CATCGGTCTG CAGGCGGAAA ACCCTGCTTG ACTATTTCGG TGAAACCTAC CCACACGATA ACTGCAACTC CTGTGACATC TGCCTCGGCA CACGAGAGGT CATTGACGCC ACCACTGAGT CGCAGATGCT GCTTTCGGCA ATCGCCCGGA CTGAAGAACG GTTCGGAGCG ACGCATATTG TGGATATTGT GACCGGAAGC CGGAACCAGA AAATCAGGGA CTTCGGGCAT GACCGGCTTA AAACTTACGG AGTGGGAAAG GGACGCGAAA AAAAATTCTG GCGGCAGCTG ATCGATGAGC TTCTTGCCCG GAAAGTGATC GCGAAATCCG AGGGCCTCTT CCCTATGCTG TATCTGCTGC CAAAAGCGGT TCAGGTTCTG AGAAACGAAG AAAAAGTCGA AATCGTCAGA GTAAGGGAAA AGAAAGCGAC CGGAAATGTA AAGAATCTGG TGGAGGGGAG TTACGATCAT GAACTGTTTG ATCTGCTCCG GTCTCTCAGG AAACAGATCG CCGACGAACT GGGTATTCCG CCCTATGTGG TCTTTTCCGA CCGCTCGCTG CGCGATATGG CCTCCATTCT GCCGCAGAGC GACGAGACCA TGCTTTCGGT TTCGGGAGTC GGAGAGGTCA AGCTTCAGCG ATATGGACAA CAGTTCCTCG CGCTCATAAA AAAATACAGA AAAGAGCATC CGGAAGAAAC TTCCCGATTG CCGATACTGC CTCGATGA
|
Protein sequence | MQATGSDSAL FDTLRKVFGF REFRPNQERI VRAILNKRDV FAVMPTGGGK SLCYQLPAVL LPGTCMVISP LIALMKDQVD GARANGIRAA HLNSSLCPEE RTAVMHDLLS NSLDLLYVAP ERFTLEQFRE MLGRVNISMA VIDEAHCISE WGHDFRPDYL SLSALVTLFP DLPVTAFTAT ATHLVQQDIL DKLSLRDPLV VRASFDRGNL FYDIRFKENS GEQIAAIVRS NQGKAGIIYR TSRKNVNDTT AMLKAKGFRA LPYHAGLGDE ERKRNQDAFI RDEADVIVAT VAFGMGIDKS NIRFVIHADL PKSIENYYQE TGRAGRDGEA AQCTLLFSQG DIPKVRFFID TITDEAERAR VLAAFSKVIA FASTSVCRRK TLLDYFGETY PHDNCNSCDI CLGTREVIDA TTESQMLLSA IARTEERFGA THIVDIVTGS RNQKIRDFGH DRLKTYGVGK GREKKFWRQL IDELLARKVI AKSEGLFPML YLLPKAVQVL RNEEKVEIVR VREKKATGNV KNLVEGSYDH ELFDLLRSLR KQIADELGIP PYVVFSDRSL RDMASILPQS DETMLSVSGV GEVKLQRYGQ QFLALIKKYR KEHPEETSRL PILPR
|
| |