Gene Clim_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0041 
Symbol 
ID6355564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp46674 
End bp48521 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content55% 
IMG OID642667666 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001942128 
Protein GI189345599 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00446699 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCGA CCGGATCGGA TTCAGCCTTG TTCGATACCC TTCGAAAGGT TTTCGGATTC 
CGTGAGTTTC GCCCGAATCA GGAACGCATT GTACGGGCAA TTCTGAATAA GCGGGATGTG
TTTGCCGTTA TGCCCACCGG GGGCGGCAAA TCTCTCTGCT ACCAGCTGCC CGCAGTGCTG
CTTCCGGGAA CCTGCATGGT GATCAGCCCG CTTATCGCGC TCATGAAAGA TCAGGTTGAC
GGAGCGAGGG CCAATGGCAT CCGCGCTGCG CATCTCAACA GCTCCCTCTG CCCGGAAGAA
CGAACTGCCG TTATGCACGA CCTGCTCTCG AATTCGCTCG ACCTGCTCTA TGTGGCCCCC
GAACGCTTTA CGCTCGAGCA GTTCCGGGAG ATGCTCGGAC GGGTGAACAT CAGCATGGCC
GTAATCGACG AAGCTCACTG TATTTCGGAG TGGGGACACG ATTTCCGGCC GGACTATCTC
TCTCTTTCCG CTCTGGTCAC CCTCTTTCCC GATCTGCCGG TTACCGCCTT TACGGCTACG
GCCACGCACC TGGTGCAGCA GGATATTCTC GATAAACTCT CACTGCGCGA TCCGCTTGTC
GTCAGGGCCT CCTTTGACCG CGGCAATCTT TTCTACGACA TCCGTTTCAA GGAAAACAGC
GGGGAACAGA TTGCGGCGAT TGTGAGAAGC AATCAGGGAA AAGCCGGAAT CATCTACCGC
ACCAGCCGTA AAAACGTCAA CGACACGACG GCCATGCTTA AAGCTAAAGG GTTCAGGGCA
CTCCCCTACC ACGCCGGGTT GGGAGACGAG GAGCGCAAGC GCAATCAGGA TGCCTTCATC
CGCGATGAAG CGGACGTTAT TGTCGCCACG GTCGCATTCG GTATGGGGAT AGATAAATCG
AACATCCGCT TCGTCATCCA TGCCGACCTG CCGAAAAGCA TTGAAAATTA CTATCAGGAA
ACCGGCAGGG CCGGACGCGA CGGCGAGGCT GCGCAATGCA CGTTGCTGTT CTCGCAGGGA
GACATCCCCA AAGTGCGCTT CTTCATCGAC ACGATAACCG ATGAAGCCGA GAGGGCCCGG
GTTCTCGCCG CATTTTCAAA AGTTATCGCA TTCGCATCCA CATCGGTCTG CAGGCGGAAA
ACCCTGCTTG ACTATTTCGG TGAAACCTAC CCACACGATA ACTGCAACTC CTGTGACATC
TGCCTCGGCA CACGAGAGGT CATTGACGCC ACCACTGAGT CGCAGATGCT GCTTTCGGCA
ATCGCCCGGA CTGAAGAACG GTTCGGAGCG ACGCATATTG TGGATATTGT GACCGGAAGC
CGGAACCAGA AAATCAGGGA CTTCGGGCAT GACCGGCTTA AAACTTACGG AGTGGGAAAG
GGACGCGAAA AAAAATTCTG GCGGCAGCTG ATCGATGAGC TTCTTGCCCG GAAAGTGATC
GCGAAATCCG AGGGCCTCTT CCCTATGCTG TATCTGCTGC CAAAAGCGGT TCAGGTTCTG
AGAAACGAAG AAAAAGTCGA AATCGTCAGA GTAAGGGAAA AGAAAGCGAC CGGAAATGTA
AAGAATCTGG TGGAGGGGAG TTACGATCAT GAACTGTTTG ATCTGCTCCG GTCTCTCAGG
AAACAGATCG CCGACGAACT GGGTATTCCG CCCTATGTGG TCTTTTCCGA CCGCTCGCTG
CGCGATATGG CCTCCATTCT GCCGCAGAGC GACGAGACCA TGCTTTCGGT TTCGGGAGTC
GGAGAGGTCA AGCTTCAGCG ATATGGACAA CAGTTCCTCG CGCTCATAAA AAAATACAGA
AAAGAGCATC CGGAAGAAAC TTCCCGATTG CCGATACTGC CTCGATGA
 
Protein sequence
MQATGSDSAL FDTLRKVFGF REFRPNQERI VRAILNKRDV FAVMPTGGGK SLCYQLPAVL 
LPGTCMVISP LIALMKDQVD GARANGIRAA HLNSSLCPEE RTAVMHDLLS NSLDLLYVAP
ERFTLEQFRE MLGRVNISMA VIDEAHCISE WGHDFRPDYL SLSALVTLFP DLPVTAFTAT
ATHLVQQDIL DKLSLRDPLV VRASFDRGNL FYDIRFKENS GEQIAAIVRS NQGKAGIIYR
TSRKNVNDTT AMLKAKGFRA LPYHAGLGDE ERKRNQDAFI RDEADVIVAT VAFGMGIDKS
NIRFVIHADL PKSIENYYQE TGRAGRDGEA AQCTLLFSQG DIPKVRFFID TITDEAERAR
VLAAFSKVIA FASTSVCRRK TLLDYFGETY PHDNCNSCDI CLGTREVIDA TTESQMLLSA
IARTEERFGA THIVDIVTGS RNQKIRDFGH DRLKTYGVGK GREKKFWRQL IDELLARKVI
AKSEGLFPML YLLPKAVQVL RNEEKVEIVR VREKKATGNV KNLVEGSYDH ELFDLLRSLR
KQIADELGIP PYVVFSDRSL RDMASILPQS DETMLSVSGV GEVKLQRYGQ QFLALIKKYR
KEHPEETSRL PILPR