Gene Sare_3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3336 
Symbol 
ID5708291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3847618 
End bp3849468 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content71% 
IMG OID641272763 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001538130 
Protein GI159038877 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00306919 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCTCCC CCACCGACCA GCGCCCCGCC GCGCTGGAGA CGCTGCGGCG TGTCTTCGGC 
TACGACGCCT TCCGGGGCTT CCAGCAGGAG GTCATCGCAC ACCTGACGGC CGGCGGGGAC
GCCTTGGTGT TGATGCCCAC CGGTGGCGGC AAGTCGCTGT GCTACCAGAT CCCGGCACTG
CTGCGGGACG GAGTCGCGGT GGTCGTGTCG CCGTTGATCG CACTGATGCA GGACCAGGTC
GACGCGCTGA CCGCGGTCGG GGTTCGGGCC GGCTTCCTCA ACTCGACGCT GGACCTCGAC
GCCCGTCGCG CCGTCGAGCG GGCCTTCGTC GCCGGTGACC TGGACCTGCT CTACCTCGCC
CCGGAAGCCC TGGGCACCCG GGGCGTGCAG CACCTGCTCG ACCAAGGGAA CATCAGCCTG
TTCGCGATCG ACGAGGCACA CTGTGTGTCG CAGTGGGGGC ACGACTTCCG CCCGGACTAC
CTCACCCTGT CGGTGCTGCA CGAGCGTTGG CCCGGGGTGC CCCGCATCGC GTTGACCGCC
ACCGCGACCA GCGCCACTCG GGCCGAGATC TCGACCCGGC TGCAGCTCAC CTCGGCCCGG
CACTTTGTCG CCAGCTTCGA CCGCCCCAAC ATCCAGTACC GCATCGTCCC GAAGCGGGAG
CCGAAGCGGC AGCTGCTGGC CCTGCTGCGG GACGAGCACC CGGGGGATGC CGGAATCGTC
TACTGCCTGT CCCGGGCCAC CGTGGAAAAG ACAGCGGAGT TCCTGGTCGA CAACGGTATT
GCCGCACTGC CGTACCACGC CGGCCTGGAC GCGGCCACCC GAGCTCGACA CCAGCAGCGC
TTCCTGCGGG AGGACGGCCT GGTCATGGTC GCGACGATCG CGTTCGGGAT GGGCATCGAC
AAGCCTGACG TGCGGTTCGT CGCCCACCTC GACCTGCCGA AGTCGGTGGA GGGCTACTAC
CAGGAGACCG GCCGCGCCGG GCGGGACGGC CTGCCGTCGA CGGCCTGGCT CGCCTACGGT
CTGACGGATG TGGTGCAGCA ACGCCGGCTG ATCGACACCT CGGAGGGGGA TCTGGCGCAC
CGGCGTAACC TCGCCGCCCA CCTGGAGGCG ATGCTCGCGC TCTGCGAAAC GGTCCGCTGT
CGCCGGGTGC AGCTGCTGGA CTACTTCGGC GAGACCGCCA CCGCCTGCGG CAACTGCGAC
ACGTGCCTGC AGCCACCCGA GTCGTGGGAC GGCACGATCG CCGCGCAGAA GCTGCTGTCC
ACGGTGTACC GGCTCGACCG GGAGCGACAC CAGCGGTTCG GCACCGGGCA CTGTGTCGAT
ATCCTGCTCG GCCGCGCCAC CGACAAGGTC CAGCAGCACC GGCACGACTC CCTGACAGTG
TTCGGGATCG GCACCGAGCT GAGCGAGGCG GAGTGGCGGG GTGTGGTCCG GCAGCTGCTC
GCCGAAGGGC TGCTGGCGGT TGAGGGCGAC TACGGCACCC TGGCCCTCAC CGACACCAGC
GCGGAGGTGC TGGGCCGGCG CCGCACCGTC ATGCTGCGCC GCGAACCGGC CCGGACCGCC
CGGCCGGCGA AGCCACGCGG CGCGGCCACC ATGGTCGCCG AGCTGGCCCC GGCCGCCGCC
GAGGTCTTCG AGCGGCTACG CGCCTGGCGG GCCGCCACGG CCAGGGAACA GGGCGTGCCC
GCCTACGTGA TCTTCCACGA CGCCACGCTG CGGCAGATCG CCAGCGACGC ACCGTCAGCA
TTGGCTGACC TGGCCCGGGT CAGTGGTGTC GGCGAGGCGA AACTCGCGAA GTACGGCGAG
CAGGTGCTGG CCGTCCTCGC CGGCGGCGAT GCGGACCCAC ACACCGCCTG A
 
Protein sequence
MPSPTDQRPA ALETLRRVFG YDAFRGFQQE VIAHLTAGGD ALVLMPTGGG KSLCYQIPAL 
LRDGVAVVVS PLIALMQDQV DALTAVGVRA GFLNSTLDLD ARRAVERAFV AGDLDLLYLA
PEALGTRGVQ HLLDQGNISL FAIDEAHCVS QWGHDFRPDY LTLSVLHERW PGVPRIALTA
TATSATRAEI STRLQLTSAR HFVASFDRPN IQYRIVPKRE PKRQLLALLR DEHPGDAGIV
YCLSRATVEK TAEFLVDNGI AALPYHAGLD AATRARHQQR FLREDGLVMV ATIAFGMGID
KPDVRFVAHL DLPKSVEGYY QETGRAGRDG LPSTAWLAYG LTDVVQQRRL IDTSEGDLAH
RRNLAAHLEA MLALCETVRC RRVQLLDYFG ETATACGNCD TCLQPPESWD GTIAAQKLLS
TVYRLDRERH QRFGTGHCVD ILLGRATDKV QQHRHDSLTV FGIGTELSEA EWRGVVRQLL
AEGLLAVEGD YGTLALTDTS AEVLGRRRTV MLRREPARTA RPAKPRGAAT MVAELAPAAA
EVFERLRAWR AATAREQGVP AYVIFHDATL RQIASDAPSA LADLARVSGV GEAKLAKYGE
QVLAVLAGGD ADPHTA