Gene Sare_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0450 
Symbol 
ID5705320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp516394 
End bp518040 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content72% 
IMG OID641269975 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_001535370 
Protein GI159036117 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000480349 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGAAC TCACCCACAA CCTCATGGAC GGCACCGAAC TCGCCGCCAC CGCACCAGTA 
AGCCCGGAAG CGCCTACCTT CGCCGAGTTG GGCGCACGTC AGGAGACCGT CGACGCGCTG
GCCGCGGCCG GCATCACCCG GGCCTTCGCC ATCCAGGAGT ACGCACTGCC GATCGCGTTG
CGCGGCGTCG ACCTGATCGG CCAAGCCCCG ACCGGTACCG GCAAGACCCT TGGCTTCGGC
GTCCCACTGC TGGAGCAGGT CCTCGCTCCC GCTGAGGGCG GCGACGGCAC CCCGCAGGCG
CTGGTCGTCG TTCCGACCCG CGAGCTGGGC ATCCAGGTCG CCAAGGACCT CCAGGCCGCG
GGAAGCACCC GTGGTGTCCG AGTACTACCG ATCTACGGCG GGGTGGCGTA CGAGCCACAG
ATCGAGGCGC TCCGCTCGGG TGTCGAGATC CTGGTCGGCA CCCCCGGCAG GCTACTCGAC
CTGGCCAAGC AGAAGCACCT GAAGCTCGAT CGGGTTCGCG CACTCGTCCT GGACGAGGCC
GACCGAATGC TCGACCTGGG TTTCCTCGAC GACGTCGAGA GGATCCTGGC GATACTGCCG
GAGGACCGGC AGACGATGCT CTTCTCGGCG ACCATGCCGG ACCCGATCGT CGCGCTGTCC
CGGCGCTTCC TGCGCCGGCC GGTGACAATC CACGCCGGGC ACACCGCCGA GACCGGCCCC
TCACCCCAGA CCCAGCAGTT GGCCTACCGC ACCCACTCAC TTAACAAGAT CGAGATTGTG
GCGCGGATCC TCCAGGCCAG GGGGCGCGGA CTGACCATGA TCTTCACCCG GACCAAGCGG
GCGGCCGACC GGGTAGCAGC AGACCTGGAC TTCCGTGGAT TCGCCGTGGC CGCCGTGCAC
GGCGACCTCG GGCAGGGCGC GCGGGAGCGG GCGCTGCGGG CGTTCCGCAC GGGCAAGATC
GACACTCTGG TCGCCACCGA CGTGGCCGCC CGGGGCATTG ACGTCAGCGG CGTCACCCAC
GTCCTCAACT ACGACTGTCC GGAAGACCAG GACACATACA CCCACCGGAT CGGCCGGACC
GGGCGGGCGG GGGCGAGCGG CGTCGCGGTG ACCTTCGTCG ACTGGGACGA CATGCCGCGC
TGGCGGATCA TCGACAAGAC CCTCGGCCTG GACATGCCGG AGCCGCCGGA GACCTACCAC
ACCTCCCCGC ACCTCTATGC GGACCTTGAC ATCTCCCCTG AGGTCACCGG CACCCTGCCA
ACCGGCGCAC GAACCCGGGC CGGACTCTCC GCCGAGGTCG AGGAAGACCT CGGCGGGCGA
GCCCGCCGGG GCGATAGCCG GGGCACCCGC CGCGGCGCAG GCCGCAACCG CCGCCGGGAC
CGCGGCGGGG ACGCCGGACG TGGTGCTGCC GCGACCACGG AGCCGGCCGA GGCAACCGAG
ACCGCGGAGC GCCCACCCCG CCGACGGCAA CGCCGCCGGG GCGGCGAGGT GGTGTCCAGC
GGCCAACCCG CGGTGACCAC CGCCGAGACC GACGAGCCGG CTGTCGTCGA CCCGCAGGGG
GCAGCGGCGC CGAAGCCACG GCGGCGTCGG CGTCGCCGTG GTGGTGGCTC GGGCACCGGT
GCGGCGGCCG GGACAACCAC CGACTGA
 
Protein sequence
MSELTHNLMD GTELAATAPV SPEAPTFAEL GARQETVDAL AAAGITRAFA IQEYALPIAL 
RGVDLIGQAP TGTGKTLGFG VPLLEQVLAP AEGGDGTPQA LVVVPTRELG IQVAKDLQAA
GSTRGVRVLP IYGGVAYEPQ IEALRSGVEI LVGTPGRLLD LAKQKHLKLD RVRALVLDEA
DRMLDLGFLD DVERILAILP EDRQTMLFSA TMPDPIVALS RRFLRRPVTI HAGHTAETGP
SPQTQQLAYR THSLNKIEIV ARILQARGRG LTMIFTRTKR AADRVAADLD FRGFAVAAVH
GDLGQGARER ALRAFRTGKI DTLVATDVAA RGIDVSGVTH VLNYDCPEDQ DTYTHRIGRT
GRAGASGVAV TFVDWDDMPR WRIIDKTLGL DMPEPPETYH TSPHLYADLD ISPEVTGTLP
TGARTRAGLS AEVEEDLGGR ARRGDSRGTR RGAGRNRRRD RGGDAGRGAA ATTEPAEATE
TAERPPRRRQ RRRGGEVVSS GQPAVTTAET DEPAVVDPQG AAAPKPRRRR RRRGGGSGTG
AAAGTTTD