Gene Sare_3092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3092 
Symbol 
ID5706827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3512609 
End bp3515434 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content68% 
IMG OID641272528 
Productputative helicase 
Protein accessionYP_001537896 
Protein GI159038643 
COG category[L] Replication, recombination and repair 
COG ID[COG1112] Superfamily I DNA and RNA helicases and helicase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.209687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0418433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGATCA ACGAGTTCGG CAGGTTGGCG TCGTCGAATG CGCTGCCGGC CCGGGTCGAG 
CCGGACCGCT CCCGACTGGT CGTCTACGGC GAGCGCTGGG CAGCCTGGCT CTACCCCACC
AACCACGGTG ACGCGTACCA GCTCGGACAC GTCGCACCGC TCTCCTTCAA GGACCAGGAA
CGCCTTCTCA AGGCGGCGCT GGTCCTCAGC GGTACGAGGG GCTGGTGGGC CTACCACCAC
GTCCGGGACG TGCCGCAGCG GTCGTCGTCA CACTGGCCGC TGCTCTGCCG GGCCTGGGCT
GATCTCGCCG CCCCCCGTCC GGGCGCGACA CCACCGATGC CAGCACATCA TGTCGAGTAC
CTTGACCTGC TCACCGAGGT CATTGAGGCA ACCCGGGATA TCGAGATCGC GGGGCAAAGT
CGGGAGCCGG CCCTGCCGTA CCGGAACCTG GCCAGCACCC GTGAGGAGCG GTACTCCGCC
CGCGGCGTCT ACGCCTTCCA CCTGCTCCGT GACGCCACCG TGGGCCGAGG TGCACTGGTC
TTCGTCACCG ATCAACCCGA CCTACGCGGC CGGGTGCTGC GGATCAAGGA CCGGGAGATC
ACCGTAAGAT TCGACGACAC CGTCGACTAC GCCCGCATCC CCCAGCAAGG TGCGTTGCAG
GTGCTGCCCA GCGACCGGGT CTACCGTGCG CAGCTCGACG CGGTGGAGAC TCTCCGGGAA
CAACGTGCCA CCCAGCCCCA CCTGCTCACC CAGTTGGTCG ACCAGCCTCT CGCGCCGTAT
CAGCCGGACT CCGATGCGCA GCCGCGCGAG ACGCTCGACC CGACACAGCT GCATGCCTTC
CGCGCTGCCC TGACTGTGCC CGATCTGCTG TTGGTGCTCG GACCGCCCGG CACCGGGAAG
ACCCGCACCA TCACCGAGAT CGCCGCCGGG TGCGCCGAAG GTGGGCAGCG GGTACTGGTC
ACCTCCCACA CCAACCGCGC CGTCGACAAC GTGCTCGAAC GACTACCGCC GGATGTCCGG
GCAGTCCGGG TCGGCAACGA GGACACCATG ACCACGCACG CACGTGGGTT CATGGTGGAG
ACGCAGGTCG AAGCATTACG GCAGGAGATT CTCGCGGCAA CCGCAGGTAC GGCTTCCCGC
TTGGCTACCT TCACCGGCAC CGATGATCAG GCTGGGCGGT GGCTCGGGTA CCTGGCCGCG
CGGCTAGCGG AGGCCCAGGG CGCGAACCGG GACGTCCAGG TCCGCACCGC CGAGTTGGCA
GCCGCGGCCG AGCGGGCAAT GGCACCCTTC GCCGCACAGC TCGCTGCCGC CGATCAACGA
GCACGGCAGT CACGTGAGAG TGTGGTACCG CTCGCCGAAA GGCACCGCCA CCATGAACGG
CAGACCGAGA CGGCACGGCG GCGAGCCGCC TCAGGAGCGC CGGCTTTCTT CTTCCGCTGG
CTCGCCGACC GCCGAGAACG GCATCTCGCC GCGATCTCCG AGCAGCTCAA CGCCGCGCGG
ACGGCAATGC GACTGGCCTT GGAGTCGTAT GCTGCGGTTC GTGCCCGAGC CGACGCGCTG
GTCGCTGCCG ACCCGGGAGT TCGGGCAATC ACCAGTGCCC GCGACGGTGC CTCCAAGGTA
TATGCGAAGG CGCTGCGCGA CGCCGCTCGG GCAGTGGAAG CAGCGCGCGA GGTGCTGCGC
CCAGCGGTCG CAGTGCCCGG TGGGGTGCCT GACGATCTGG CCGGCTGGAC GCGGTTATAC
GAGGAGCTGA TGTCGGCGGC GGCCCTGGCC CGCAACCGGG CCGGGCTACT CGCGCAGTGG
CGTGAGCAGG TGGCCGTCGC CGAGCAGGAC CTGCATCGAG AGCTGGTCCG GTACGCGGAT
GTGGTGGCGG CCACCTGCAT CGGCACCGCC ACCACCACCC TGCTCGCCGA GCTGGAATTC
GATGTGGCGA TCGTCGACGA GGCCGGGCAG ATCTCCACGC CGAACCTGTT GGTGCCGCTG
GTAAGGGCCC GCCGGGCGGT GCTGGTCGGC GACCACAATC AACTGCCACC CTTCCTCGAC
GACGAGGTAC GCAGCTGGGC CGACCGTATC GCCTCTGATA GGCCCCCTGA GGTCGCGACA
TTGGTCGGCG ACGTGCTGCG GCGCAGCGCC TTCGAACGGC TGTATCCCCG GTTGGCAGAC
ACCAACCGGG TCATGTTGCG GGTGCAGCGA CGCATGCCGG CGGAGCTGGC CCAGTTCGTG
TCGAACGCCT TCTACCGGGG ACTATTGGAG ACCGATCATC CCGGCGGTCC GCCCGACCCG
GTGTTCAGCG CGCCACTGGC GATGATCGAC ACGTCGGACC AGCCAGCGAC GCGACGGAGG
GAGCGGCCCG ACCGCTCGGT GAATGGGCTG GTGCGACCCG GCTACGTCAA CGATCTGGAG
GCCAGGCTGA TCGTCCAGCT GTTGGGCCGG AACGCCACAC GGTACGTCGA CTGGGCGGTC
ATCGTCCCGT ATCGGGCCCA GGCGGAGCTG ATCACGCAGC TACTCAGAAA AGAGCTTGGT
GACGCCGGGG TGGCCGACAA CGTGGGCACC GTCGACTCGT TCCAGGGCGG TGAACGGGAT
CTGATTGTCT ACGGGTTCAC CCGCAGCAAC CACCGGGGCC AGATCGGCTT TCTCACCGAA
CTCCGACGCC TCAACGTGGC AATCACCCGA CCCCGGCGTC AACTCGTCCT GGTGGGCGAT
ACGACTACCT TGCGTGCCGC ACGGGACCCG GGATTCGCCG CGTTGATCCA ATCGTTGATC
GCCCACCTCG ACGCGGTGGG TGACCGCCGA CCCTCCCGGG AGATCGAGGG GGTGCCCAAT
GACTGA
 
Protein sequence
MLINEFGRLA SSNALPARVE PDRSRLVVYG ERWAAWLYPT NHGDAYQLGH VAPLSFKDQE 
RLLKAALVLS GTRGWWAYHH VRDVPQRSSS HWPLLCRAWA DLAAPRPGAT PPMPAHHVEY
LDLLTEVIEA TRDIEIAGQS REPALPYRNL ASTREERYSA RGVYAFHLLR DATVGRGALV
FVTDQPDLRG RVLRIKDREI TVRFDDTVDY ARIPQQGALQ VLPSDRVYRA QLDAVETLRE
QRATQPHLLT QLVDQPLAPY QPDSDAQPRE TLDPTQLHAF RAALTVPDLL LVLGPPGTGK
TRTITEIAAG CAEGGQRVLV TSHTNRAVDN VLERLPPDVR AVRVGNEDTM TTHARGFMVE
TQVEALRQEI LAATAGTASR LATFTGTDDQ AGRWLGYLAA RLAEAQGANR DVQVRTAELA
AAAERAMAPF AAQLAAADQR ARQSRESVVP LAERHRHHER QTETARRRAA SGAPAFFFRW
LADRRERHLA AISEQLNAAR TAMRLALESY AAVRARADAL VAADPGVRAI TSARDGASKV
YAKALRDAAR AVEAAREVLR PAVAVPGGVP DDLAGWTRLY EELMSAAALA RNRAGLLAQW
REQVAVAEQD LHRELVRYAD VVAATCIGTA TTTLLAELEF DVAIVDEAGQ ISTPNLLVPL
VRARRAVLVG DHNQLPPFLD DEVRSWADRI ASDRPPEVAT LVGDVLRRSA FERLYPRLAD
TNRVMLRVQR RMPAELAQFV SNAFYRGLLE TDHPGGPPDP VFSAPLAMID TSDQPATRRR
ERPDRSVNGL VRPGYVNDLE ARLIVQLLGR NATRYVDWAV IVPYRAQAEL ITQLLRKELG
DAGVADNVGT VDSFQGGERD LIVYGFTRSN HRGQIGFLTE LRRLNVAITR PRRQLVLVGD
TTTLRAARDP GFAALIQSLI AHLDAVGDRR PSREIEGVPN D