Gene Sare_1438 details


Gene Information

Locus tag: Sare_1438
Symbol:
ID: 5708061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1664025
End bp: 1665752
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 73%
IMG OID: 641270947
Product: type III restriction protein res subunit
Protein accession: YP_001536328
Protein GI: 159037075
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
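
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1664025
END_BP = 1665752

# Assumption: exactly one terminal stop codon, not counted as a residue.
STOP_CODON_COUNT = 1


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - STOP_CODON_COUNT


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the result is 1728 bp and 575 aa, matching the Gene Length and Protein Length fields.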


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00286846
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCAGCCC GGACGCCAAC GCTCGACACG TTCCCGGCCC TGCGTGCCTG GCAGCGCAGG 
GCGCTGGTGG AGTACCTGCG CCGGCGGGAG CCGGACTTCA CGGCGGTGGC CACGCCGGGC
GCCGGCAAGA CCACCTTCGC CCTTCGGGTC GCCGCCGAGT TGCTCGCCGA CGGGAGCGTC
GAGGCGGTCA CCGTGGTCGC GCCGACCGAG CACCTCAAGG CCCAGTGGGC GCAGGCCGCC
GCCCGGGTCG GCATCCAACT CGACGCGGCC TTCCGTAACG CCGACCTGCA CTCGTCGGCC
GACTTCCACG GGGCCGTGGT CACCTACGCC CAGGTCGGGA TGGCGCCGCA GGTGCACCGG
CGGCGTACCC TGACCCGGCG CACCCTGGTC GTCCTCGACG AGATCCACCA TGCCGGCGGT
TCCCGGACCT GGGGTGACGG TGTGCAGGCG GCCTTCGAAG GCGCGGAGCG CCGGCTCATG
CTCACCGGCA CGCCGTTCCG TTCCGATGAC AACCCGATTC CGTTCGTCAG CTACGAGCGG
GGCGGCGACG GCCTGCTGCG CTCCCGCGCC GACTCTGTCT ACGGCTATGC CGACGCGCTG
CACGACGGCG TGGTCCGCCC GGTGCTCTTC CTCGCCTACT CGGGGGAGAC CCGGTGGCGG
ACGAATGCCG GTGAGGAACT GGCCGCCCGG CTCGGCGAGC CGATGACCCA GGATCTGATC
GCGCAGGCGT GGCGGACCGC GCTCGACCCG GCCGGCGACT GGATGCCGCA GGTGCTGCGG
GCGGCGGACG CCCGGCTGAC CGTGCTCCGC AACGCCGGGA TGCCCGACGC CGCCGGCCTG
GTGATCGCCA GCGACCAGCA GACCGCCCGT TCGTACGCGA AGCTCATCGA GCAGGTGACC
GGCGAGAAGG CCGCCGTGGT GCTCTCCGAC GACGCGGGTG CCTCGGCCCG GATCGCGACG
TTCGCGACCG CCGAGCAGCG TTGGCTGGTG GCGGTCCGGA TGGTTTCCGA AGGCGTGGAC
ATCCCCCGTC TCGCCGTCGG TGTCTACGCC ACCAGCGCCA GCACCCCGCT CTACTTCGCC
CAGGCCATCG GGCGGTTCGT CCGGGCACGG CGGCCGGGGG AGACGGCATC GGTGTTCCTG
CCCAGCGTCC CACACCTGCT CGGCCTCGCC AGCGAGATGG AAGCCGAGCG GGACCATGTG
CTGGGTAAAC CGAAGGACCA GGACGGTTTC GACGACGACC TGCTGGAGCG CGCCCAACGG
GACGACCAGG CCAGCGGTGA ACTGGAGAAG CGGTACGCCG CGCTCTCCGC GACCGCCGAG
CTGGATCAGG TGATCTTCGA CGGCGCGTCG TTCGGCACCG CTGCCCAGGC CGGTACGCCC
GAGGAGGAGG AGTATCTCGG CCTCCCCGGG CTGCTCACCG CCGACCAGGT GGCCATGCTG
CTGTCCAAGC GGCAGGCCGA GCAACTGGCC GCGTCGCGGC GCAGGACCGC CGCCCGGCCC
GTTGAACCGG CCGCGACGAC CGCGCCACCG GCACCGATGA GTGCGGCGCA ACGCCGGGTG
GCACTGCGCC GACAGTTGAA CGCCCTGGTG GCCGCCCGAC ACCACCACAC CGGTCAGCCG
CACGGCAAGA TCCACGCAGA GCTGCGCCGC CGCTGCGGCG GCCCGCCCAG TGCCCAGGCG
ACGATCGAGC AGCTGGAGGA ACGGATCGCC ACGGTGCAGA CCCTCTGA
 
Protein sequence
MAARTPTLDT FPALRAWQRR ALVEYLRRRE PDFTAVATPG AGKTTFALRV AAELLADGSV 
EAVTVVAPTE HLKAQWAQAA ARVGIQLDAA FRNADLHSSA DFHGAVVTYA QVGMAPQVHR
RRTLTRRTLV VLDEIHHAGG SRTWGDGVQA AFEGAERRLM LTGTPFRSDD NPIPFVSYER
GGDGLLRSRA DSVYGYADAL HDGVVRPVLF LAYSGETRWR TNAGEELAAR LGEPMTQDLI
AQAWRTALDP AGDWMPQVLR AADARLTVLR NAGMPDAAGL VIASDQQTAR SYAKLIEQVT
GEKAAVVLSD DAGASARIAT FATAEQRWLV AVRMVSEGVD IPRLAVGVYA TSASTPLYFA
QAIGRFVRAR RPGETASVFL PSVPHLLGLA SEMEAERDHV LGKPKDQDGF DDDLLERAQR
DDQASGELEK RYAALSATAE LDQVIFDGAS FGTAAQAGTP EEEEYLGLPG LLTADQVAML
LSKRQAEQLA ASRRRTAARP VEPAATTAPP APMSAAQRRV ALRRQLNALV AARHHHTGQP
HGKIHAELRR RCGGPPSAQA TIEQLEERIA TVQTL
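
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons by hand. The sketch below is a minimal example, not the pipeline that produced this record. It assumes the amino acid assignments of translation table 11 (identical to the standard code for all 64 codons) and that the first codon, GTG, is read as an initiator methionine. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS fragment that begins at the start codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Translation stops at the first stop codon or at the last full codon.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            # Assumption: the first codon is a valid start and is read as Met.
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "GTGGCAGCCC GGACGCCAAC GCTCGACACG"
print(translate(first_bases))
```

For these 30 bases the sketch yields MAARTPTLDT, which matches the first ten residues of the protein sequence.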